Qwen
Qwen
/Qwen3.6-35B-A3B

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
Unsloth
Unsloth
9.4 GB32.3 tok/s349.6 tok/sRuns ok
Unsloth
Unsloth
10.0 GB27.5 tok/s321.9 tok/sRuns ok
Unsloth
Unsloth
10.7 GBN/AN/AN/A
Frank Denis
Frank Denis
14.1 GBN/AN/AN/A
MLX Community
MLX Community
19.0 GBN/AN/AN/A
MLX Community
MLX Community
19.0 GB66.7 tok/s608.3 tok/sRuns ok
MLX Community
MLX Community
19.2 GBN/AN/AN/A
Unsloth
Unsloth
19.5 GBN/AN/AN/A
LM Studio
LM Studio
19.7 GBN/AN/AN/A
Unsloth
Unsloth
35.8 GBN/AN/AN/A

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

20 devices