Qwen/Qwen3.5-0.8B

Quantizations

Quantized by     Size        Decode         Prefill          Score
Unsloth          322.6 MB    75.5 tok/s     2,137.9 tok/s    Runs great
Unsloth          448.4 MB    72.1 tok/s     2,228.2 tok/s    Runs great
Unsloth          507.8 MB    101.9 tok/s    2,985.9 tok/s    Runs great
MLX Community    570.4 MB    166.9 tok/s    2,512.2 tok/s    Runs great
MLX Community    596.3 MB    213.9 tok/s    2,913.2 tok/s    Runs great
Unsloth          774.2 MB    108.8 tok/s    4,028.9 tok/s    Runs great
MLX Community    954.8 MB    168.6 tok/s    2,202.7 tok/s    Runs great

Device Comparison

Results include only trials with 4,096 input tokens and 1,024 output tokens.
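
As a rough illustration of what these trial sizes mean in wall-clock terms, the sketch below adds the time to process the 4,096 input tokens at the prefill speed to the time to generate the 1,024 output tokens at the decode speed. The function name `estimate_trial_seconds` is hypothetical. The estimate assumes prefill and decode run back to back at constant speed and ignores model load time and other overhead. The example speeds are taken from the 596.3 MB row of the quantization listing purely for illustration; the page does not say whether that row was measured with this trial shape.

```python
def estimate_trial_seconds(prefill_tok_s, decode_tok_s,
                           input_tokens=4096, output_tokens=1024):
    """Rough estimate: prefill time plus decode time, ignoring all overhead."""
    if prefill_tok_s <= 0 or decode_tok_s <= 0:
        raise ValueError("speeds must be positive")
    prefill_seconds = input_tokens / prefill_tok_s   # time to read the prompt
    decode_seconds = output_tokens / decode_tok_s    # time to generate the reply
    return prefill_seconds + decode_seconds


# Illustrative speeds (tok/s) from the 596.3 MB row; an assumption, not a
# statement about how that row was benchmarked.
total = estimate_trial_seconds(prefill_tok_s=2913.2, decode_tok_s=213.9)
print(f"{total:.2f} s")
```

With these trial sizes decode usually dominates the total, so decode speed is the more useful column when comparing rows for long generations.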

Decode / Prefill Speeds

63 devices