Qwen
Qwen
/Qwen3-Coder-Next

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
MLX Community
MLX Community
41.8 GB47.1 tok/s648.5 tok/sRuns poorly
Unsloth
Unsloth
45.2 GBN/AN/AN/A

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

3 devices