Google
Google
/Gemma 4 26B A4B IT

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
Unsloth
Unsloth
10.4 GBN/AN/AN/A
MLX Community
MLX Community
13.8 GBN/AN/AN/A
MLX Community
MLX Community
14.5 GBN/AN/AN/A
MLX Community
MLX Community
14.5 GBN/AN/AN/A
Unsloth
Unsloth
15.0 GBN/AN/AN/A
LM Studio
LM Studio
15.6 GBN/AN/AN/A
ggml
ggml
15.6 GB57.0 tok/s700.5 tok/sRuns well
Unsloth
Unsloth
15.7 GBN/AN/AN/A
Unsloth
Unsloth
16.0 GB48.5 tok/s705.5 tok/sRuns ok
MLX Community
MLX Community
17.4 GBN/AN/AN/A
Bartowski
Bartowski
21.3 GB46.6 tok/s682.1 tok/sRuns well
MLX Community
MLX Community
25.0 GB43.7 tok/s699.4 tok/sRuns ok
MLX Community
MLX Community
26.0 GBN/AN/AN/A

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

28 devices
Gemma 4 26B A4B IT by Google | whatcani.run