Google / Gemma 4 E4B

Quantizations

Quantized by     Size     Decode        Prefill        Score
MLX Community    4.9 GB   24.1 tok/s    422.0 tok/s    Runs ok
MLX Community    8.3 GB   N/A           N/A            N/A

Device Comparison

Results include only trials with 4,096 input tokens and 1,024 output tokens.
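
As a rough illustration of what these figures imply, the sketch below estimates wall-clock time for one trial of the stated shape, using the decode and prefill rates listed for the 4.9 GB quantization. It assumes those rates were measured on this trial shape and stay constant for the whole run, which the page does not confirm. The function name `estimate_seconds` is hypothetical.

```python
# Rates copied from the 4.9 GB row of the Quantizations table.
PREFILL_TOK_PER_S = 422.0
DECODE_TOK_PER_S = 24.1

# Trial shape stated for the device comparison.
INPUT_TOKENS = 4096
OUTPUT_TOKENS = 1024


def estimate_seconds(input_tokens, output_tokens, prefill_rate, decode_rate):
    """Return (prefill_s, decode_s, total_s), assuming constant throughput.

    This is a simplification: real decode speed usually varies with
    context length, and fixed overheads such as model load are ignored.
    """
    prefill_s = input_tokens / prefill_rate
    decode_s = output_tokens / decode_rate
    return prefill_s, decode_s, prefill_s + decode_s


prefill_s, decode_s, total_s = estimate_seconds(
    INPUT_TOKENS, OUTPUT_TOKENS, PREFILL_TOK_PER_S, DECODE_TOK_PER_S
)
print(f"prefill: {prefill_s:.1f} s, decode: {decode_s:.1f} s, total: {total_s:.1f} s")
# prints "prefill: 9.7 s, decode: 42.5 s, total: 52.2 s"
```

Under these assumptions, decode accounts for most of the estimated time, even though the input is four times longer than the output.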

Decode / Prefill Speeds

2 devices