Meta/Llama 3.2 3B Instruct

Quantizations

Quant	Quantized by	Size	Decode	Prefill	Score
4bit	MLX Community	1.7 GB	68.4 tok/s	652.0 tok/s	Runs well

Results include only trials with 4,096 input tokens and 1,024 output tokens.
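To put the throughput figures in context, the sketch below turns them into a rough wall-clock estimate for one trial of the stated size. It assumes the listed rates are constant averages and that prefill and decode run back to back with no other overhead; neither assumption is confirmed by the listing, and the function name `estimate_seconds` is made up for illustration.

```python
def estimate_seconds(input_tokens, output_tokens, prefill_tps, decode_tps):
    """Rough time estimate: prompt processing followed by token generation.

    Assumes constant throughput and ignores model load time and any
    other overhead (an assumption, not something the listing states).
    """
    prefill_s = input_tokens / prefill_tps   # time to process the prompt
    decode_s = output_tokens / decode_tps    # time to generate the output
    return prefill_s, decode_s, prefill_s + decode_s


# Figures from the 4bit row: 652.0 tok/s prefill, 68.4 tok/s decode,
# with 4,096 input tokens and 1,024 output tokens per trial.
prefill_s, decode_s, total_s = estimate_seconds(4096, 1024, 652.0, 68.4)
print(f"prefill: {prefill_s:.1f} s, decode: {decode_s:.1f} s, total: {total_s:.1f} s")
```

Under these assumptions a trial takes roughly 21 seconds, most of it spent in decode even though the output is a quarter the length of the input.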

5 devices