whatcani.run
Home
Models
Device
Runs
Docs
GitHub
whatcani.run
Qwen
Qwen
Qwen
/
Qwen3.6-35B-A3B
35B (3B active)
April 16, 2026
Apache 2.0
Share
Overview
Runs
Quantizations
Quant
Quantized by
Size
Decode
Prefill
Score
Actions
ud-iq1_m
Unsloth
Unsloth
9.4
GB
32.3
tok/s
349.6
tok/s
Runs ok
Run
ud-iq2_xxs
Unsloth
Unsloth
10.0
GB
27.5
tok/s
321.9
tok/s
Runs ok
Run
ud-iq2_m
Unsloth
Unsloth
10.7
GB
N/A
N/A
N/A
Run
3bit
Frank Denis
Frank Denis
14.1
GB
N/A
N/A
N/A
Run
nvfp4
MLX Community
MLX Community
19.0
GB
N/A
N/A
N/A
Run
4bit
MLX Community
MLX Community
19.0
GB
66.7
tok/s
608.3
tok/s
Runs ok
Run
4bit-dwq
MLX Community
MLX Community
19.2
GB
N/A
N/A
N/A
Run
ud-q4_k_s
Unsloth
Unsloth
19.5
GB
N/A
N/A
N/A
Run
q4_k_m
LM Studio
LM Studio
19.7
GB
N/A
N/A
N/A
Run
ud-q8_k_xl
Unsloth
Unsloth
35.8
GB
N/A
N/A
N/A
Run
Device Comparison
Results include trials with
4,096
input tokens and
1,024
output tokens only.
Decode / Prefill Speeds
20 devices
All quants
M1 Max
·
64 GB
M1 Max
·
64 GB
whatcani.run
Home
Models
Device
Runs
Docs
GitHub
Login
whatcani.run