Full GPU

Best variant: FP16

Full GPU inference — 192 GB VRAM meets the 62.4 GB recommendation.

GPU VRAM: 192 GB
Min VRAM (best fit): 55.2 GB
Recommended VRAM: 62.4 GB
Estimated tok/s: ~16.7


Every Devstral Small 2 24B quantization on Apple M3 Ultra

Each row runs the compatibility engine against your VRAM, RAM, and the model's requirements.

Quantization     File Size  Min VRAM  Rec VRAM  Context  Verdict   Estimated tok/s
Q4_K_M           12 GB      13.8 GB   15.6 GB   8K / 8K  Full GPU  ~53.3
Q5_K_M           15 GB      17.3 GB   19.5 GB   8K / 8K  Full GPU  ~46.4
Q8_0             24 GB      27.6 GB   31.2 GB   8K / 8K  Full GPU  ~31.7
FP16 (best fit)  48 GB      55.2 GB   62.4 GB   8K / 8K  Full GPU  ~16.7

Apple M3 Ultra is a solid pick for Devstral Small 2 24B

Need a second card or a fresh build? These links help support the site at no extra cost.
