Skip to main content
Full GPU

Best variant: Q8_0

Full GPU inference — 96 GB VRAM meets the 40 GB recommendation.

GPU VRAM
96 GB
Min VRAM (best fit)
35 GB
Recommended VRAM
40 GB
Estimated tok/s
~11.5

Share this matchup

Send this page so a friend can see if Apple M2 Max fits Gemma 4 31B.

Every Gemma 4 31B quantization on Apple M2 Max

Each row runs the compatibility engine against your VRAM, RAM, and the model's requirements.

QuantizationFile SizeMin VRAMRec VRAMContextVerdictEstimated tok/s
Q3_K_M14.5 GB16.5 GB20 GB8K / 256KFull GPU~19
Q4_K_M18.4 GB20.5 GB24 GB8K / 256KFull GPU~17.4
Q8_0Best fit33.2 GB35 GB40 GB8K / 256KFull GPU~11.5

Apple M2 Max is solid pick for Gemma 4 31B

Need second card or fresh build? These links help support site at no extra cost.

All hardware for Gemma 4 31BBest GPU for Gemma 4 31BModels that fit Apple M2 MaxFull model detailsBrowse all models