Skip to main content
Full GPU

Best variant: Q5_K_M

Full GPU inference — 24 GB VRAM meets the 16.3 GB recommendation.

GPU VRAM
24 GB
Min VRAM (best fit)
14.4 GB
Recommended VRAM
16.3 GB
Estimated tok/s
~65.1

Share this matchup

Send this page so a friend can see if NVIDIA GeForce RTX 3090 fits GPT-OSS 20B.

Every GPT-OSS 20B quantization on NVIDIA GeForce RTX 3090

Each row runs the compatibility engine against your VRAM, RAM, and the model's requirements.

QuantizationFile SizeMin VRAMRec VRAMContextVerdictEstimated tok/s
Q4_K_M10 GB11.5 GB13 GB8K / 8KFull GPU~74.9
Q5_K_MBest fit12.5 GB14.4 GB16.3 GB8K / 8KFull GPU~65.1
Q8_020 GB23 GB26 GB8K / 8KPartial GPU~31.2
FP1640 GB46 GB52 GB8K / 8KHybrid CPU+GPU~7

NVIDIA GeForce RTX 3090 is solid pick for GPT-OSS 20B

Need second card or fresh build? These links help support site at no extra cost.

All hardware for GPT-OSS 20BBest GPU for GPT-OSS 20BModels that fit NVIDIA GeForce RTX 3090Full model detailsBrowse all models