Best reasoning models for local use
Reasoning-first models for math, planning, and slower, deeper thinking.
46 models in this collection.
Kimi K2.5
Kimi · 1000B
GLM 5
GLM · 744B
DeepSeek V3.2
DeepSeek · 685B
DeepSeek R1 671B
DeepSeek · 671B
GLM 4.7
GLM · 355B
MiMo V2 Flash
MiMo · 309B
Llama 3.1 Nemotron Ultra 253B
Nemotron · 253B
Qwen3 235B A22B
Qwen · 235B
MiniMax M2.5
MiniMax · 230B
Step 3.5 Flash
Step · 196B
Devstral 2 123B
Mistral · 123B
GPT-OSS 120B
GPT-OSS · 120B
Qwen3 Coder Next 80B A3B
Qwen · 80B
DeepSeek R1 Distill Llama 70B
DeepSeek · 70B
Best for Enterprise-grade local pilots
Qwen3.5 35B A3B
Qwen · 35B
Best for RTX 5090- and 4090-class systems
Qwen3.6 35B A3B
Qwen · 35B
DeepSeek R1 Distill Qwen 32B
DeepSeek · 32B
Best for High-value reasoning tasks
Qwen3 32B
Qwen · 32B
QwQ 32B
Qwen · 32B
Best for Expert analytical users
Gemma 4 31B
Gemma · 31B
Qwen3 30B A3B
Qwen · 30B
Qwen3 Coder 30B A3B
Qwen · 30B
Qwen3.5 27B
Qwen · 27B
Best for High-capacity local APIs
Gemma 4 26B A4B
Gemma · 26B
Devstral Small 2 24B
Mistral · 24B
Mistral Small 24B
Mistral · 24B
Mistral Small 3.1 24B
Mistral · 24B
GPT-OSS 20B
GPT-OSS · 20B
DeepSeek R1 Distill Qwen 14B
DeepSeek · 14B
Best for Reasoning-intensive workflows
Phi-4 14B
Phi · 14B
Best for Analysis-heavy assistants
Phi-4 Reasoning 14B
Phi · 14B
Phi-4 Reasoning Plus 14B
Phi · 14B
Qwen3 14B
Qwen · 14B
Qwen3.5 9B
Qwen · 9B
Best for Upgraded general local assistant
DeepSeek R1 Distill Llama 8B
DeepSeek · 8B
Qwen3 8B
Qwen · 8B
DeepSeek R1 Distill Qwen 7B
DeepSeek · 7B
Best for Reasoning tasks
Gemma 4 E4B
Gemma · 4.5B
Best for On-device multimodal assistants
Qwen3 4B
Qwen · 4B
Qwen3.5 4B
Qwen · 4B
Phi-4 Mini 3.8B
Phi · 3.8B
Best for Low-VRAM devices
SmolLM3 3B
SmolLM · 3B
Gemma 4 E2B
Gemma · 2.3B
Qwen3 1.7B
Qwen · 1.7B
DeepSeek R1 Distill Qwen 1.5B
DeepSeek · 1.5B
Qwen3 0.6B
Qwen · 0.6B