Skip to main content

36 models in this collection.

Gemma 4 26B A4B

Gemma · 26B

13.3 GB
chatgeneralmultimodalreasoning+2
VRAM15 GB
RAM18 GB
QuantQ3_K_M

Gemma 4 31B

Gemma · 31B

14.5 GB
chatgeneralmultimodalreasoning+1
VRAM16.5 GB
RAM20 GB
QuantQ3_K_M

Command R 35B

Command · 35B

20 GB
chatragtool-usemultilingual
VRAM22 GB
RAM24 GB
QuantQ4_K_M

Command R+ 104B

Command · 104B

48 GB
chatragtool-usemultilingual+1
VRAM50 GB
RAM56 GB
QuantQ3_K_M

DeepSeek Coder V2 Lite 16B

DeepSeek · 16B

9.5 GB
codingmoechat
VRAM11 GB
RAM16 GB
QuantQ4_K_M

DeepSeek R1 671B

DeepSeek · 671B

240 GB
reasoningmoefrontierchat
VRAM245 GB
RAM260 GB
QuantQ2_K

DeepSeek R1 Distill Llama 70B

DeepSeek · 70B

33 GB

Best for Enterprise-grade local pilots

reasoningdistillchat
VRAM35 GB
RAM40 GB
QuantQ3_K_M

DeepSeek R1 Distill Llama 8B

DeepSeek · 8B

4.9 GB
reasoningdistillchat
VRAM5.5 GB
RAM8 GB
QuantQ4_K_M

DeepSeek R1 Distill Qwen 1.5B

DeepSeek · 1.5B

1 GB
reasoningdistillsmallchat
VRAM2 GB
RAM4 GB
QuantQ4_K_M

DeepSeek R1 Distill Qwen 14B

DeepSeek · 14B

8.7 GB

Best for Reasoning-intensive workflows

reasoningdistillchat
VRAM10 GB
RAM12 GB
QuantQ4_K_M

DeepSeek R1 Distill Qwen 32B

DeepSeek · 32B

15 GB

Best for High-value reasoning tasks

reasoningdistillchat
VRAM17 GB
RAM20 GB
QuantQ3_K_M

DeepSeek R1 Distill Qwen 7B

DeepSeek · 7B

4.7 GB

Best for Reasoning tasks

reasoningdistillchat
VRAM5.5 GB
RAM8 GB
QuantQ4_K_M

DeepSeek V3 671B

DeepSeek · 671B

240 GB
chatgeneralmoefrontier
VRAM245 GB
RAM260 GB
QuantQ2_K

Gemma 4 E2B

Gemma · 2.3B

2.7 GB
chatsmalledgemultimodal+2
VRAM3.5 GB
RAM4 GB
QuantQ4_K_M

Gemma 4 E4B

Gemma · 4.5B

4.1 GB

Best for On-device multimodal assistants

chatsmallmultimodalreasoning+1
VRAM5 GB
RAM6 GB
QuantQ4_K_M

Hermes 3 Llama 3.1 70B

Hermes · 70B

40 GB
chatgeneralfunction-calling
VRAM42 GB
RAM48 GB
QuantQ4_K_M

Hermes 3 Llama 3.1 8B

Hermes · 8B

4.9 GB
chatgeneralfunction-calling
VRAM5.5 GB
RAM8 GB
QuantQ4_K_M

Llama 3.1 405B

Llama · 405B

145 GB
chatgeneralfrontier
VRAM150 GB
RAM160 GB
QuantQ2_K

Llama 3.1 70B

Llama · 70B

25 GB
chatgeneralcoding
VRAM27 GB
RAM32 GB
QuantQ2_K

Llama 3.1 8B

Llama · 8B

3.9 GB

Best for General use

chatgeneralcoding
VRAM4.5 GB
RAM6 GB
QuantQ3_K_M

Llama 3.1 Nemotron 70B

Nemotron · 70B

40 GB
chatgeneralinstruct
VRAM42 GB
RAM48 GB
QuantQ4_K_M

Llama 3.2 1B

Llama · 1.24B

0.75 GB
chatsmalledgeinstruct
VRAM1.5 GB
RAM2 GB
QuantQ4_K_M

Llama 3.2 3B

Llama · 3.21B

2 GB
chatsmalledgeinstruct
VRAM3 GB
RAM4 GB
QuantQ4_K_M

Llama 3.3 70B

Llama · 70B

33 GB

Best for High-end workstations

chatgeneralcodinginstruct
VRAM35 GB
RAM40 GB
QuantQ3_K_M

Llama 4 Maverick 17B (128E)

Llama · 17B

230 GB
chatmoemultimodalfrontier
VRAM235 GB
RAM256 GB
QuantQ4_K_M

Llama 4 Scout 17B (16E)

Llama · 17B

60 GB

Best for Mid/high-end hardware

chatmoemultimodal
VRAM63 GB
RAM68 GB
QuantQ4_K_M

Mistral Nemo 12B

Mistral · 12B

7.3 GB

Best for Multilingual assistants

chatgeneralmultilingual
VRAM8.5 GB
RAM12 GB
QuantQ4_K_M

Mistral Small 3.1 24B

Mistral · 24B

12.6 GB
chatgeneralreasoningmultimodal+2
VRAM14 GB
RAM20 GB
QuantQ4_K_M

Qwen 2.5 1.5B

Qwen · 1.5B

1 GB
chatsmalledge
VRAM2 GB
RAM4 GB
QuantQ4_K_M

Qwen 2.5 14B

Qwen · 14B

8.7 GB

Best for Internal team assistants

chatgeneralmultilingual
VRAM10 GB
RAM12 GB
QuantQ4_K_M

Qwen 2.5 32B

Qwen · 32B

15 GB

Best for High-end local deployments

chatgeneralmultilingual
VRAM17 GB
RAM20 GB
QuantQ3_K_M

Qwen 2.5 3B

Qwen · 3B

1.9 GB
chatsmall
VRAM3 GB
RAM4 GB
QuantQ4_K_M

Qwen 2.5 72B

Qwen · 72B

27 GB
chatgeneralmultilingual
VRAM29 GB
RAM36 GB
QuantQ2_K

Qwen 2.5 7B

Qwen · 7B

3.7 GB

Best for General multilingual assistants

chatgeneralmultilingual
VRAM4.5 GB
RAM6 GB
QuantQ3_K_M

Qwen 2.5 Coder 32B

Qwen · 32B

19 GB
codinginstructchat
VRAM21 GB
RAM24 GB
QuantQ4_K_M

QwQ 32B

Qwen · 32B

15 GB

Best for Expert analytical users

reasoningchat
VRAM17 GB
RAM20 GB
QuantQ3_K_M