Models

Popular AI models on nfer

The most popular open-weight and frontier models, each with a dedicated hosting-cost page. Pick one to see API token pricing, dedicated capacity, and rented GPU costs ranked side by side.

DeepSeek-V3.2

685B
DeepSeek · 8 providers
API from $0.26/1M input tokens · GPU from $3.50/hr

Llama-3.1-8B-Instruct

8.0B
Meta · 12 providers
API from $0.02/1M input tokens · GPU from $0.28/hr

DeepSeek-R1

685B
DeepSeek · 7 providers
API from $1.35/1M input tokens · GPU from $3.50/hr

Mistral-7B-Instruct-v0.3

7.2B
Mistral AI · 10 providers
API from $0.11/1M input tokens · GPU from $0.28/hr

Qwen3-Coder-30B-A3B-Instruct

31B
qwen · 9 providers
API from $0.07/1M input tokens · GPU from $0.55/hr

gemma-3-4b-it

4.3B
Google · 11 providers
API from $0.04/1M input tokens · GPU from $0.14/hr

Meta-Llama-3-8B-Instruct

8.0B
Meta · 11 providers
API from $0.03/1M input tokens · GPU from $0.28/hr

MiniMax-M2.5

229B
minimaxai · 10 providers
API from $0.15/1M input tokens · GPU from $1.24/hr

NVIDIA-Nemotron-3-Super-120B-A12B-BF16

124B
NVIDIA · 7 providers
API from $0.15/1M input tokens · GPU from $1.24/hr

Llama-3.3-70B-Instruct

71B
Meta · 12 providers
API from $0.13/1M input tokens · GPU from $1.13/hr

Llama-3.1-70B-Instruct

71B
Meta · 10 providers
API from $0.40/1M input tokens · GPU from $1.13/hr

Mixtral-8x7B-Instruct-v0.1

47B
Mistral AI · 11 providers
API from $0.54/1M input tokens · GPU from $1.10/hr

MiniMax-M2.7

229B
minimaxai · 8 providers
API from $0.30/1M input tokens · GPU from $1.24/hr

gemma-3-27b-it

27B
Google · 10 providers
API from $0.08/1M input tokens · GPU from $0.55/hr

Qwen3.5-397B-A17B

403B
qwen · 6 providers
API from $0.60/1M input tokens · GPU from $3.50/hr

Devstral-Small-2-24B-Instruct-2512

24B
Mistral AI · 10 providers
API from $0.10/1M input tokens · GPU from $0.28/hr

Qwen3-Next-80B-A3B-Instruct

81B
qwen · 7 providers
API from $0.14/1M input tokens · GPU from $1.24/hr

Qwen2.5-VL-72B-Instruct

73B
qwen · 9 providers
API from $0.25/1M input tokens · GPU from $1.13/hr

GLM-4.7

358B
Z.ai · 9 providers
API from $0.60/1M input tokens · GPU from $3.50/hr

MiniMax-M2

229B
minimaxai · 8 providers
API from $0.26/1M input tokens · GPU from $1.24/hr