Models

Popular AI models on nfer

The most popular open-weight and frontier models with a dedicated hosting-cost page — pick one to see API token pricing, dedicated capacity, and rented GPU costs ranked side-by-side.

DeepSeek-V3.2

DeepSeek · 8 providers

API from $0.26 /1M inGPU from $3.50/hr

Llama-3.1-8B-Instruct

Meta · 12 providers

API from $0.02 /1M inGPU from $0.28/hr

DeepSeek-R1

DeepSeek · 7 providers

API from $1.35 /1M inGPU from $3.50/hr

Mistral-7B-Instruct-v0.3

Mistral AI · 10 providers

API from $0.11 /1M inGPU from $0.28/hr

Qwen3-Coder-30B-A3B-Instruct

qwen · 9 providers

API from $0.07 /1M inGPU from $0.55/hr

gemma-3-4b-it

Google · 11 providers

API from $0.04 /1M inGPU from $0.14/hr

Meta-Llama-3-8B-Instruct

Meta · 11 providers

API from $0.03 /1M inGPU from $0.28/hr

MiniMax-M2.5

minimaxai · 10 providers

API from $0.15 /1M inGPU from $1.24/hr

NVIDIA-Nemotron-3-Super-120B-A12B-BF16

NVIDIA · 7 providers

API from $0.15 /1M inGPU from $1.24/hr

Llama-3.3-70B-Instruct

Meta · 12 providers

API from $0.13 /1M inGPU from $1.13/hr

Llama-3.1-70B-Instruct

Meta · 10 providers

API from $0.40 /1M inGPU from $1.13/hr

Mixtral-8x7B-Instruct-v0.1

Mistral AI · 11 providers

API from $0.54 /1M inGPU from $1.10/hr

MiniMax-M2.7

minimaxai · 8 providers

API from $0.30 /1M inGPU from $1.24/hr

gemma-3-27b-it

Google · 10 providers

API from $0.08 /1M inGPU from $0.55/hr

Qwen3.5-397B-A17B

qwen · 6 providers

API from $0.60 /1M inGPU from $3.50/hr

Devstral-Small-2-24B-Instruct-2512

Mistral AI · 10 providers

API from $0.10 /1M inGPU from $0.28/hr

Qwen3-Next-80B-A3B-Instruct

qwen · 7 providers

API from $0.14 /1M inGPU from $1.24/hr

Qwen2.5-VL-72B-Instruct

qwen · 9 providers

API from $0.25 /1M inGPU from $1.13/hr

GLM-4.7

Zyphra · 9 providers

API from $0.60 /1M inGPU from $3.50/hr

MiniMax-M2

minimaxai · 8 providers

API from $0.26 /1M inGPU from $1.24/hr