Deployment guide
Cheapest way to deploy DeepSeek-R1 in 2026
7 providers compared: API token pricing, dedicated capacity, and rented GPU costs side by side, normalized to monthly cost.
Cheapest API
$1.35 / 1M input tokens
at Google Cloud Vertex AI
Cheapest GPU rental
$3.50 / hour
at Nebius on H200 SXM
Provider rate cards
7 providers compared
| Hardware | GPUs | VRAM per GPU (GB) | Region | Price per hour | Commitment | Source |
|---|---|---|---|---|---|---|
| H200 SXM | 8 | 141 | Netherlands | $3.50 | On-demand | Source ↗ |
| B200 SXM | 8 | 192 | Netherlands | $5.50 | On-demand | Source ↗ |
| RTX PRO 6000 | 16 | 96 | Multiple regions | $5.50 | On-demand | Source ↗ |
| B300 SXM | 8 | 288 | Netherlands | $6.10 | On-demand | Source ↗ |
| H200 SXM | 8 | 141 | Finland | $27.12 | On-demand | Source ↗ |
| B300 SXM | 4 | 288 | Finland | $27.96 | On-demand | Source ↗ |
| B200 SXM | 8 | 192 | Finland | $39.12 | On-demand | Source ↗ |
| H200 SXM | 8 | 141 | United States | $50.44 | On-demand | Source ↗ |
| MI300X | 8 | 192 | Multiple regions | $57.60 | On-demand | Source ↗ |
| H200 SXM | 8 | 141 | Multiple regions | $63.30 | On-demand | Source ↗ |
| B200 SXM | 8 | 192 | United States | $68.80 | On-demand | Source ↗ |
| H200 SXM | 8 | 141 | Multiple regions | $84.80 | On-demand | Source ↗ |
Break-even chart
API vs. GPU rental
The crossover point at which renting a GPU full-time becomes cheaper than paying per token: ~25M tokens/day.
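The crossover can be reproduced with a few lines of arithmetic. The sketch below is an illustration, not the site's actual calculation: it assumes a 50/50 input/output token split and a GPU rented around the clock, and it ignores whether a single rental can actually serve that volume. The function and parameter names are hypothetical. Under those assumptions, the Vertex AI rates ($1.35 and $5.40 per 1M tokens) and the Nebius rate ($3.50/hour) quoted on this page land close to the ~25M tokens/day figure.

```python
def blended_price_per_million(input_price, output_price, input_share):
    """Average API price per 1M tokens for a given input/output mix."""
    return input_share * input_price + (1.0 - input_share) * output_price


def break_even_tokens_per_day(gpu_hourly, input_price, output_price,
                              input_share=0.5, hours_per_day=24.0):
    """Daily token volume (in millions) at which API spend equals GPU rental."""
    gpu_cost_per_day = gpu_hourly * hours_per_day
    api_price = blended_price_per_million(input_price, output_price, input_share)
    return gpu_cost_per_day / api_price


if __name__ == "__main__":
    # Rates quoted on this page; the 50/50 token split is an assumption.
    millions = break_even_tokens_per_day(3.50, 1.35, 5.40)
    print(f"Break-even: about {millions:.1f}M tokens/day")
```

Shifting `input_share` toward input tokens lowers the blended API price and pushes the break-even volume higher; an output-heavy workload does the opposite.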
Your monthly cost
Top 5 cheapest for your workload
The ranking depends on your assumptions (token volume, input/output ratio, and days and hours of usage); changing them shifts which options are cheapest.
| Rank | Pricing | Hardware | Monthly |
|---|---|---|---|
| #1 | GPU · On-demand | H200 SXM | $2.52k |
| #2 | GPU · On-demand | B200 SXM | $3.96k |
| #3 | GPU · On-demand | RTX PRO 6000 | $3.96k |
| #4 | GPU · On-demand | B300 SXM | $4.39k |
| #5 | API | — | $6.88k |
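The GPU rows in this table are consistent with an always-on rental billed for 720 hours (24 hours a day for 30 days). The sketch below assumes that billing model and uses hypothetical function names; it converts an hourly rate into a monthly figure and formats it the way the table does. Applied to the $3.50, $5.50, and $6.10 hourly rates from the rate cards, it reproduces the $2.52k, $3.96k, and $4.39k entries. The API row depends on token-volume assumptions that are not stated on the page, so it is left out.

```python
def monthly_gpu_cost(hourly_rate, hours_per_day=24.0, days_per_month=30):
    """Monthly rental cost for a GPU offer billed by the hour."""
    return hourly_rate * hours_per_day * days_per_month


def format_thousands(amount):
    """Render a dollar amount in thousands, e.g. '$2.52k'."""
    return f"${amount / 1000:.2f}k"


if __name__ == "__main__":
    # Hourly rates taken from the rate-card table on this page.
    offers = {"H200 SXM": 3.50, "B200 SXM": 5.50, "B300 SXM": 6.10}
    ranked = sorted(offers.items(), key=lambda item: item[1])
    for rank, (hardware, rate) in enumerate(ranked, start=1):
        print(f"#{rank} {hardware}: {format_thousands(monthly_gpu_cost(rate))}")
```

Lowering `hours_per_day` or `days_per_month` models a part-time rental, which is where on-demand pricing can undercut the always-on figures.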
Spec sheet
DeepSeek-R1 at a glance
- VRAM (native precision): 685 GB
- Parameters: 684.5314B
- Native precision: fp8
- Context length: —
- License: MIT
- Knowledge cutoff: —
- Modalities: text
- Access type: Open source
- EU developed: No
- Origin country: CN
Why these numbers
A second opinion on the data
Hardware footprint
DeepSeek-R1 is a 684.5B-parameter model that needs 685 GB of VRAM when self-hosted at its native fp8 precision. An 8x H100 node with 80 GB per GPU totals 640 GB, which falls short, so the model needs 8x H200 or a larger cluster. Because fp8 already stores one byte per parameter, int8 quantization does not reduce the weight footprint; int4 roughly halves it, at a modest accuracy cost. Five GPU rental providers in nfer's index currently offer hardware that fits this model at native precision.
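The 685 GB figure follows from a weights-only estimate: parameter count times bytes per parameter. The sketch below is a rough illustration under that assumption; it ignores KV cache, activations, and runtime overhead, so real deployments need headroom beyond what it reports. The function names are hypothetical, and the node configurations come from the rate-card table on this page.

```python
import math


def weights_vram_gb(params_billions, bits_per_param):
    """Weights-only memory in GB (1 GB = 1e9 bytes): parameters x bytes each."""
    return params_billions * bits_per_param / 8.0


def fits(required_gb, gpu_count, vram_per_gpu_gb):
    """True if the combined VRAM of a node covers the requirement."""
    return gpu_count * vram_per_gpu_gb >= required_gb


if __name__ == "__main__":
    required = weights_vram_gb(684.5314, 8)  # fp8 stores one byte per parameter
    print(f"Weights at fp8: {math.ceil(required)} GB")
    # (hardware, GPU count, VRAM per GPU in GB) from the rate-card table.
    for name, count, vram in [("H200 SXM", 8, 141), ("B300 SXM", 4, 288)]:
        print(f"{count}x {name}: fits={fits(required, count, vram)}")
```

Passing a different `bits_per_param` gives the weights-only footprint at other precisions, which is a quick way to check a quantized variant against a smaller node.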
Cheapest path today
The cheapest API offering for DeepSeek-R1 is Google Cloud Vertex AI at $1.35 per 1M input tokens and $5.40 per 1M output tokens. The cheapest GPU rental that fits the model is Nebius on H200 SXM at $3.50/hour. Where the break-even between paying per token and renting a GPU falls depends on your daily volume; see the chart above.
Licensing and fit
DeepSeek-R1 is released under the MIT license, and its open-source weights are publicly available. This page does not specify a context length for the model.
FAQ
Common questions
What's the cheapest way to host DeepSeek-R1?
The cheapest API option for DeepSeek-R1 in nfer's index is Google Cloud Vertex AI at $1.35 per 1M input tokens and $5.40 per 1M output tokens. For self-hosted workloads, the cheapest GPU rental that fits is Nebius on H200 SXM at $3.50/hour. The right choice depends on your daily token volume; see the break-even chart on this page.
How much VRAM does DeepSeek-R1 need?
DeepSeek-R1 needs 685 GB at native precision, which is fp8. Because fp8 already uses one byte per parameter, int8 quantization leaves the weight footprint at roughly 685 GB; int4 halves it to roughly 343 GB, at a modest accuracy cost.
Can I use DeepSeek-R1 commercially?
Yes. DeepSeek-R1 is released under the MIT license, which permits commercial use.
What's the difference between API and GPU rental for DeepSeek-R1?
Token-priced API providers (like Google Cloud Vertex AI) bill per million input and output tokens, which suits low or bursty volume. Renting a GPU (e.g. Nebius at $3.50/hour) costs a flat ~$2,520/month for round-the-clock use (720 hours), regardless of how many tokens you serve, so the economics improve once you sustain enough tokens per day to justify the fixed cost. The break-even chart on this page shows the crossover point.
Is DeepSeek-R1 available with EU data residency?
DeepSeek-R1 is not a European-developed model. 1 EU-owned provider in nfer's index offers hosting; filter on EU sovereignty in the comparator to see it.
Prices last updated · 2026-04-30