AI Model Pricing Comparator

Stop guessing.
Start comparing.

Compare per-token API costs, GPU hourly rates, and reserved instances — with realistic monthly estimates based on your actual workload.



Your workload

Tokens vol.

1M1B

Input / Output

100% Input100% Output

Active days / mo

1 day30 days

Active hours / day

1 hr24 hrs

Cache hit rate

0%100%

Batch APIs

How it works

Three steps to efficient deployment

Search a model
Llama, Mistral, Claude...

Set your assumptions
Volume, I/O ratio, usage

Compare & choose
Realistic monthly estimates



API vs GPU vs Reserved

Compare pay-per-token API pricing against GPU hourly rates and committed reserved instances - side by side, same model, same page.



Realistic monthly estimates

Set your token volume, usage pattern, and I/O ratio. See what you can expect to pay - not just per-token rates that hide the real cost.



Filter by what matters

Sovereignty, certifications, region, quantization, license - filter by any dimension, not just price.

25+

Providers

100+

Models

30+

Hardware SKUs

600+

Price points tracked

Adding new providers and models every week

Mistral

OVHcloud

Nebius

Together AI

Groq

Cerebras

SambaNova

OpenAI

Google AI

AWS

Replicate

Lambda Labs

Clarifai

Mistral

OVHcloud

Nebius

Together AI

Groq

Cerebras

SambaNova

OpenAI

Google AI

AWS

Replicate

Lambda Labs

Clarifai

Scaleway

Hetzner

Verda

Fireworks AI

DeepInfra

DeepSeek

Anthropic

Google Cloud

Azure

Baseten

CoreWeave

Alibaba Cloud

Moonshot AI

Z.AI

MiniMax