LLM API Pricing Comparison 2026
Compare input and output token pricing across 25 large language models from OpenAI, Anthropic, Google, Meta, and more. Sort by any column, filter by provider or capability, and click any model to see full benchmarks and details.
Data verified Apr 3, 2026
Capabilities:
Last verified: Apr 3, 2026Showing 25 of 25 models
| Model | Provider | Input $/M | Output $/M | Arena ELO |
|---|---|---|---|---|
| o4-mini | OpenAI | $1.10 | $4.40 | 1350 |
| Gemini 2.5 Pro | $1.25 | $10.00 | 1340 | |
| Claude Opus 4 | Anthropic | $15.00 | $75.00 | 1330 |
| DeepSeek R1 | DeepSeek | $0.550 | $2.19 | 1310 |
| o3-mini | OpenAI | $1.10 | $4.40 | 1310 |
| Grok 3 | xAI | $3.00 | $15.00 | 1300 |
| GPT-4.1 | OpenAI | $2.00 | $8.00 | 1290 |
| Llama 4 Maverick | Meta | $0.200 | $0.600 | 1290 |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 | 1280 |
| DeepSeek V3 | DeepSeek | $0.270 | $1.10 | 1280 |
| Gemini 2.0 Flash | $0.100 | $0.400 | 1260 | |
| GPT-4o | OpenAI | $2.50 | $10.00 | 1260 |
| Qwen 2.5 Max | Alibaba | $0.160 | $0.640 | 1260 |
| Llama 4 Scout | Meta | $0.100 | $0.300 | 1250 |
| Mistral Large | Mistral | $2.00 | $6.00 | 1245 |
| GPT-4.1 Mini | OpenAI | $0.400 | $1.60 | 1240 |
| Claude Haiku 4 | Anthropic | $0.800 | $4.00 | 1220 |
| GPT-4o Mini | OpenAI | $0.150 | $0.600 | 1220 |
| Grok 3 Mini | xAI | $0.300 | $0.500 | 1220 |
| Command R+ | Cohere | $2.50 | $10.00 | 1200 |
| Gemini 2.0 Flash Lite | $0.075 | $0.300 | 1200 | |
| Mistral Small | Mistral | $0.100 | $0.300 | 1185 |
| GPT-4.1 Nano | OpenAI | $0.100 | $0.400 | 1180 |
| Phi-4 | Microsoft | $0.070 | $0.140 | 1150 |
| Command R | Cohere | $0.150 | $0.600 | 1140 |
Frequently Asked Questions
- Which LLM API is the cheapest in 2026?
- As of April 2026, GPT-4.1 Nano and Gemini 2.0 Flash Lite offer the lowest per-token pricing for production workloads. Prices vary by input vs. output tokens, so the cheapest option depends on your specific usage pattern.
- How often are LLM API prices updated?
- We verify pricing directly from provider documentation every week. Each model listing shows a 'Last verified' date so you can confirm the data is current.
- What is the difference between input and output token pricing?
- Input tokens are the tokens you send to the API (your prompt), while output tokens are the tokens the model generates in its response. Most providers charge different rates for each, with output tokens typically costing 2-5x more than input tokens.
- Do any LLM APIs offer free tiers?
- Several providers offer limited free tiers or trial credits. Google's Gemini API has a generous free tier for lower rate limits. OpenAI and Anthropic offer sign-up credits for new accounts. Check each provider's pricing page for current free tier details.