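LLM API pricing in 2026 varies dramatically across 25 models from 10 providers, from $0.070 to $15.00 per million input tokens. Here's a comprehensive breakdown by provider, with prices in USD per million tokens (/M):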
Alibaba:
Qwen 2.5 Max: $0.160/M input, $0.640/M output

Anthropic:
Claude Haiku 4: $0.800/M input, $4.00/M output
Claude Sonnet 4: $3.00/M input, $15.00/M output
Claude Opus 4: $15.00/M input, $75.00/M output

Cohere:
Command R: $0.150/M input, $0.600/M output
Command R+: $2.50/M input, $10.00/M output

DeepSeek:
DeepSeek V3: $0.270/M input, $1.10/M output
DeepSeek R1: $0.550/M input, $2.19/M output

Google:
Gemini 2.0 Flash Lite: $0.075/M input, $0.300/M output
Gemini 2.0 Flash: $0.100/M input, $0.400/M output
Gemini 2.5 Pro: $1.25/M input, $10.00/M output

Meta:
Llama 4 Scout: $0.100/M input, $0.300/M output
Llama 4 Maverick: $0.200/M input, $0.600/M output

Microsoft:
Phi-4: $0.070/M input, $0.140/M output

Mistral:
Mistral Small: $0.100/M input, $0.300/M output
Mistral Large: $2.00/M input, $6.00/M output

OpenAI:
GPT-4.1 Nano: $0.100/M input, $0.400/M output
GPT-4o Mini: $0.150/M input, $0.600/M output
GPT-4.1 Mini: $0.400/M input, $1.60/M output
o4-mini: $1.10/M input, $4.40/M output
o3-mini: $1.10/M input, $4.40/M output
GPT-4.1: $2.00/M input, $8.00/M output
GPT-4o: $2.50/M input, $10.00/M output

xAI:
Grok 3 Mini: $0.300/M input, $0.500/M output
Grok 3: $3.00/M input, $15.00/M output

The cheapest model is Phi-4 at $0.070/M input tokens. The most expensive output pricing is Claude Opus 4 at $75.00/M output tokens.
Use our interactive pricing table for sortable, filterable comparisons, or try the cost calculator to estimate your specific monthly spend.
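
For a rough sense of how these per-million-token prices turn into a monthly bill, here is a minimal Python sketch. It is not the site's calculator. The `PRICES` table holds three entries copied from the list above, and the `monthly_cost` helper and the workload of 50M input and 10M output tokens per month are hypothetical, chosen only for illustration. It also ignores caching, batch discounts, and other pricing tiers, which the list above does not cover.

```python
# Prices in USD per million tokens (input, output), copied from the list above.
# Only three models are included to keep the sketch short.
PRICES = {
    "Phi-4": (0.070, 0.140),
    "GPT-4.1": (2.00, 8.00),
    "Claude Opus 4": (15.00, 75.00),
}


def monthly_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate monthly spend in USD for a given token volume.

    Hypothetical helper: assumes flat per-token pricing with no discounts.
    """
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Assumed workload: 50M input tokens and 10M output tokens per month.
    for name in PRICES:
        cost = monthly_cost(name, 50_000_000, 10_000_000)
        print(f"{name}: ${cost:,.2f}")
    # Phi-4: $4.90
    # GPT-4.1: $180.00
    # Claude Opus 4: $1,500.00
```

Output tokens are priced separately from input tokens, so a workload's input-to-output ratio can matter as much as the headline input price when comparing models.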