LLM API Pricing Comparison — Complete Guide

LLM API pricing in 2026 varies dramatically across 25 models from 10 providers. Here's a comprehensive breakdown:


Alibaba:

  • Qwen 2.5 Max: $0.160/M input, $0.640/M output

  • Anthropic:

  • Claude Haiku 4: $0.800/M input, $4.00/M output
  • Claude Sonnet 4: $3.00/M input, $15.00/M output
  • Claude Opus 4: $15.00/M input, $75.00/M output

  • Cohere:

  • Command R: $0.150/M input, $0.600/M output
  • Command R+: $2.50/M input, $10.00/M output

  • DeepSeek:

  • DeepSeek V3: $0.270/M input, $1.10/M output
  • DeepSeek R1: $0.550/M input, $2.19/M output

  • Google:

  • Gemini 2.0 Flash Lite: $0.075/M input, $0.300/M output
  • Gemini 2.0 Flash: $0.100/M input, $0.400/M output
  • Gemini 2.5 Pro: $1.25/M input, $10.00/M output

  • Meta:

  • Llama 4 Scout: $0.100/M input, $0.300/M output
  • Llama 4 Maverick: $0.200/M input, $0.600/M output

  • Microsoft:

  • Phi-4: $0.070/M input, $0.140/M output

  • Mistral:

  • Mistral Small: $0.100/M input, $0.300/M output
  • Mistral Large: $2.00/M input, $6.00/M output

  • OpenAI:

  • GPT-4.1 Nano: $0.100/M input, $0.400/M output
  • GPT-4o Mini: $0.150/M input, $0.600/M output
  • GPT-4.1 Mini: $0.400/M input, $1.60/M output
  • o4-mini: $1.10/M input, $4.40/M output
  • o3-mini: $1.10/M input, $4.40/M output
  • GPT-4.1: $2.00/M input, $8.00/M output
  • GPT-4o: $2.50/M input, $10.00/M output

  • xAI:

  • Grok 3 Mini: $0.300/M input, $0.500/M output
  • Grok 3: $3.00/M input, $15.00/M output

  • The cheapest model is Phi-4 at $0.070/M input tokens. The most expensive output pricing is Claude Opus 4 at $75.00/M output tokens.


    Use our interactive pricing table for sortable, filterable comparisons, or try the cost calculator to estimate your specific monthly spend.

    Related Questions