How Much Does Claude API Cost?

Claude API pricing varies by model tier. Anthropic offers three model tiers with different performance and cost trade-offs:

Claude Opus 4: $15.00/M input tokens, $75.00/M output tokens. Context window: 200K. Cached input: $1.50/M.

Claude Sonnet 4: $3.00/M input tokens, $15.00/M output tokens. Context window: 200K. Cached input: $0.300/M.

Claude Haiku 4: $0.800/M input tokens, $4.00/M output tokens. Context window: 200K. Cached input: $0.080/M.

For most production workloads, Claude Sonnet 4 offers the best balance of quality and cost. Use Haiku 4 for high-volume, low-complexity tasks like classification and extraction. Opus 4 is best for complex reasoning, coding, and analysis where quality is paramount.

Tip: Use prompt caching to reduce costs by up to 90% on repeated system prompts. Anthropic also offers batch API pricing at roughly 50% of standard rates for non-time-sensitive workloads.

Related Tools

Full Pricing Table Cost Calculator

Related Questions

What's the Cheapest LLM for Coding?

Find the most affordable LLM API for coding tasks, ranked by coding ELO and price.

ChatGPT vs Claude: Which Is Better?

Head-to-head comparison of OpenAI and Anthropic models across pricing, benchmarks, and capabilities.

Best LLM API for Production Use

Which LLM API is best for production? Evaluating reliability, speed, pricing, and capabilities.

LLM API Pricing Comparison — Complete Guide

Complete guide to LLM API pricing in 2026 across all major providers.

How to Reduce LLM API Costs

Practical strategies for cutting LLM API spend: caching, batching, model routing, and more.

Which LLM Has the Largest Context Window?

Ranking of all LLMs by context window size, from 1M tokens down to 32K.

Fastest LLM API — Speed Comparison

Which LLM API is fastest? Comparing TTFT and throughput across all major providers.