How Much Does the Claude API Cost?
Claude API pricing varies by model tier. Anthropic offers three tiers, each with a different performance and cost trade-off. Prices below are per million tokens (M):
Claude Opus 4: $15.00/M input tokens, $75.00/M output tokens. Context window: 200K. Cached input: $1.50/M.
Claude Sonnet 4: $3.00/M input tokens, $15.00/M output tokens. Context window: 200K. Cached input: $0.30/M.
Claude Haiku 4: $0.80/M input tokens, $4.00/M output tokens. Context window: 200K. Cached input: $0.08/M.
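To make the per-token rates concrete, here is a minimal Python sketch that estimates the cost of a single request from the prices listed above. The dictionary keys ("opus-4", "sonnet-4", "haiku-4") and the function name request_cost are hypothetical labels chosen for this example, not official API model identifiers, and the token counts are made-up inputs.

```python
# Prices in USD per million tokens, copied from the list above.
# The keys are illustrative labels, not official model IDs.
PRICES = {
    "opus-4": {"input": 15.00, "output": 75.00},
    "sonnet-4": {"input": 3.00, "output": 15.00},
    "haiku-4": {"input": 0.80, "output": 4.00},
}


def request_cost(tier: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at standard (uncached) rates."""
    price = PRICES[tier]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Example: 2,000 input tokens and 500 output tokens on the Sonnet tier.
print(f"{request_cost('sonnet-4', 2_000, 500):.4f}")  # prints 0.0135
```

Because output tokens cost five times as much as input tokens on every tier listed, long responses tend to dominate the bill even when prompts are large.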
For most production workloads, Claude Sonnet 4 offers the best balance of quality and cost. Use Haiku 4 for high-volume, low-complexity tasks like classification and extraction. Opus 4 is best for complex reasoning, coding, and analysis where quality is paramount.
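Tip: Use prompt caching to cut input costs by up to 90% on repeated content such as system prompts. In the prices listed above, cached input is billed at one tenth of the standard input rate. Anthropic also offers batch API pricing at roughly 50% of standard rates for non-time-sensitive workloads.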
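As a rough illustration of the two discounts in the tip, the Python sketch below compares input cost with and without caching and applies the approximate batch discount. It uses the Sonnet 4 rates quoted earlier. The function names, the token counts, and the flat 0.5 batch multiplier are assumptions for this example, and the sketch ignores any charge for writing content into the cache, which this article does not list.

```python
def input_cost_with_cache(total_input_tokens: int, cached_tokens: int,
                          input_price: float, cached_price: float) -> float:
    """USD cost of the input side when part of the prompt is read from cache.

    Prices are in USD per million tokens. Cache-write charges are ignored
    (an assumption of this sketch).
    """
    uncached_tokens = total_input_tokens - cached_tokens
    total = uncached_tokens * input_price + cached_tokens * cached_price
    return total / 1_000_000


def batch_cost(standard_cost: float, batch_multiplier: float = 0.5) -> float:
    """Apply the 'roughly 50%' batch rate; 0.5 is an approximation."""
    return standard_cost * batch_multiplier


# Sonnet 4 rates from the list above: $3.00/M input, $0.30/M cached input.
# Hypothetical request: 10,000 input tokens, 8,000 of them a cached system prompt.
without_cache = input_cost_with_cache(10_000, 0, 3.00, 0.30)
with_cache = input_cost_with_cache(10_000, 8_000, 3.00, 0.30)
savings = 1 - with_cache / without_cache

print(f"{without_cache:.4f} {with_cache:.4f} {savings:.0%}")  # prints 0.0300 0.0084 72%
print(f"{batch_cost(without_cache):.4f}")  # prints 0.0150
```

The savings here are 72% rather than 90% because only the cached portion of the prompt gets the reduced rate. The closer the cached share is to the whole prompt, the closer the saving gets to the 90% ceiling.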