Pricing & Cost
Input Tokens
Quick Answer
Tokens in the prompt, billed separately from output tokens and usually at a lower per-token rate.
Input tokens are the tokens in your prompt and context, meaning everything you send to the model. APIs charge per input token, usually at a lower rate than for output tokens, and per-token pricing differs from one API to another. Knowing your input token count is therefore crucial for cost optimization. Techniques that reduce input tokens include prompt compression, semantic caching, and instruction templating. Where an API offers prompt caching, cached input tokens are billed at a reduced rate. In workloads where prompts carry far more tokens than the responses they produce, optimizing input tokens is often more valuable than optimizing output tokens.
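The billing arithmetic described above can be sketched in a few lines of Python. This is a minimal illustration, not any provider's actual billing logic: the `Pricing` class, the `request_cost` function, and the prices used are all hypothetical, and the sketch assumes that cached input tokens are simply billed at a separate, lower rate.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Hypothetical prices in USD per one million tokens (not real rates)."""
    input_per_mtok: float
    cached_input_per_mtok: float
    output_per_mtok: float


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int,
                 cached_input_tokens: int = 0) -> float:
    """Estimate the cost of one request in USD.

    cached_input_tokens is the portion of input_tokens served from a
    prompt cache; it is billed at the cached rate instead of the full rate.
    """
    if not 0 <= cached_input_tokens <= input_tokens:
        raise ValueError("cached_input_tokens must be between 0 and input_tokens")
    uncached = input_tokens - cached_input_tokens
    total = (uncached * pricing.input_per_mtok
             + cached_input_tokens * pricing.cached_input_per_mtok
             + output_tokens * pricing.output_per_mtok)
    return total / 1_000_000


if __name__ == "__main__":
    # Made-up prices chosen only to keep the arithmetic easy to follow.
    pricing = Pricing(input_per_mtok=2.0, cached_input_per_mtok=0.5,
                      output_per_mtok=8.0)
    no_cache = request_cost(pricing, input_tokens=10_000, output_tokens=500)
    with_cache = request_cost(pricing, input_tokens=10_000, output_tokens=500,
                              cached_input_tokens=8_000)
    print(f"without caching: ${no_cache:.4f}")
    print(f"with caching:    ${with_cache:.4f}")
```

With these made-up numbers the prompt is 20 times longer than the response, so input tokens dominate the bill even though each one is cheaper than an output token. Serving 8,000 of the 10,000 input tokens from the cache halves the estimated cost of the request.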
Last verified: 2026-04-08