
Batch API

Quick Answer

An API for asynchronous batch processing of many requests at discounted rates.

A batch API accepts many requests in a single submission and processes them asynchronously at a lower price, usually about 50% below the standard rate. Results are not returned immediately; a typical batch takes hours to complete. The trade-off is latency for cost savings, so batch APIs are a practical fit for background, non-time-sensitive work such as daily reports or large-scale classification, and they can also reduce infrastructure costs. They are not suitable for real-time applications.
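The cost side of this trade-off can be worked out with simple arithmetic. The sketch below is illustrative only: the function name `batch_cost`, the request volume, the tokens per request, and the price of $2.00 per million tokens are all hypothetical values chosen for the example, and the 50% discount is the typical figure mentioned above, not a rate quoted from any specific provider.

```python
def batch_cost(n_requests, tokens_per_request, price_per_million_tokens, discount=0.5):
    """Return (standard_cost, batch_cost) for a workload.

    All inputs are illustrative assumptions; `discount` is the fraction
    taken off the standard price (0.5 means 50% off).
    """
    total_tokens = n_requests * tokens_per_request
    standard = total_tokens / 1_000_000 * price_per_million_tokens
    return standard, standard * (1 - discount)


# Hypothetical workload: 100,000 requests of 1,000 tokens each at $2.00 per million tokens.
standard, batched = batch_cost(100_000, 1_000, 2.00)
print(f"standard: ${standard:.2f}, batch: ${batched:.2f}, saved: ${standard - batched:.2f}")
# prints "standard: $200.00, batch: $100.00, saved: $100.00"
```

The saving scales linearly with volume, so the discount matters most for large jobs that can tolerate a wait of hours.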

Last verified: 2026-04-08
