Best LLMs for Summarization (2026)

Large language models that produce concise, accurate, and readable summaries of long documents, research papers, meeting transcripts, and web pages.

Why Claude Sonnet 4 is Best for Summarization

Claude Sonnet 4 ranks highest for this use case based on Arena ELO score, benchmark performance, and capability coverage. It provides the best combination of quality, speed, and reliability for these specific tasks.

Cost Estimate

For a typical workload (~50M tokens/month, 60% input / 40% output), the cheapest qualifying model (Claude Haiku 4) costs approximately $130.00/month. The most capable model may cost more but delivers higher quality results.

Price vs Quality for Summarization

Anthropic
Cohere
Google
Openai

Top 5 Models Compared

RankModelProviderInput $/MOutput $/MArena ELOSpeed (tok/s)
#1Claude Sonnet 4Anthropic$3.00$15.00128078
#2Gemini 2.5 ProGoogle$1.25$10.00143070
#3GPT-4oOpenAI$2.50$10.00126095
#4Claude Haiku 4Anthropic$1.00$5.001220130
#5GPT-4.1OpenAI$2.00$8.00129088
#1Claude Sonnet 4
Anthropic
ELO 1280
Input

$3.00/M

Output

$15.00/M

VisionJSON ModeFunctionsMultimodal
#2Gemini 2.5 Pro
Google
ELO 1430
Input

$1.25/M

Output

$10.00/M

VisionJSON ModeFunctionsMultimodalCode Exec
#3GPT-4o
OpenAI
ELO 1260
Input

$2.50/M

Output

$10.00/M

VisionJSON ModeFunctionsMultimodalCode Exec
#4Claude Haiku 4
Anthropic
ELO 1220
Input

$1.00/M

Output

$5.00/M

VisionJSON ModeFunctionsMultimodal
#5GPT-4.1
OpenAI
ELO 1290
Input

$2.00/M

Output

$8.00/M

VisionJSON ModeFunctionsMultimodalCode Exec
#6Command R+
Cohere
ELO 1200
Input

$2.50/M

Output

$10.00/M

JSON ModeFunctions

Other Categories