UseRightAI
Cut through AI hype. Pick what works.

Independent AI model tracker. Live pricing, real benchmarks, zero vendor bias.



© 2026 UseRightAI. Independent · Free forever · Not affiliated with any AI provider.

Affiliate links are clearly labeled. See disclosures.


Data verified April 2026

AI Context Window Comparison

Context windows on the models we track range from 128K to 2M tokens. The right size depends on your task; here is exactly what each model supports and when a larger context actually matters.

Largest context: 2M tokens (Gemini 3.1 Pro, Grok 4)
Smallest tracked: 128K tokens (budget models)
≥ 128K context: 20 models (handles large files)
≥ 1M context: 7 models (full codebases)


Full comparison table

Model | Provider | Context | ~Pages | Tier | Input $/M
Gemini 3.1 Pro | Google | 2M | ~6,000 | Premium | $2.00
Grok 4 | xAI | 2M | ~6,000 | Balanced | $2.00
GPT-5.5 | OpenAI | 1M | ~3,000 | Premium | $5.00
Claude Opus 4.7 | Anthropic | 1M | ~3,000 | Premium | $5.00
Claude Opus 4.6 | Anthropic | 1M | ~3,000 | Premium | $15.00
Claude Sonnet 4.6 | Anthropic | 1M | ~3,000 | Premium | $3.00
Gemini 3.1 Flash | Google | 1M | ~3,000 | Budget | $0.50
Llama 4 Scout | Meta | 512K | ~1,536 | Budget | $0.50
GPT-5.4 | OpenAI | 272K | ~816 | Premium | $2.50
Llama 4 Maverick | Meta | 256K | ~768 | Budget | $0.60
Codestral 25.01 | Mistral | 256K | ~768 | Budget | $0.90
GPT-5.2 | OpenAI | 200K | ~600 | Premium | $12.00
Claude 4 Haiku | Anthropic | 200K | ~600 | Budget | $0.80
GPT-5.2 Mini | OpenAI | 128K | ~384 | Balanced | $1.20
GPT-4o | OpenAI | 128K | ~384 | Balanced | $5.00
Mistral Large 2 | Mistral | 128K | ~384 | Balanced | $3.00
DeepSeek V3 | DeepSeek | 128K | ~384 | Budget | $0.27
DeepSeek R1 | DeepSeek | 128K | ~384 | Budget | $0.55
GPT-4o Mini | OpenAI | 128K | ~384 | Budget | $0.15
Mistral Small 3.1 | Mistral | 128K | ~384 | Budget | $0.10

The pages estimate assumes 250 words per page and 0.75 words per token.
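The conversion behind the ~Pages column is simple arithmetic. A minimal sketch in Python (the function name is ours) using the same assumptions as the table:

```python
# Convert a context window size in tokens to an approximate page count,
# using the table's assumptions: 0.75 words per token, 250 words per page.
def tokens_to_pages(tokens: int, words_per_token: float = 0.75,
                    words_per_page: int = 250) -> int:
    return round(tokens * words_per_token / words_per_page)

print(tokens_to_pages(1_000_000))  # 3000 pages for a 1M-token window
print(tokens_to_pages(272_000))    # 816 pages for GPT-5.4's 272K window
```

Swapping in different words-per-page or words-per-token assumptions shifts every estimate proportionally, which is why the column is marked approximate.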

How much context do you actually need?

Most tasks don't need 1M tokens. Here's a practical guide by task type.

Quick tasks

Up to 4K tokens

Drafting a short email, answering a question, writing a code function. Any model works.

GPT-5.2 Mini · Gemini 3.1 Flash · Mistral Small 3.1

Professional documents

8K–32K tokens

10–30 page reports, PR reviews, meeting transcripts, blog post drafts.

Claude Sonnet 4.6 · GPT-5.4 · Gemini 3.1 Pro

Large files and research

32K–128K tokens

Book chapters, large technical specs, detailed research papers, mid-size codebases.

Claude Sonnet 4.6 · GPT-5.4 · Gemini 3.1 Pro · Llama 4 Maverick

Entire codebases

128K–1M tokens

Refactoring full projects, analyzing complete repositories, multi-document synthesis.

Claude Sonnet 4.6 (1M) · Gemini 3.1 Pro (2M)

Frequently asked questions

What is a context window in AI?

A context window is the maximum amount of text an AI model can read and consider at once: your prompt, the model's previous responses, and any documents you include. It's measured in tokens (roughly 4 characters or 0.75 words each). A 1M token context window can hold approximately 750,000 words, or about 3,000 pages.

How many tokens is a typical document?

A short email is 100–300 tokens. A 10-page PDF is roughly 5,000–8,000 tokens. A full book chapter is 10,000–20,000 tokens. A complete novel is 100,000–200,000 tokens. A large codebase can be 200,000–800,000 tokens. For most tasks, even a 32K window is sufficient — 1M context matters for entire codebases or book-length research.
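These figures can be approximated from raw text with the common ~4-characters-per-token rule of thumb. A minimal sketch (an estimate only; exact counts depend on each model's tokenizer):

```python
def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token estimate using the ~4 characters/token rule of thumb.
    Real tokenizers vary by model; use the provider's tokenizer for billing."""
    return round(len(text) / chars_per_token)

email = "Hi team, quick update: the April data refresh is live. " * 10
print(estimate_tokens(email))  # rough count; a real tokenizer may differ by ~20%
```

For code, JSON, or non-English text the characters-per-token ratio drifts noticeably, so treat this only as a sizing sanity check before choosing a context window.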

Which AI model has the longest context window?

Gemini 3.1 Pro (Google) and Grok 4 (xAI) both offer 2M token context windows, the longest we currently track. That is enough to fit multiple books or an entire medium-sized codebase in a single prompt. Five more models, including GPT-5.5, Claude Opus 4.7, and Claude Sonnet 4.6, reach 1M.

Does a longer context window mean better performance?

Not necessarily. A large context window allows more input, but models can struggle to use information from the very middle of very long contexts (the 'lost in the middle' problem). Claude Sonnet 4.6 is specifically noted for maintaining coherence across its full 1M context. Always test on your actual task.

When do I actually need a large context window?

You need a large context window when working with: entire codebases for refactoring or debugging, long legal or research documents, multi-turn conversations that must retain full history, or when combining multiple large files in one prompt. For single-file tasks or conversational use, 32K–128K is more than enough.
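To check whether a whole repository fits before you pick a model, you can sum a rough token estimate over its source files. A minimal sketch in Python (the ~4 chars/token heuristic and the extension list are our assumptions):

```python
from pathlib import Path

def estimate_codebase_tokens(root: str,
                             exts=(".py", ".js", ".ts", ".go")) -> int:
    """Sum a rough token estimate (~4 chars/token) over source files under root."""
    total_chars = 0
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix in exts:
            total_chars += len(path.read_text(errors="ignore"))
    return total_chars // 4

# Example: if estimate_codebase_tokens("my_project") is under 128_000,
# every model in the table fits the whole repo; past ~1M tokens, even the
# largest windows need the repo split into chunks.
```

Leave headroom for the model's response and conversation history, which count against the same window as your input.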

What is GPT-5.4's context window?

GPT-5.4 has a 272K token context window: large enough for most professional tasks, but about 3.7× smaller than the 1M windows of Claude Sonnet 4.6 and GPT-5.5, and over 7× smaller than Gemini 3.1 Pro's 2M.

Related comparisons

AI capability matrix · Best long-context AI · Benchmark scores · API pricing · All AI models · Find my AI