UseRightAI
UseRightAI logo
HomeModelsComparePricingWhat's New
UseRightAI
Cut through AI hype. Pick what works.
UseRightAI logo
Cut through AI hype. Pick what works.

Independent AI model tracker. Live pricing, real benchmarks, zero vendor bias.

X (Twitter)LinkedInUpdatesContact

Compare

ChatGPT vs ClaudeGPT-4o vs Claude SonnetClaude vs GeminiDeepSeek vs ChatGPTMistral vs ClaudeGemini Flash vs GPT-4o MiniLlama vs ChatGPTBuild your own →

Best For

CodingWritingDevelopersProduct ManagersDesignersSalesBest Cheap AIBest Free AI

Pricing & Data

API Token PricingPrice HistoryBenchmark ScoresPrivacy & SafetySubscription PlansCost CalculatorWhich AI is Cheapest?

Company

About UseRightAIContactWhat ChangedAll ModelsDisclosuresPrivacy PolicyTerms of Service

© 2026 UseRightAI. Independent · Free forever · Not affiliated with any AI provider.

Affiliate links are clearly labeled. See disclosures.

San Francisco, CA · Founded 2015

OpenAI

The lab that started the frontier AI race.

OpenAI created GPT-4o, GPT-5.4, and GPT-5.5 — the models powering ChatGPT and thousands of enterprise products. Their API is the most widely used in the industry.

Rankings refresh dailyScored on 6 criteriaNo paid rankings
  • GPT-5.5 scores 82.7% on Terminal-Bench for agentic workflows
  • GPT-5.4 is the only frontier model with native desktop computer-use via API
  • ChatGPT has over 300 million weekly active users
46 models

All OpenAI Models

Every OpenAI model in the directory, ranked by overall capability score.

OpenAIBalanced

OpenAI: GPT-5

GPT-5 is OpenAI's flagship multimodal model, superseding GPT-4o with significantly improved reasoning, instruction-following, and knowledge breadth. It handles text, images, and complex multi-step tasks with state-of-the-art performance across most benchmarks.

Verdict
OpenAI's best general-purpose model — a strong flagship pick that punches above its price on input costs while delivering top-tier reasoning and multimodal capability.
Quality score
87%
Pricing

OpenAI API Pricing

Per 1 million tokens. Updated when providers change prices.

ModelInput / 1MOutput / 1MContextSpeed
OpenAI: GPT-5
Balanced
$30.00/1M$180.00/1M400KBalanced
GPT-5.5
Premium
$5.00/1M$30.00/1M1MBalanced
GPT-5.4
Premium
$0.20/1M$1.25/1M272KBalanced
OpenAI: GPT-5 Image
Premium
$10.00/1M$10.00/1M400KBalanced
OpenAI: GPT-5 Image Mini
Balanced

Compare OpenAI Models

Head-to-head comparisons for the most-searched questions.

OpenAI: GPT-5 vs GPT-5.5OpenAI: GPT-5 vs GPT-5.4GPT-5.5 vs GPT-5.4GPT-5.5 vs OpenAI: GPT-5 ImageGPT-5.4 vs OpenAI: GPT-5 ImageGPT-5.4 vs OpenAI: GPT-5 Image MiniOpenAI: GPT-5 Image vs OpenAI: GPT-5 Image MiniOpenAI: GPT-5 Image vs GPT-4o MiniOpen compare tool →

Newsletter

Get notified when OpenAI releases new models

Pricing changes, new releases, and ranking shifts — straight to your inbox.

No spam. Useful updates only. Affiliate disclosures always clearly labeled.

OpenAI FAQ

What is OpenAI's best model in 2026?

GPT-5.5 is OpenAI's most capable model in 2026 — it leads on Terminal-Bench (82.7%) and integrates with Codex for agentic workflows. GPT-5.4 is the only model that can control a desktop via API. GPT-5.2 is the best value OpenAI model for most teams.

How much does OpenAI's API cost?

GPT-5.5 costs $5/1M input and $30/1M output tokens. GPT-5.4 is $2.50/$15. GPT-5.2 is $1.50/$7.50. GPT-5.2 Mini is $0.15/$0.60 — the cheapest OpenAI option. ChatGPT Plus subscription is $20/month for consumer access.

Is OpenAI better than Anthropic?

It depends on the task. Claude Opus 4.7 (Anthropic) leads OpenAI on SWE-Bench Pro coding at 64.3% vs GPT-5.5's 58.6%. GPT-5.4 and GPT-5.5 are stronger for OpenAI-native agentic workflows and computer-use. For writing and general use, Claude Sonnet 4.6 and GPT-5.4 are closely matched.

Does OpenAI have open-source models?

No — all current OpenAI models (GPT-5.x, GPT-4o series) are closed-weight and available only through their API or ChatGPT. For open-weight alternatives, see Meta's Llama 4, Mistral, or DeepSeek models.

Explore other providers

AnthropicGooglexAIMetaMistralDeepSeekBrowse all models →
$30.00/1M in
$180.00/1M out
Speed
Balanced
Best for high-stakes professional tasks requiring deep reasoning, precise instruction-following, and reliable multimodal understanding.
Context
400k tokens
Pricing is asymmetric: cheap on input ($1.25/1M) but expensive on output ($10/1M), so it favors read-heavy or summarization tasks over verbose generation. The 400K context window is one of the largest available at this price tier. Supersedes GPT-4o, which remains available at lower cost for lighter workloads.
FlagshipMultimodalLong ContextOpenAIReasoning
Best for
High-stakes professional tasks requiring deep reasoning, precise instruction-following, and reliable multimodal understanding.
View model
OpenAIPremium

GPT-5.5

OpenAI's latest agentic flagship for coding, research, computer-use workflows, and long multi-step knowledge work.

Verdict
Best OpenAI flagship for agentic coding, research, and computer-use work.
Quality score
92%
Pricing
$5.00/1M in
$30.00/1M out
Speed
Balanced
Best for agentic coding, computer-use workflows, and complex research tasks
Context
1M tokens
Ranked from public benchmark and pricing data verified April 26, 2026: SWE-Bench Pro 58.6%, Terminal-Bench 2.0 82.7%, $5/$30 per 1M tokens, 1M API context.
AgenticCodingComputer useLong contextPremium
Best for
Agentic coding, computer-use workflows, and complex research tasks
View model
OpenAIPremium

GPT-5.4

OpenAI's latest flagship with unique desktop-control capabilities — it can see your screen, click, and navigate apps via the API.

Verdict
Best for agentic automation and desktop control workflows.
Quality score
84%
Pricing
$0.20/1M in
$1.25/1M out
Speed
Balanced
Best for agentic workflows, desktop automation, and complex multi-step reasoning
Context
272k tokens
Unique value is the computer-use capability. If you're building agents that operate software, nothing else compares right now.
AgenticDesktop controlReasoningPremium
Best for
Agentic workflows, desktop automation, and complex multi-step reasoning
View model
OpenAIPremium

OpenAI: GPT-5 Image

GPT-5 Image is OpenAI's multimodal flagship optimized for deep visual understanding and generation tasks, built on the GPT-5 architecture with a 400K context window. It supersedes GPT-4o with significantly improved image reasoning, analysis, and generation capabilities.

Verdict
OpenAI's most capable eye for visuals, but you'll pay a premium over equally capable rivals.
Quality score
79%
Pricing
$10.00/1M in
$10.00/1M out
Speed
Balanced
Best for complex workflows combining visual analysis, image generation, and long-document understanding in a single model call.
Context
400k tokens
Flat $10/1M input and output pricing is unusual — most flagship models charge more for output tokens. Verify whether image token costs (typically higher per effective token) are included under this pricing or billed separately, as OpenAI historically charges additional fees for image inputs.
MultimodalImage AILong ContextOpenAIPremium
Best for
Complex workflows combining visual analysis, image generation, and long-document understanding in a single model call.
View model
OpenAIBalanced

OpenAI: GPT-5 Image Mini

GPT-5 Image Mini is OpenAI's mid-tier multimodal model optimized for image understanding and generation tasks at a balanced price point. It supersedes GPT-4o with improved visual reasoning capabilities while maintaining a large 400K context window.

Verdict
A capable multimodal workhorse for image-heavy workflows that don't justify full GPT-5 flagship pricing.
Quality score
72%
Pricing
$2.50/1M in
$2.00/1M out
Speed
Fast
Best for teams needing strong image analysis and generation integrated with text workflows at a reasonable cost.
Context
400k tokens
Output cost of $2/1M tokens is unusual — lower than input cost, which favors use cases with long inputs but short outputs like image captioning or document summarization. Verify image generation token pricing separately, as image outputs are often billed differently by OpenAI.
MultimodalImage GenerationLong ContextBalanced PriceGPT-5 Family
Best for
Teams needing strong image analysis and generation integrated with text workflows at a reasonable cost.
View model
OpenAIBudget

GPT-4o Mini

OpenAI's most affordable production-grade model — faster and cheaper than GPT-4o with strong enough performance for the majority of everyday tasks.

Verdict
OpenAI's fastest, cheapest option for everyday high-volume tasks.
Quality score
65%
Pricing
$0.15/1M in
$0.60/1M out
Speed
Very fast
Best for high-volume everyday tasks where gpt-4o quality is overkill
Context
128k tokens
GPT-4o Mini punches well above its price for classification, summarisation, and simple writing. It struggles when tasks get complex.
BudgetFastOpenAIHigh volume
Best for
High-volume everyday tasks where GPT-4o quality is overkill
View model
OpenAIBalanced

OpenAI: GPT-5.1

GPT-5.1 is OpenAI's mid-tier flagship model, succeeding GPT-4o with improved reasoning, instruction-following, and a 400K context window at a competitive price point. It sits between GPT-4o and full GPT-5 in capability and cost.

Verdict
A solid, practical upgrade over GPT-4o that hits the sweet spot between capability and cost — but not the best in any single category.
Quality score
76%
Pricing
$1.25/1M in
$10.00/1M out
Speed
Balanced
Best for teams needing reliable, high-quality outputs across coding, writing, and analysis without paying premium gpt-5 prices.
Context
400k tokens
Pricing structure heavily favors input-heavy use cases like RAG and retrieval. The $10/1M output cost makes it expensive for long-form generation at scale. Context window of 400K is competitive but not best-in-class against Gemini 3.1 Pro's 1M+ window.
Mid-tier flagshipLong contextAgenticOpenAIBalanced cost
Best for
Teams needing reliable, high-quality outputs across coding, writing, and analysis without paying premium GPT-5 prices.
View model
OpenAIPremium

GPT-5.2

Reliable OpenAI flagship for serious coding and product work — a strong default before GPT-5.4 was released.

Verdict
Capable but outclassed — GPT-5.4 is now cheaper and better.
Quality score
79%
Pricing
$1.75/1M in
$14.00/1M out
Speed
Balanced
Best for serious coding and complex product work
Context
200k tokens
Worth considering only if you have existing integrations built around this model.
Former top pickCodingReasoningPremium
Best for
Serious coding and complex product work
View model
OpenAIBalanced

OpenAI: GPT-4.1

GPT-4.1 is OpenAI's refined successor to GPT-4o, offering sharper instruction-following, stronger coding performance, and a massive 1M token context window at a mid-tier price point. It targets developers and power users who need reliable, precise outputs without paying flagship reasoning model prices.

Verdict
The sharpest everyday workhorse in OpenAI's lineup, best when you need precise instructions met over long documents or complex codebases.
Quality score
76%
Pricing
$2.00/1M in
$8.00/1M out
Speed
Balanced
Best for developers and researchers needing accurate instruction-following and long-document analysis at a cost-efficient rate.
Context
1.0M tokens
Priced at $2/1M input and $8/1M output tokens — cheaper than GPT-4o at launch. The 1M context window is real but performance near the ceiling is less tested than Gemini's equivalent. No built-in image generation or voice modality.
Long ContextInstruction-FollowingCodingBalanced PriceGPT-4 Series
Best for
Developers and researchers needing accurate instruction-following and long-document analysis at a cost-efficient rate.
View model
OpenAIBalanced

GPT-4o

Versatile multimodal model that handles image-related workflows and mixed-media prompts well.

Verdict
Best all-around pick for image-heavy and multimodal workflows.
Quality score
67%
Pricing
$0.15/1M in
$0.60/1M out
Speed
Fast
Best for multimodal tasks and image-adjacent workflows
Context
128k tokens
Strong when your work lives between visuals, messaging, and product context.
ImagesMultimodalCreative
Best for
Multimodal tasks and image-adjacent workflows
View model
OpenAIBalanced

GPT-5.2 Mini

Lower-cost OpenAI model that keeps a solid balance of usefulness, speed, and affordability for everyday tasks.

Verdict
Solid OpenAI budget option, though Gemini Flash offers better value.
Quality score
67%
Pricing
$1.20/1M in
$4.80/1M out
Speed
Fast
Best for budget technical workflows and high-volume product integrations
Context
128k tokens
Best when you specifically need an OpenAI model in your stack.
Budget codingFastOpenAI
Best for
Budget technical workflows and high-volume product integrations
View model
OpenAIBudget

OpenAI: GPT-5 Mini

GPT-5 Mini is OpenAI's budget-tier distillation of GPT-5, designed for high-volume, cost-sensitive tasks that don't require full flagship reasoning depth. It supersedes GPT-4o with improved instruction following and a massively expanded 400K context window at a fraction of the cost.

Verdict
The new budget default for OpenAI API users: faster, cheaper, and smarter than GPT-4o with a context window that punches well above its price tier.
Quality score
66%
Pricing
$0.25/1M in
$2.00/1M out
Speed
Very fast
Best for high-volume production workloads — chatbots, summarization pipelines, and document q&a — where cost efficiency matters more than peak reasoning.
Context
400k tokens
Output cost of $2/1M tokens is higher than some competing budget models (Gemini Flash at ~$0.60/1M output). At scale, output-heavy tasks may erode cost advantages — monitor token ratios carefully. Supersedes GPT-4o, which may be deprecated on a rolling basis.
BudgetFastLong ContextHigh VolumeOpenAI
Best for
High-volume production workloads — chatbots, summarization pipelines, and document Q&A — where cost efficiency matters more than peak reasoning.
View model
OpenAIBudget

OpenAI: GPT-4.1 Mini

GPT-4.1 Mini is OpenAI's cost-optimized small model from the GPT-4.1 family, designed to deliver strong instruction-following and coding performance at a fraction of flagship pricing. It targets high-volume, latency-sensitive applications where cost efficiency matters more than peak capability.

Verdict
The go-to budget workhorse for high-volume OpenAI API users who need GPT-4.1 quality at GPT-3.5 prices.
Quality score
65%
Pricing
$0.40/1M in
$1.60/1M out
Speed
Very fast
Best for high-volume production workloads that need reliable gpt-4-class instruction following without flagship pricing.
Context
1.0M tokens
Pricing shown is $0.40 input / $1.60 output per 1M tokens. Cached input tokens are significantly cheaper. The 1M token context window is a standout feature at this price tier — few competitors match it. Supersedes GPT-4o as the recommended default for cost-conscious applications.
BudgetFastLong ContextOpenAIProduction
Best for
High-volume production workloads that need reliable GPT-4-class instruction following without flagship pricing.
View model
OpenAIBalanced

OpenAI: GPT-5 Chat

GPT-5 Chat is OpenAI's flagship conversational model, succeeding GPT-4o with improved reasoning, instruction-following, and multimodal capabilities. It targets professional and enterprise use cases where output quality matters more than cost.

Verdict
A polished, capable flagship that earns its place but faces stiff competition at its price point.
Quality score
75%
Pricing
$1.25/1M in
$10.00/1M out
Speed
Balanced
Best for complex professional tasks requiring nuanced reasoning, strong writing quality, and reliable instruction-following across long conversations.
Context
128k tokens
Pricing is asymmetric — input is relatively affordable at $1.25/1M but output at $10/1M can accumulate quickly in agentic or verbose-output workflows. Cached input pricing may apply through the OpenAI API. Not to be confused with GPT-5 reasoning variants (o-series) which use chain-of-thought and have separate pricing.
FlagshipMultimodalOpenAIProfessionalGPT-5
Best for
Complex professional tasks requiring nuanced reasoning, strong writing quality, and reliable instruction-following across long conversations.
View model
OpenAIBalanced

OpenAI: o4 Mini

o4 Mini is OpenAI's compact reasoning model that applies chain-of-thought thinking to complex problems at a fraction of the cost of o4. It delivers strong mathematical, coding, and logical reasoning capabilities while remaining accessible to developers on tighter budgets.

Verdict
The most cost-efficient reasoning model for serious STEM and coding workloads.
Quality score
70%
Pricing
$1.10/1M in
$4.40/1M out
Speed
Deliberate
Best for developers and analysts who need serious reasoning power for stem tasks without paying full o4 or o3 prices.
Context
200k tokens
Priced at $1.1/$4.4 per 1M tokens (input/output), o4 Mini is significantly cheaper than o3 ($10/$40) and o4. Output tokens are 4x the input price, so verbose reasoning traces can add up — use max_completion_tokens limits in production pipelines.
ReasoningSTEMBudget-FriendlyLong ContextCoding
Best for
Developers and analysts who need serious reasoning power for STEM tasks without paying full o4 or o3 prices.
View model
OpenAIPremium

OpenAI: GPT-5 Pro

GPT-5 Pro is OpenAI's most capable flagship model, designed for complex reasoning, advanced coding, and high-stakes professional tasks. It supersedes GPT-4o with substantially improved intelligence at a premium price point reflecting its top-tier positioning.

Verdict
The most capable model OpenAI offers, but the steep output cost means it's only justifiable for genuinely high-stakes, complex tasks.
Quality score
84%
Pricing
$15.00/1M in
$120.00/1M out
Speed
Deliberate
Best for demanding professional workflows requiring deep reasoning, nuanced writing, and sophisticated multi-step problem solving where cost is secondary to quality.
Context
400k tokens
Output cost of $120/1M tokens is exceptionally high and will compound quickly in agentic or multi-turn workflows. Budget carefully. Context window of 400K is generous but falls short of Gemini 3.1 Pro's 1M+ offering for ultra-long document tasks.
FlagshipPremiumDeep ReasoningLong ContextOpenAI
Best for
Demanding professional workflows requiring deep reasoning, nuanced writing, and sophisticated multi-step problem solving where cost is secondary to quality.
View model
OpenAIBudget

OpenAI: GPT-5.1-Codex-Mini

GPT-5.1-Codex-Mini is OpenAI's budget-tier coding-specialized model built on the GPT-5.1 architecture, optimized for code generation, completion, and debugging at low cost. It offers a 400K context window, making it practical for large codebases without the price tag of flagship models.

Verdict
The sharpest budget coding model available if you need speed, volume, and a long context window without breaking your API budget.
Quality score
63%
Pricing
$0.25/1M in
$2.00/1M out
Speed
Very fast
Best for high-volume code generation, autocomplete pipelines, and developer tooling where cost efficiency matters more than peak reasoning depth.
Context
400k tokens
At $2/1M output tokens, costs can accumulate in verbose code-generation tasks — monitor output token usage carefully in agentic loops. Not a general-purpose flagship replacement; best deployed alongside a stronger model for planning/reasoning layers.
CodingBudgetLong ContextFastCodex
Best for
High-volume code generation, autocomplete pipelines, and developer tooling where cost efficiency matters more than peak reasoning depth.
View model
OpenAIBalanced

OpenAI: o3 Mini

OpenAI's o3 Mini is a compact reasoning model optimized for STEM tasks, offering chain-of-thought capabilities at a fraction of the cost of o3. It excels at math, coding, and logical problem-solving while maintaining a large 200K context window.

Verdict
The most cost-efficient way to access serious chain-of-thought reasoning for STEM and coding work.
Quality score
68%
Pricing
$1.10/1M in
$4.40/1M out
Speed
Deliberate
Best for cost-effective deep reasoning on math, code, and structured logic problems where o3's full price isn't justified.
Context
200k tokens
Supports three reasoning effort settings via the API (low, medium, high), which significantly affect latency and token usage. No vision/image input support. Available via OpenAI API and ChatGPT Plus.
ReasoningSTEMCodingBudget-FriendlyChain-of-Thought
Best for
Cost-effective deep reasoning on math, code, and structured logic problems where o3's full price isn't justified.
View model
OpenAIBalanced

OpenAI: o4 Mini High

o4 Mini High is OpenAI's compact reasoning model running at its maximum reasoning effort setting, trading speed for deeper multi-step logical analysis. It applies extended chain-of-thought processing to complex problems while remaining significantly cheaper than full o3 or o4 class flagships.

Verdict
Maximum-effort reasoning at mid-tier pricing — excellent for hard problems, overkill for everything else.
Quality score
70%
Pricing
$1.10/1M in
$4.40/1M out
Speed
Deliberate
Best for developers and researchers who need strong reasoning accuracy on hard stem, math, or logic problems without paying full o3 pricing.
Context
200k tokens
The 'High' suffix denotes maximum reasoning effort, distinct from o4 Mini (balanced) and o4 Mini Low. Higher effort means higher token consumption in internal reasoning traces, which can push effective cost above the stated $1.1/$4.4 per million for very complex queries. No image generation capability.
reasoningSTEMcost-efficientlong-contextcoding
Best for
Developers and researchers who need strong reasoning accuracy on hard STEM, math, or logic problems without paying full o3 pricing.
View model
OpenAIBalanced

OpenAI: o3

OpenAI's o3 is a frontier reasoning model that uses extended chain-of-thought to solve complex problems in math, science, coding, and logic. It represents a significant step up from o1 in reasoning depth and accuracy.

Verdict
The go-to model when you need the right answer, not the fast answer.
Quality score
73%
Pricing
$2.00/1M in
$8.00/1M out
Speed
Deliberate
Best for tackling hard technical problems — from competition-level math to multi-step code debugging — where accuracy matters more than speed.
Context
200k tokens
Pricing at $2/$8 per 1M input/output tokens is moderate for a reasoning model, but long internal reasoning traces can significantly inflate output token counts. Not available via all API tiers — check OpenAI access levels.
ReasoningMathCodingFrontierChain-of-thought
Best for
Tackling hard technical problems — from competition-level math to multi-step code debugging — where accuracy matters more than speed.
View model
OpenAIBalanced

OpenAI: GPT-5.1-Codex

GPT-5.1-Codex is OpenAI's coding-specialized flagship model, purpose-built for software development tasks with a massive 400K context window. It supersedes GPT-4o with deeper code comprehension, multi-file reasoning, and tighter integration with developer workflows.

Verdict
The go-to model for large-codebase engineering tasks, but expensive output costs limit its appeal for high-throughput pipelines.
Quality score
70%
Pricing
$1.25/1M in
$10.00/1M out
Speed
Balanced
Best for professional software engineers who need a high-capacity model for large codebase analysis, complex refactoring, and multi-file code generation.
Context
400k tokens
Asymmetric pricing ($1.25 input / $10 output) rewards read-heavy workflows like code review and repo analysis over generation-heavy tasks. The 400K context window is among the largest in the balanced price tier. No image input/output support confirmed at launch.
CodingLarge ContextDeveloperOpenAIFlagship
Best for
Professional software engineers who need a high-capacity model for large codebase analysis, complex refactoring, and multi-file code generation.
View model
OpenAIBalanced

OpenAI: o3 Mini High

o3 Mini High is OpenAI's compact reasoning model running at maximum reasoning effort, delivering deep chain-of-thought problem-solving in a cost-efficient package. It specializes in STEM tasks — math, coding, and logic — where extended deliberation yields significantly better results than standard chat models.

Verdict
The best bang-for-buck reasoning model for STEM and coding tasks that can tolerate slow response times.
Quality score
66%
Pricing
$1.10/1M in
$4.40/1M out
Speed
Deliberate
Best for solving hard math, competitive programming, and multi-step logical reasoning problems where accuracy matters more than speed.
Context
200k tokens
The 'High' suffix refers to the reasoning_effort parameter set to 'high', which increases token usage and latency significantly versus o3 Mini at medium or low effort. Priced at $1.1/$4.4 per million tokens, it is far cheaper than o1 ($15/$60) and full o3, making it attractive for batch workloads.
ReasoningSTEMCodingBudget-FriendlyChain-of-Thought
Best for
Solving hard math, competitive programming, and multi-step logical reasoning problems where accuracy matters more than speed.
View model
OpenAIBalanced

OpenAI: GPT-5 Codex

GPT-5 Codex is OpenAI's specialized coding-focused evolution of GPT-5, designed for software development tasks with a massive 400K context window for handling large codebases. It bridges the gap between raw language capability and developer-specific tooling, succeeding GPT-4o as OpenAI's primary coding workhorse.

Verdict
A serious coding model with repository-scale context that earns its place in any developer's toolkit.
Quality score
68%
Pricing
$1.25/1M in
$10.00/1M out
Speed
Balanced
Best for professional developers who need to reason across large codebases, generate production-ready code, and debug complex multi-file projects.
Context
400k tokens
The $10/1M output cost means heavy code generation workloads can get expensive fast — budget carefully for bulk generation use cases. Context window of 400K is among the largest in its price tier. Supersedes GPT-4o, so existing GPT-4o coding workflows should consider migrating for improved performance.
CodingLarge CodebaseOpenAIDeveloper ToolLong Context
Best for
Professional developers who need to reason across large codebases, generate production-ready code, and debug complex multi-file projects.
View model
OpenAIPremium

OpenAI: GPT-4 Turbo

GPT-4 Turbo is OpenAI's high-capability flagship model featuring a 128K context window, trained on data up to April 2024. It delivers strong reasoning, coding, and instruction-following across complex tasks.

Verdict
A capable but aging flagship that has been outpaced by cheaper, faster successors in OpenAI's own lineup.
Quality score
75%
Pricing
$10.00/1M in
$30.00/1M out
Speed
Balanced
Best for complex multi-step tasks requiring deep reasoning, long document analysis, or sophisticated code generation where cost is secondary to quality.
Context
128k tokens
GPT-4 Turbo is available via the OpenAI API. It has largely been succeeded by GPT-4o, which is faster, supports vision natively, and is cheaper. Organizations should evaluate whether migrating to GPT-4o or o3 makes more sense before building new workflows on this model.
128K contextGPT-4 classfunction callingOpenAIpremium
Best for
Complex multi-step tasks requiring deep reasoning, long document analysis, or sophisticated code generation where cost is secondary to quality.
View model
OpenAIBalanced

OpenAI: GPT-5.1-Codex-Max

GPT-5.1-Codex-Max is OpenAI's specialized coding-focused flagship model, built on the GPT-5 architecture with deep optimization for software development, code generation, and technical problem-solving. It supersedes GPT-4o with significantly improved code comprehension and a 400K context window suited for large codebases.

Verdict
The strongest choice for serious software engineering work, provided you can absorb the output-side pricing.
Quality score
70%
Pricing
$1.25/1M in
$10.00/1M out
Speed
Balanced
Best for professional developers and engineering teams working with complex, multi-file codebases who need accurate code generation, debugging, and architectural reasoning.
Context
400k tokens
Output cost of $10/1M tokens is the key budget consideration — input is competitively priced but output costs mirror GPT-4 Turbo-tier pricing. Best paired with a cheaper model for lightweight or repetitive coding subtasks. Context window of 400K is well-suited to monorepo analysis but verify token limits on your deployment tier.
CodingLarge ContextOpenAITechnicalFlagship
Best for
Professional developers and engineering teams working with complex, multi-file codebases who need accurate code generation, debugging, and architectural reasoning.
View model
OpenAIBalanced

OpenAI: GPT-5.3 Chat

GPT-5.3 Chat is OpenAI's mid-cycle refinement of the GPT-5 series, offering improved instruction-following and reasoning over GPT-5.2 at a balanced price point. It targets professionals needing strong general-purpose performance without paying flagship model premiums.

Verdict
A solid GPT-5 series refinement with strong reasoning, but its output pricing makes it hard to recommend over Claude Sonnet 4.6 unless you're OpenAI-first.
Quality score
71%
Pricing
$1.75/1M in
$14.00/1M out
Speed
Balanced
Best for professionals and developers who need reliable, high-quality text generation and reasoning at a cost that scales reasonably with usage.
Context
128k tokens
Output cost of $14/1M tokens is the primary budget consideration — workloads with high output-to-input ratios will accumulate costs quickly. No image generation capability. Supersedes GPT-5.2, which should be deprecated or deprioritized.
GPT-5 seriesOpenAImid-tier flagshipinstruction-followingreasoning
Best for
Professionals and developers who need reliable, high-quality text generation and reasoning at a cost that scales reasonably with usage.
View model
OpenAIBudget

OpenAI: GPT-5 Nano

GPT-5 Nano is OpenAI's smallest and fastest model in the GPT-5 family, optimized for high-throughput, low-latency tasks at near-minimal cost. It supersedes GPT-4o as the go-to option for lightweight inference at scale.

Verdict
The fastest and cheapest way into the GPT-5 ecosystem, built for scale rather than depth.
Quality score
58%
Pricing
$0.05/1M in
$0.40/1M out
Speed
Very fast
Best for high-volume, latency-sensitive applications like classification, autocomplete, summarization, and lightweight chat where cost-per-token matters most.
Context
400k tokens
Output cost of ~$0.40/1M tokens means output-heavy workloads (long generations) will accumulate cost faster than input-heavy ones. Best suited for tasks where outputs are short-to-medium length. No image generation capability.
BudgetFastHigh VolumeLong ContextGPT-5 Family
Best for
High-volume, latency-sensitive applications like classification, autocomplete, summarization, and lightweight chat where cost-per-token matters most.
View model
OpenAIPremium

OpenAI: o3 Pro

OpenAI's o3 Pro is the highest-tier reasoning model in the o3 family, designed for maximum accuracy on the most demanding intellectual tasks. It applies extended compute and deeper chain-of-thought reasoning to outperform standard o3 on math, science, coding, and complex analysis.

Verdict
The most powerful reasoning model OpenAI offers — but its extreme pricing means you should reach for it only when accuracy genuinely cannot be compromised.
Quality score
77%
Pricing
$20.00/1M in
$80.00/1M out
Speed
Deliberate
Best for elite-level reasoning tasks where accuracy is paramount and cost is not a constraint — graduate-level math, competitive programming, and rigorous scientific analysis.
Context
200k tokens
o3 Pro is only available via the OpenAI API and ChatGPT Pro subscription tier. Response times can range from tens of seconds to several minutes depending on problem complexity. Output pricing at $80/M tokens is 4x the cost of standard o3.
reasoningSTEMpremiumdeep thinkingflagship
Best for
Elite-level reasoning tasks where accuracy is paramount and cost is not a constraint — graduate-level math, competitive programming, and rigorous scientific analysis.
View model
OpenAIBudget

OpenAI: GPT-4.1 Nano

GPT-4.1 Nano is OpenAI's smallest and most cost-efficient model in the GPT-4.1 family, designed for high-throughput, latency-sensitive tasks at near-commodity pricing. It offers a 1M token context window at just $0.10/1M input tokens, making it one of the cheapest large-context models available.

Verdict
The best pick for budget-conscious, high-volume workloads that don't demand frontier intelligence.
Quality score
54%
Pricing
$0.10/1M in
$0.40/1M out
Speed
Very fast
Best for high-volume production workloads like classification, extraction, summarization, and simple q&a where cost and speed matter more than frontier reasoning.
Context
1.0M tokens
Pricing is $0.10/1M input and $0.40/1M output tokens. Officially supersedes GPT-4o in OpenAI's lineup for lightweight use cases. Context window of ~1.047M tokens is one of the largest available at this price tier.
BudgetFastLong ContextHigh VolumeOpenAI
Best for
High-volume production workloads like classification, extraction, summarization, and simple Q&A where cost and speed matter more than frontier reasoning.
View model
OpenAIBalanced

OpenAI: o4 Mini Deep Research

o4 Mini Deep Research is OpenAI's cost-efficient reasoning model specialized for autonomous multi-step research tasks, capable of browsing the web, synthesizing sources, and producing detailed research reports. It brings deep research capabilities to a mid-tier price point by trading some of o4's raw power for significantly lower inference costs.

Verdict
The pragmatic choice for automated deep research at scale — capable enough, priced right, but don't expect o4-level depth.
Quality score
61%
Pricing
$2.00/1M in
$8.00/1M out
Speed
Deliberate
Best for automated research pipelines that require web browsing, source synthesis, and structured report generation at scale without flagship-model costs.
Context
200k tokens
Deep Research mode requires agentic tool access (web browsing); pricing reflects token usage but research tasks can consume significant tokens across multi-step retrieval loops. Availability may depend on API tier or organizational access level. Not a drop-in replacement for the standard o4 Mini in general-purpose workflows.
Deep ResearchReasoningWeb BrowsingCost-EfficientLong Context
Best for
Automated research pipelines that require web browsing, source synthesis, and structured report generation at scale without flagship-model costs.
View model
OpenAIBalanced

OpenAI: GPT-5.1 Chat

GPT-5.1 Chat is OpenAI's mid-tier conversational model, positioned as a capable successor to GPT-4o with improved instruction-following, reasoning, and knowledge depth at a balanced price point.

Verdict
A reliable mid-tier upgrade over GPT-4o for instruction-heavy tasks, but the context window and output pricing limit its value against Sonnet-class competitors.
Quality score
67%
Pricing
$1.25/1M in
$10.00/1M out
Speed
Balanced
Best for teams and developers who need gpt-4o-level quality with incremental improvements in accuracy and instruction adherence without paying flagship model prices.
Context
128k tokens
Output cost of $10/1M tokens is asymmetric compared to the $1.25 input price — high-volume generation tasks will become expensive quickly. No vision or image generation confirmed based on available specs. Supersedes GPT-4o in the OpenAI lineup but does not replace o-series reasoning models.
BalancedGPT-5 FamilyInstruction-TunedAPI-ReadyMid-Tier
Best for
Teams and developers who need GPT-4o-level quality with incremental improvements in accuracy and instruction adherence without paying flagship model prices.
View model
OpenAIPremium

OpenAI: o1-pro

o1-pro is OpenAI's highest-tier reasoning model, running o1 with extended compute time for deeper, more reliable problem-solving on complex tasks. It is designed for users who need maximum accuracy and thoroughness over speed.

Verdict
The most powerful reasoning model available, but its extreme cost means it's only justified for the hardest problems where no other model will do.
Quality score
75%
Pricing
$150.00/1M in
$600.00/1M out
Speed
Deliberate
Best for solving the hardest math, science, and engineering problems where accuracy is non-negotiable and cost is secondary.
Context
200k tokens
o1-pro is available only via the OpenAI API and ChatGPT Pro subscription ($200/month). It does not support streaming and has longer latency than any other OpenAI model. Not suitable for high-volume workloads.
Max ReasoningUltra-PremiumResearch-GradeMath & ScienceHigh Accuracy
Best for
Solving the hardest math, science, and engineering problems where accuracy is non-negotiable and cost is secondary.
View model
OpenAIBalanced

OpenAI: GPT-5.3-Codex

GPT-5.3-Codex is OpenAI's specialized coding-focused model in the GPT-5 lineage, built for deep software engineering tasks including code generation, debugging, and repository-level reasoning. It succeeds GPT-5.2 with improved instruction-following for complex multi-file codebases and a significantly expanded 400K context window.

Verdict
The go-to model for large-codebase reasoning, but its output pricing makes it a considered rather than casual choice.
Quality score
65%
Pricing
$1.75/1M in
$14.00/1M out
Speed
Balanced
Best for professional developers tackling large-scale coding tasks, refactoring legacy codebases, or working across multi-file projects where deep context retention is critical.
Context
400k tokens
Priced asymmetrically with low input cost ($1.75/1M) and high output cost ($14/1M), which rewards concise prompting but penalizes verbose code generation. The 400K context window is one of the largest available at this price tier. Supersedes GPT-5.2 with improved multi-file coherence; users on GPT-5.2 should migrate. No multimodal input support confirmed at launch.
coding-specialistlarge-contextOpenAIGPT-5developer-tool
Best for
Professional developers tackling large-scale coding tasks, refactoring legacy codebases, or working across multi-file projects where deep context retention is critical.
View model
OpenAIPremium

OpenAI: GPT-4 Turbo Preview

GPT-4 Turbo Preview is an early access version of GPT-4 Turbo, OpenAI's then-flagship model featuring a 128K context window and knowledge improvements over the original GPT-4. It was designed to deliver GPT-4-class reasoning at reduced cost compared to the original GPT-4.

Verdict
A once-capable flagship now overshadowed by faster, cheaper, and smarter successors.
Quality score
67%
Pricing
$10.00/1M in
$30.00/1M out
Speed
Balanced
Best for complex multi-step reasoning, long-document analysis, and professional writing tasks requiring strong instruction-following.
Context
128k tokens
This is a 'preview' variant that OpenAI has largely deprecated in favor of gpt-4-turbo and gpt-4o. The endpoint may be retired or redirected by OpenAI without notice. Check the OpenAI model deprecation schedule before building production applications on this model.
GPT-4Long ContextLegacyPremiumOpenAI
Best for
Complex multi-step reasoning, long-document analysis, and professional writing tasks requiring strong instruction-following.
View model
OpenAIPremium

OpenAI: o3 Deep Research

OpenAI's o3 Deep Research is a reasoning-heavy model purpose-built for multi-step research tasks, capable of autonomously browsing the web, synthesizing sources, and producing detailed analytical reports. It combines o3's chain-of-thought reasoning with agentic tool use to tackle complex, open-ended research questions.

Verdict
The gold standard for autonomous AI research — if you can afford to run it.
Quality score
67%
Pricing
$10.00/1M in
$40.00/1M out
Speed
Deliberate
Best for conducting exhaustive, multi-source research that would take a human analyst hours to compile manually.
Context
200k tokens
Deep Research mode involves agentic tool calls and web browsing, which can multiply effective token costs significantly. Pricing is per token but real-world research sessions often consume large amounts of both. Available via ChatGPT Plus/Pro and API; API access may require higher usage tiers.
Deep ResearchAgenticReasoningPremiumWeb Browsing
Best for
Conducting exhaustive, multi-source research that would take a human analyst hours to compile manually.
View model
OpenAIPremium

OpenAI: GPT-4 Turbo (older v1106)

GPT-4 Turbo (v1106) is an older snapshot of OpenAI's flagship GPT-4 Turbo model released in November 2023, offering a 128K context window with strong general-purpose reasoning and instruction-following capabilities. It predates later GPT-4 Turbo updates and GPT-4o, making it a legacy choice for workflows locked to this specific version.

Verdict
A reliable but outdated GPT-4 snapshot that only makes sense when version pinning is a hard requirement.
Quality score
66%
Pricing
$10.00/1M in
$30.00/1M out
Speed
Balanced
Best for teams requiring a pinned, stable version of gpt-4 turbo for reproducible outputs in long-document analysis or complex instruction pipelines.
Context
128k tokens
This is a pinned model snapshot (v1106) and will not receive capability updates. OpenAI may deprecate older snapshots over time. Knowledge cutoff is April 2023. Not recommended for new deployments given the superior cost-performance of GPT-4o and GPT-4.1.
Legacy128K ContextPinned SnapshotGPT-4Premium
Best for
Teams requiring a pinned, stable version of GPT-4 Turbo for reproducible outputs in long-document analysis or complex instruction pipelines.
View model
OpenAIPremium

OpenAI: o1

OpenAI's o1 is a reasoning-focused model that uses chain-of-thought processing to tackle complex, multi-step problems in math, science, and coding. It deliberately 'thinks before answering,' trading speed for significantly improved accuracy on hard problems.

Verdict
The original deep-thinker that excels at hard reasoning problems, now overshadowed by newer o-series models but still formidable for complex STEM work.
Quality score
69%
Pricing
$15.00/1M in
$60.00/1M out
Speed
Deliberate
Best for solving complex reasoning tasks where accuracy matters more than response time, such as competitive programming, advanced mathematics, and rigorous scientific analysis.
Context
200k tokens
At $15 input / $60 output per 1M tokens, a single complex back-and-forth session can cost dollars. o1-mini is available at a fraction of the price for lighter reasoning tasks. OpenAI has since released o3 and o3-mini, which largely supersede o1 for most reasoning use cases.
ReasoningMathSciencePremiumChain-of-Thought
Best for
Solving complex reasoning tasks where accuracy matters more than response time, such as competitive programming, advanced mathematics, and rigorous scientific analysis.
View model
OpenAIBalanced

OpenAI: GPT Audio Mini

GPT Audio Mini is OpenAI's cost-efficient audio-capable model that handles real-time speech input and output alongside text, built on the GPT-4o Mini architecture. It's designed for voice-driven applications where low latency and affordable pricing matter more than peak intelligence.

Verdict
The most practical choice for cost-conscious voice application developers who need native audio I/O without compromising too much on intelligence.
Quality score
44%
Pricing
$0.60/1M in
$2.40/1M out
Speed
Fast
Best for building voice assistants, audio bots, and speech-enabled applications that need real-time audio processing at scale without breaking the budget.
Context
128k tokens
Audio tokens are priced differently from text tokens in OpenAI's API — audio input/output carries a significant premium over text tokens, so real-world costs for voice-heavy workloads will be substantially higher than the listed text token price suggests. Check OpenAI's audio token pricing separately.
AudioVoice AIReal-timeBudgetMultimodal
Best for
Building voice assistants, audio bots, and speech-enabled applications that need real-time audio processing at scale without breaking the budget.
View model
OpenAIBalanced

OpenAI: GPT Audio

GPT Audio is OpenAI's speech-capable model variant optimized for real-time audio input and output, enabling natural voice conversations and audio processing. It extends GPT-4o's multimodal capabilities with native audio understanding and generation without requiring separate transcription pipelines.

Verdict
The go-to choice for native voice AI applications, but overkill and potentially costly for anything without real audio requirements.
Quality score
43%
Pricing
$2.50/1M in
$10.00/1M out
Speed
Balanced
Best for building voice assistants, real-time spoken dialogue systems, and applications that need to process or generate natural speech end-to-end.
Context
128k tokens
Audio tokens are counted differently from text tokens — a few seconds of audio can consume hundreds of tokens, so monitor usage carefully. Real-time audio streaming requires WebSocket or Realtime API endpoints, not the standard Chat Completions API. Availability may be limited by tier or region.
Voice AIAudioMultimodalReal-timeSpeech
Best for
Building voice assistants, real-time spoken dialogue systems, and applications that need to process or generate natural speech end-to-end.
View model
OpenAIBudget

OpenAI: GPT-3.5 Turbo

GPT-3.5 Turbo is OpenAI's legacy fast and affordable chat model, optimized for dialogue and straightforward text tasks at low cost. It was the backbone of early ChatGPT and remains a go-to for high-volume, cost-sensitive deployments.

Verdict
A once-dominant budget model now outclassed by cheaper, smarter alternatives like GPT-4o mini.
Quality score
35%
Pricing
$0.50/1M in
$1.50/1M out
Speed
Very fast
Best for high-volume, low-complexity tasks like chatbots, classification, summarization, and simple q&a where cost matters more than cutting-edge quality.
Context
16k tokens
GPT-3.5 Turbo is still available via OpenAI API and supports fine-tuning, which keeps it relevant for teams with existing trained models. However, OpenAI has deprioritized its development in favor of the GPT-4o family. Not multimodal — text only.
BudgetLegacyFastHigh-volumeChatbot
Best for
High-volume, low-complexity tasks like chatbots, classification, summarization, and simple Q&A where cost matters more than cutting-edge quality.
View model
OpenAIBudget

OpenAI: gpt-oss-safeguard-20b

A 20-billion parameter open-weights safety-focused model from OpenAI, designed primarily for content moderation, policy enforcement, and safeguard classification tasks. It is purpose-built to detect harmful, policy-violating, or unsafe content rather than serve as a general-purpose assistant.

Verdict
A purpose-built safety classifier that's excellent at its narrow job and essentially useless outside it.
Quality score
27%
Pricing
$0.07/1M in
$0.30/1M out
Speed
Fast
Best for automated content moderation pipelines and safety classification at scale.
Context
131k tokens
This is an open-weights safety/moderation-specific model, not a general assistant. Pricing reflects its budget-tier positioning. Availability may be limited or subject to change as it appears to be a research/infrastructure model rather than a consumer product. Verify OpenAI's terms around usage and redistribution for the OSS weights.
Content ModerationSafetyOpen WeightsBudgetClassification
Best for
Automated content moderation pipelines and safety classification at scale.
View model
OpenAIPremium

OpenAI: GPT-4

GPT-4 is OpenAI's original flagship large language model, released in March 2023, offering strong reasoning and instruction-following across text tasks. It represents the foundational GPT-4 release before later variants like GPT-4 Turbo or GPT-4o improved speed, cost, and context length.

Verdict
A once-groundbreaking model now badly outclassed by cheaper, faster, and more capable successors — only use it if you have no choice.
Quality score
51%
Pricing
$30.00/1M in
$60.00/1M out
Speed
Balanced
Best for teams or workflows locked into the original gpt-4 api that require reliable, high-quality text reasoning without needing long context or multimodal input.
Context
8k tokens
At $30/$60 per million tokens, this is one of the most expensive text-only models available. The 8,191-token context window is a hard ceiling that makes it unsuitable for most document-processing tasks. OpenAI continues to offer it for API backward compatibility but actively recommends migrating to GPT-4o or GPT-4 Turbo. New projects should not default to this model.
Legacy flagshipText-onlyHigh costOpenAIGPT-4
Best for
Teams or workflows locked into the original GPT-4 API that require reliable, high-quality text reasoning without needing long context or multimodal input.
View model
OpenAIBalanced

OpenAI: GPT-3.5 Turbo 16k

GPT-3.5 Turbo 16k is OpenAI's extended-context variant of their older flagship chat model, offering double the context window of the base 3.5 Turbo at a higher price point. It handles general-purpose text tasks but has been largely superseded by newer, more capable models.

Verdict
An outdated model that's been lapped by cheaper, more capable competitors on every meaningful dimension.
Quality score
37%
Pricing
$3.00/1M in
$4.00/1M out
Speed
Fast
Best for legacy integrations or applications that need slightly longer documents processed without upgrading to a modern model.
Context
16k tokens
OpenAI has been gradually deprecating older GPT-3.5 variants. Availability may be limited or sunset in the future. At $3/$4 per million tokens, this is dramatically overpriced relative to its capability in 2024-2025.
LegacyExtended ContextGeneral PurposeAffordable
Best for
Legacy integrations or applications that need slightly longer documents processed without upgrading to a modern model.
View model
OpenAIBalanced

OpenAI: GPT-3.5 Turbo (older v0613)

An older versioned snapshot of GPT-3.5 Turbo (v0613), OpenAI's once-dominant mid-tier language model optimized for fast chat completions and instruction following. This specific checkpoint is frozen in time, predating later capability improvements introduced in subsequent GPT-3.5 Turbo updates.

Verdict
A once-useful workhorse now completely overshadowed by cheaper, more capable successors.
Quality score
31%
Pricing
$1.00/1M in
$2.00/1M out
Speed
Very fast
Best for high-volume, cost-sensitive text tasks like classification, summarization, and simple q&a where bleeding-edge quality is not required.
Context
4k tokens
This is a pinned legacy snapshot (v0613) and may eventually be deprecated by OpenAI. The 4,095-token context window is its most significant practical limitation. OpenAI's own GPT-4o mini offers drastically more context and better quality at a comparable price — strongly consider migrating.
LegacyBudgetFastShort ContextOpenAI
Best for
High-volume, cost-sensitive text tasks like classification, summarization, and simple Q&A where bleeding-edge quality is not required.
View model
OpenAIBalanced

OpenAI: GPT-3.5 Turbo Instruct

GPT-3.5 Turbo Instruct is a legacy completion-style model from OpenAI, designed for instruction-following tasks using the older text completion API rather than the chat API. It excels at structured text generation, fill-in-the-middle tasks, and traditional NLP workflows that predate the chat paradigm.

Verdict
A legacy model only worth using if your pipeline depends on the text completion API.
Quality score
30%
Pricing
$1.50/1M in
$2.00/1M out
Speed
Very fast
Best for legacy completion api workflows, structured text generation, and simple instruction-following tasks where the chat format is not required.
Context
4k tokens
Uses the legacy /v1/completions endpoint, not /v1/chat/completions. The 4,095-token context window is a hard constraint that makes it unsuitable for most modern tasks. OpenAI has not deprecated it, but it receives no capability updates.
LegacyCompletion APILow LatencyNarrow TasksOld Gen
Best for
Legacy completion API workflows, structured text generation, and simple instruction-following tasks where the chat format is not required.
View model
OpenAIPremium

OpenAI: GPT-4 (older v0314)

GPT-4 v0314 is a frozen snapshot of the original GPT-4 release from March 2023, preserved for reproducibility and regression testing. It offers the same core reasoning capabilities as early GPT-4 but lacks all subsequent improvements, fine-tuning updates, and safety refinements.

Verdict
An expensive museum piece: only justified if you need this exact model snapshot for legacy reproducibility.
Quality score
40%
Pricing
$30.00/1M in
$60.00/1M out
Speed
Balanced
Best for reproducible research or legacy workflows that require consistent, version-locked gpt-4 outputs.
Context
8k tokens
This is a frozen March 2023 snapshot of GPT-4, not a current model. OpenAI may deprecate legacy snapshots with limited notice. The 8,191-token context window is a hard constraint. Cost is identical to much more capable current models, making this a poor choice for new projects.
LegacyGPT-4Version-lockedResearchDeprecated
Best for
Reproducible research or legacy workflows that require consistent, version-locked GPT-4 outputs.
View model
$2.50/1M
$2.00/1M
400K
Fast
GPT-4o Mini
Budget
$0.15/1M$0.60/1M128KVery fast
OpenAI: GPT-5.1
Balanced
$1.25/1M$10.00/1M400KBalanced
GPT-5.2
Premium
$1.75/1M$14.00/1M200KBalanced
OpenAI: GPT-4.1
Balanced
$2.00/1M$8.00/1M1.047576MBalanced
GPT-4o
Balanced
$0.15/1M$0.60/1M128KFast
GPT-5.2 Mini
Balanced
$1.20/1M$4.80/1M128KFast
OpenAI: GPT-5 Mini
Budget
$0.25/1M$2.00/1M400KVery fast
OpenAI: GPT-4.1 Mini
Budget
$0.40/1M$1.60/1M1.047576MVery fast
OpenAI: GPT-5 Chat
Balanced
$1.25/1M$10.00/1M128KBalanced
OpenAI: o4 Mini
Balanced
$1.10/1M$4.40/1M200KDeliberate
OpenAI: GPT-5 Pro
Premium
$15.00/1M$120.00/1M400KDeliberate
OpenAI: GPT-5.1-Codex-Mini
Budget
$0.25/1M$2.00/1M400KVery fast
OpenAI: o3 Mini
Balanced
$1.10/1M$4.40/1M200KDeliberate
OpenAI: o4 Mini High
Balanced
$1.10/1M$4.40/1M200KDeliberate
OpenAI: o3
Balanced
$2.00/1M$8.00/1M200KDeliberate
OpenAI: GPT-5.1-Codex
Balanced
$1.25/1M$10.00/1M400KBalanced
OpenAI: o3 Mini High
Balanced
$1.10/1M$4.40/1M200KDeliberate
OpenAI: GPT-5 Codex
Balanced
$1.25/1M$10.00/1M400KBalanced
OpenAI: GPT-4 Turbo
Premium
$10.00/1M$30.00/1M128KBalanced
OpenAI: GPT-5.1-Codex-Max
Balanced
$1.25/1M$10.00/1M400KBalanced
OpenAI: GPT-5.3 Chat
Balanced
$1.75/1M$14.00/1M128KBalanced
OpenAI: GPT-5 Nano
Budget
$0.05/1M$0.40/1M400KVery fast
OpenAI: o3 Pro
Premium
$20.00/1M$80.00/1M200KDeliberate
OpenAI: GPT-4.1 Nano
Budget
$0.10/1M$0.40/1M1.047576MVery fast
OpenAI: o4 Mini Deep Research
Balanced
$2.00/1M$8.00/1M200KDeliberate
OpenAI: GPT-5.1 Chat
Balanced
$1.25/1M$10.00/1M128KBalanced
OpenAI: o1-pro
Premium
$150.00/1M$600.00/1M200KDeliberate
OpenAI: GPT-5.3-Codex
Balanced
$1.75/1M$14.00/1M400KBalanced
OpenAI: GPT-4 Turbo Preview
Premium
$10.00/1M$30.00/1M128KBalanced
OpenAI: o3 Deep Research
Premium
$10.00/1M$40.00/1M200KDeliberate
OpenAI: GPT-4 Turbo (older v1106)
Premium
$10.00/1M$30.00/1M128KBalanced
OpenAI: o1
Premium
$15.00/1M$60.00/1M200KDeliberate
OpenAI: GPT Audio Mini
Balanced
$0.60/1M$2.40/1M128KFast
OpenAI: GPT Audio
Balanced
$2.50/1M$10.00/1M128KBalanced
OpenAI: GPT-3.5 Turbo
Budget
$0.50/1M$1.50/1M16KVery fast
OpenAI: gpt-oss-safeguard-20b
Budget
$0.07/1M$0.30/1M131KFast
OpenAI: GPT-4
Premium
$30.00/1M$60.00/1M8KBalanced
OpenAI: GPT-3.5 Turbo 16k
Balanced
$3.00/1M$4.00/1M16KFast
OpenAI: GPT-3.5 Turbo (older v0613)
Balanced
$1.00/1M$2.00/1M4KVery fast
OpenAI: GPT-3.5 Turbo Instruct
Balanced
$1.50/1M$2.00/1M4KVery fast
OpenAI: GPT-4 (older v0314)
Premium
$30.00/1M$60.00/1M8KBalanced
Compare all providers →