BY THE NUMBERS
May 2024
GPT-4o Release
~87.5%
o3 ARC-AGI Score (Dec 2024)
Q2 2026
GPT-5 Expected Release
1. What Is GPT-5 and Why Does It Matter
GPT-5 is OpenAI's next flagship large language model, expected to unify the capabilities of the general-purpose GPT series with the reasoning capabilities of the o-series (o1, o3). OpenAI's stated roadmap for 2026 is to collapse the distinction between "fast/cheap" chat models and "slow/deep" reasoning models into a single system that performs at the top of both categories.
This matters for two reasons. First, the separation between GPT-4o (good at conversation, fast) and o3 (good at hard problems, slow) creates friction for users who need both capabilities in a single product. Second, the competitive pressure from Anthropic's Claude 3.5 Sonnet and Google's Gemini 2.0 Ultra has been accelerating in 2025 and 2026. GPT-5 is OpenAI's response to maintaining technical leadership in the flagship model category.
For full context on OpenAI's current product and research direction, including the Pentagon classified AI deal and Sam Altman's government oversight testimony, see the OpenAI hub.
2. The Confirmed Model Progression | GPT-4o to o3 to GPT-4.5
To understand where GPT-5 fits, it helps to understand what OpenAI has already shipped. The progression from early 2024 through early 2025 represents a deliberate strategy shift from a single unified model architecture toward specialized tracks, before presumably merging them in GPT-5.
GPT-4o | May 2024
The "omni" model unified text, voice, and image understanding in a single model. It was meaningfully faster and cheaper than GPT-4 Turbo, enabling real-time voice conversations for the first time. Became the default ChatGPT model for free-tier users. Remained the standard conversational flagship through 2024.
o1 and o3 | 2024
The o-series introduced chain-of-thought reasoning as a first-class feature. o1 Preview launched September 2024 and outperformed PhD-level experts on science benchmarks. o3 followed in December 2024 with near-perfect ARC-AGI benchmark results (~87.5%), which the ARC Prize Foundation described as "a significant breakthrough."
GPT-4.5 | February 2025
The last pre-GPT-5 generation flagship. Focused on improved instruction-following, emotional intelligence in conversation, and better handling of nuanced requests. Not a reasoning model but a refinement of the GPT-4 generation. Available on Plus and Team tiers. This is the model that GPT-5 will replace.
3. What GPT-5 Is Expected to Deliver
Important Note:
OpenAI CEO Sam Altman has publicly described GPT-5 as a model that will be "meaningfully smarter" than its predecessors and will unify the GPT and o-series approaches. The practical implication is a model that handles both routine conversational tasks and hard reasoning problems in the same interface, without users needing to manually switch to the o-series for difficult requests.
Based on the model progression pattern, GPT-5 is expected to:
- Perform at or above o3 on reasoning benchmarks (math, coding, logic) while matching GPT-4o on conversational speed
- Support longer context windows than GPT-4o's 128K token limit
- Improve on multimodal understanding, particularly for complex document and image analysis
- Introduce better memory and personalization features across conversations
- Lower per-token API cost compared to the current o3 pricing tier
4. Competitive Context | Claude, Gemini, and the Race in 2026
The AI model landscape has compressed significantly since GPT-4 defined a clear capability tier in 2023. By early 2026, three organizations have competitive flagship models:
- OpenAI: GPT-4o + o3 (dual-model strategy until GPT-5 unifies them). Market position bolstered by the AWS and Pentagon classified AI infrastructure deal.
- Anthropic (Claude): Claude 3.5 Sonnet is widely considered the top coding assistant and documents model. Anthropic has prioritized safety and enterprise trust. See the Claude hub for model updates.
- Google (Gemini): Gemini 2.0 Ultra has the advantage of deep integration with Google Search, Workspace, and Android. Google's infrastructure cost advantage is substantial. Coverage at the Google hub.
The GPT-5 launch will be the clearest signal yet of whether OpenAI can maintain its position as the default public reference point for AI capability, a position it held unchallenged from GPT-3 (2020) through GPT-4 (2023) but has faced real competition on since mid-2024.
We're going to have a single model that's very capable of doing both fast, cheap things and very slow, careful reasoning. That's GPT-5.
5. Pricing | What GPT-5 Will Likely Cost
OpenAI has not confirmed GPT-5 pricing. Based on the historical pattern of each new generation being cheaper per token than the previous at equivalent quality, GPT-5 is expected to be priced lower than the current o3 Pro tier while offering similar or better reasoning performance.
The current pricing structure for comparison:
- ChatGPT Free: GPT-4o with rate limits. No o3 access.
- ChatGPT Plus ($20/month): GPT-4o, GPT-4.5, o3-mini, limited o3.
- ChatGPT Pro ($200/month): Unlimited o3, access to experimental models.
- API (pay-per-token): GPT-4o at $5 per million input tokens, o3 at $10 per million input tokens as of early 2026.
GPT-5 is expected to appear first on the Pro tier and API before rolling down to Plus and Free. This mirrors the GPT-4 to GPT-4o rollout cadence of 2023 to 2024.
Sources & References
- [1] GPT-4o System Card — OpenAI's official system card for GPT-4o, covering capabilities and safety evaluations.
- [2] OpenAI o3 Model Page — Official OpenAI page for the o3 reasoning model with benchmark comparisons.
- [3] ARC Prize Foundation | o3 ARC-AGI Results — Independent evaluation of o3 on the ARC-AGI benchmark, describing it as a significant breakthrough.