Key takeaways
The new artificial intelligence model arrives less than a month after the company launched GPT-5.1, underscoring the intensifying race among tech giants to dominate the AI landscape.
The release comes amid reports that OpenAI CEO Sam Altman issued an internal memo in early December warning staff about "temporary economic headwinds" and forecasting "rough vibes" caused by Google's renewed surge in the AI market.
According to The Information and The Wall Street Journal, Altman wrote that the company was "at a critical time for ChatGPT" and needed to marshal resources toward improving its flagship product.
Model designed for professional knowledge work
GPT-5.2 represents a strategic push toward enterprise and professional applications, with three distinct versions tailored for different use cases.
Fidji Simo, OpenAI's Chief Product Officer, emphasized the model's business focus during a press briefing with journalists on Thursday.
"We designed 5.2 to unlock even more economic value for people," Simo said. "It's better at creating spreadsheets, building presentations, writing code, perceiving images, understanding long context, using tools and then linking complex, multi-step projects."
The model is available as GPT-5.2 Instant for speed-optimized routine queries, GPT-5.2 Thinking for complex structured work, including coding and document analysis, and GPT-5.2 Pro for maximum accuracy on difficult problems.
According to OpenAI, the average ChatGPT Enterprise user reports saving 40 to 60 minutes daily, with heavy users claiming time savings exceeding 10 hours per week.
On GDPval, OpenAI's benchmark measuring performance across 44 professional occupations, GPT-5.2 Thinking beat or tied top industry professionals on 70.9% of well-specified tasks.
By comparison, GPT-5 scored 38.8%, Anthropic's Claude Opus 4.5 achieved 59.6%, and Google's Gemini 3 Pro reached 53.3%.
'Code red' initiative reshapes company priorities
While OpenAI executives maintained that GPT-5.2 had been in development for "many months," the timing of its release cannot be separated from the company's competitive concerns.
Altman's code red directive reportedly led OpenAI to postpone work on several initiatives, including advertising integration, AI agents for health and shopping applications, and the ChatGPT Pulse personal assistant.
"We announced this code red to really signal to the company that we want to marshal resources in one particular area, and that's a way to really define priorities and define things that can be deprioritized," Simo told reporters in the Thursday briefing, according to CNBC.
"We have had an increase in resources focused on ChatGPT in general. I would say that helps with the release of this model, but that's not the reason it's coming out this week in particular."
The pressure on OpenAI intensified following Google's November launch of Gemini 3, which topped industry benchmark leaderboards and drew public praise from industry leaders.
Marc Benioff, Salesforce CEO, publicly stated on social media that after spending two hours with Gemini 3, he was "not going back" to ChatGPT, which he had used daily for three years.
Google reported that Gemini's monthly active users surged from 450 million in July to 650 million in October 2025.
Meanwhile, Anthropic has demonstrated strong momentum in the enterprise market, growing from fewer than 1,000 business customers two years ago to more than 300,000 as of September 2025.
OpenAI's release materials position GPT-5.2 as outperforming competitors across most evaluations.
On SWE-Bench Pro, which evaluates agentic coding performance, GPT-5.2 Thinking scored 55.6% compared to GPT-5.1's 50.6%, Gemini 3 Pro's 43.3%, and Claude Opus 4.5's 52.0%. On GPQA Diamond, a graduate-level science reasoning benchmark, GPT-5.2 achieved 92.4% versus Gemini 3 Pro's 91.9%.
The model demonstrated particular strength in abstract reasoning tests.
On ARC-AGI-2, GPT-5.2 Thinking scored 52.9% and GPT-5.2 Pro reached 54.2%, representing a threefold increase from GPT-5.1 Thinking's 17.6%.
In mathematics, GPT-5.2 Thinking achieved a perfect 100% score on the AIME 2025 (American Invitational Mathematics Examination) without using external tools, marking the first AI model to reach this milestone.
Despite these advances, Anthropic's Claude Opus 4.5 maintains an edge in certain coding benchmarks.
On SWE-Bench Verified, Claude scores higher than GPT-5.2, though OpenAI told reporters that this benchmark is less "contamination resistant, challenging, diverse, and industrially relevant" than SWE-Bench Pro.
Sam Altman told CNBC on Thursday that Google's Gemini 3 ultimately "had less of an impact on our metrics than we feared" and that he expects OpenAI to exit code red status by January 2025.
The company announced it has no current plans to deprecate GPT-5.1, GPT-5, or GPT-4.1 in its API, giving developers advance notice before any future changes.
GPT-5.2 is rolling out to ChatGPT Plus, Pro, Business, and Enterprise subscribers beginning Thursday, with gradual deployment to manage server load.
Developers can access the model immediately through OpenAI's API with pricing set 40% higher than GPT-5.1, reflecting what the company describes as greater token efficiency and superior task completion in fewer turns.
Read more:
OpenAI And Microsoft Face Wrongful Death Lawsuit Over Connecticut Murder-Suicide
Oracle Shares Tumble As Revenue Miss And Surging AI Spending Spook Investors
UK Launches AI-Powered Submarine Defence Programme To Counter Russian Threat