OpenAI released GPT-5.2 on Thursday, just weeks after declaring an internal “code red” in response to Google’s Gemini 3 topping industry benchmarks.
What do 1,000 journalists and PR pros know about AI that you don't? They took AI Quick Start, a 1-hour live class from The Media Copilot. 94% satisfaction. Find out how to work smarter with AI in just 60 minutes. Get 20% off with the code AIPRO: https://mediacopilot.ai/
The company says GPT-5.2 Thinking beats or ties top industry professionals on 70.9% of tasks across 44 occupations, according to expert human judges. Those tasks include creating presentations, spreadsheets, legal briefs, and engineering blueprints. The model completed them at more than 11 times the speed and less than 1% the cost of human experts.
“We designed GPT‑5.2 to unlock even more economic value for people,” OpenAI wrote in its announcement. “It’s better at creating spreadsheets, building presentations, writing code, perceiving images, understanding long contexts, using tools, and handling complex, multi-step projects.”
The model also hallucinates less. OpenAI says responses with errors dropped 30% compared to GPT-5.1.
Fidji Simo, OpenAI’s CEO for applications, denied on a press call that GPT-5.2 was a response to Gemini 3, according to Axios. But the competitive pressure is real. Google’s Gemini app has grown to more than 650 million monthly active users, compared to OpenAI’s 800 million weekly active users.
Safety improvements are part of this release. OpenAI says GPT-5.2 produces fewer undesirable responses when users show signs of mental health distress or emotional reliance on the model. The company faces multiple wrongful death lawsuits over troubling conversations users had with ChatGPT.
GPT-5.2 comes in three versions: Instant for fast everyday tasks, Thinking for deeper work like coding and analysis, and Pro for difficult questions. It’s rolling out now to paid ChatGPT plans and is available in the API.
Why it matters for newsrooms: The claimed improvements in spreadsheets, presentations, and long-context understanding could make GPT-5.2 more useful for journalists handling data analysis or working with lengthy documents. But Anthropic’s Opus 4.5 still scores higher on SWE-Bench Verified, the software coding benchmark, suggesting the AI race remains wide open.







