OpenAI Achieves Gold Medal-Level LLM AI Math Performance on 2025 International Mathematical Olympiad

[HPP] Noam BrownJuly 19, 202511 min

21 connections·27 entities in this video→

Capture as you watch

Save any video to veridive in one click.

The free veridive Chrome extension pulls the transcript from any YouTube video or podcast you're watching — ready to ask, cite, and connect.

OpenAI's IMO Gold Medal Achievement

🏆 OpenAI's general reasoning LLM achieved gold medal-level performance on the 2025 International Mathematical Olympiad (IMO).
⏱️ This milestone was reached under human time limits, without external tools, and on novel, previously unseen mathematical tasks.
💡 The achievement is particularly notable as it demonstrates strong generalization capabilities beyond benchmarks saturated by data contamination from prior IMO problems.

Breakthrough in AI Reasoning

🚀 The success stems from breaking new ground in general-purpose reinforcement learning and test-time compute scaling.
🧠 The model can construct intricate, watertight arguments comparable to those of human mathematicians.
🌱 This progress moves beyond the traditional reinforcement learning paradigm that relies on clear-cut, verifiable rewards.

Advanced Problem-Solving Capabilities

⏳ Unlike previous models that thought for seconds or minutes, this new model thinks for hours to solve complex problems.
🌐 The mathematical tasks tackled include difficult algebra, pre-calculus, and advanced branches like projective geometry, functional equations, and combinatorics, indicating a high level of universality.
🤯 The speaker expressed surprise at this general LLM's performance, having previously believed that neurosymbolic systems would be necessary for such tasks.

Implications and Future Outlook

📈 This development is seen as a significant step towards superintelligence that could aid in advanced research.
⚠️ The high compute costs associated with the model's operation might explain why it is currently an internal OpenAI achievement rather than a public one.

Ask, don't scrub

Discover the spoken web.

veridive answers questions with exact timestamps and citations — across every podcast, video, and article you've saved.

Knowledge graph27 entities · 21 connections

How they connect

An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.

Hover · drag to explore

27 entities

Chapters4 moments

Key Moments

Transcript39 segments

Full Transcript

Follow the thread

Find every place these ideas show up.

veridive maps the same people, claims, and topics across thousands of sources — so you can trace an idea from one conversation to the next.

Topics11 themes

What’s Discussed

International Mathematical Olympiad (IMO)Large Language Models (LLMs)General reasoning LLMReinforcement learningTest-time compute scalingData contaminationNeurosymbolic systemsSuperintelligenceMathematical proofsAdvanced mathematicsCompute costs

Smart Objects27 · 21 links

Concepts· 21

Company· 1

People· 3

Events· 2

Hours of content, seconds to the answer.

Save what you listen to. Ask it anything. Watch the threads between sources surface on their own.

Get started free