
OpenAI's mysterious new model wins the IMO 2025 gold medal! Conquering the peak of Olympiad mathematics, Silicon Valley is buzzing

OpenAI's brand new general reasoning model won the gold medal at the 2025 IMO competition, solving 5 problems and scoring 35 points, far surpassing other models. This achievement is considered a significant breakthrough in AI reasoning technology, potentially disrupting traditional thinking chains. The model took the exam under the same conditions as humans and ultimately won with a high score, demonstrating its potential in complex tasks
Just yesterday, the world's top large models were completely defeated at the IMO competition in 2025, not even touching the edge of a bronze medal.
However, just now, OpenAI dropped a heavy bombshell—they successfully won the gold medal at IMO 2025 with a brand new "general reasoning model"!
Out of 6 problems, they solved 5, racking up an impressive 35 points!
It's worth noting that the previous best performer, Gemini 2.5 Pro, only scored 13 points.
Co-founder Greg Brockman, lead Alexander Wei, and various researchers from OpenAI excitedly announced this milestone achievement on Twitter!
In response, Noam Brown, the father of poker, stated that the significance of this achievement goes beyond "AI conquering the IMO" itself.
Swipe left and right to view
People are speculating that OpenAI may have deployed a disruptive reasoning technology this time, completely bidding farewell to the traditional Chain of Thought (CoT) reasoning.
This is not just a victory for a model, but the beginning of a whole new era!
Even more shocking is that OpenAI announced that this historic model is not the rumored GPT-5, but a completely new, experimental model!
And this model will not be released at all, it's simply too mysterious!
Mysterious model wins IMO gold medal
Alexander Wei and his team had the model compete under the exact same conditions as humans:
Two 4.5-hour exam periods, no tools, no internet, relying solely on understanding the problems to write out complete proofs in natural language.
Then, it was strictly graded by three former IMO medalists.
In the end, the model achieved an astonishing high score of 35/42, reaching gold medal level.
In contrast, in the past, AI needed specialized training in specific fields to achieve victory in complex tasks like Go, Dota, or others But this time, OpenAI broke this iron rule—the new model not only is not a special supply for IMO, but can also think for hours.
In contrast, the o1 model we are familiar with is measured in seconds, and Deep Research is measured in minutes.
This deep, sustained creative thinking ability is a chasm that AI has struggled to overcome in the past!
What does this mean? Has it reached AGI level? What is special about winning IMO?
First, compared to previous benchmarks, IMO problems require a higher level of sustained creative thinking.
In terms of reasoning time, it has now climbed all the way: GSM8K (top humans need about 0.1 minutes) → MATH benchmark (about 1 minute) → AIME (about 10 minutes) → IMO (about 100 minutes).
Secondly, the submissions for IMO are difficult to verify multi-page proofs.
Making progress in this field requires surpassing the reinforcement learning paradigm with clear, verifiable rewards.
By doing so, the OpenAI research team has developed a model capable of constructing complex and impeccable arguments at the level of human mathematicians.
Moreover, this model did not achieve this level of capability through training on "specific tasks (IMO)," but rather achieved new breakthroughs in general reinforcement learning and computational expansion during testing.
So, was the o3-alpha exposed last night just a prelude?
It turns out this is the real trump card that OpenAI has been holding back!
Complete Solution Process
If you are interested, you can take a look at the freshly released 2025 IMO competition problem solutions from OpenAI.
The model solved problems one to five (P1-P5), but failed to solve problem six (P6). As rumored, the difficulty of this sixth problem is extremely high, with only 6 people worldwide cracking it.
Repository address: https://github.com/aw31/openai-imo-2025-proofs/blob/main/README.md
Now let's take a look at the specific solution process for the first five problems with this new model.
The first problem is a coordinate geometry problem.
It can be seen that the key to solving this problem is to find the n lines covering the points and the possible number of sunlight lines.
The model adopts a unique approach to determine all non-negative integers k that meet the conditions.
Lemma: For n≥4, any covering of P_n with n lines must use one side of the triangle.
A precise analysis of the case for n=3.
For the general case of n≥3, it is proven that for each n, there exists a configuration with k=0, 1, or 3.
The main conclusion is proven using the reduction lemma.
The schematic diagram is as follows.
In the solution provided by the expert netizen, it is quite difficult to directly prove that the line passing through point H and parallel to AP is tangent to the circumcircle of triangle BER.
However, this problem can be rephrased: define X as the midpoint of EF on the side that does not contain B. If it can be proven that HX is parallel to EF, this conclusion can basically be established.
On the other hand, if this line is tangent, then it must touch the circle at the midpoint.
Therefore, it is sufficient to prove these two points.
During the proof process, the model mainly completes the following four steps:
- Analytical settings and parameters. Let P be the circumcenter of triangle ACD. Connect AP and the intersection points of the two circles Ω and Γ are points E and F, respectively. Find the circumcircle equation passing through points B, E, and F.
Slide up and down to view
In the solution of the human player, the key to this problem is to prove the yellow inequality in the diagram below.
The solution to this problem by the model is divided into the following four steps.
Consider bonza-type functions, that is, functions that satisfy property P_f.
When the function takes a value greater than 1 at a certain prime number, consider the congruence relation in the sense of modulo prime.
Conclusion: If a certain odd prime p satisfies f > 1, then the entire function f must be an identity function.
Enter the main structural lemma: analyze the structure of the function in the case of non-identity functions.
Scroll up and down to view
The key steps to solving it can be expressed in the following form.
Scroll up and down to view
The solution to this problem includes the following three parts.
During the problem-solving process, the model discussed the following three situations.
When λ ≥ c (no defense), Alice always wins.
When λ > c, Alice wins.
When λ < c (c = 1/√2), Bazza wins.
Scroll up and down to view
The sixth question involves combinatorial mathematics of graphs.
This recognized difficult problem has only 6 human participants who can solve it, and all six large models, including o3, o4-mini, Gemini 2.5 Pro, Grok-4, and DeepSeek-R1, scored zero, even this super strong model from OpenAI also failed Currently, there is no large model in the world that can solve the sixth question.
An Easter Egg
When Alexander Wei announced this news, he used the image of "Strawberry."
"Strawberry" was the code name used by OpenAI during its internal research to promote a brand new reasoning model project, which is the "o" series model we are now familiar with.
Author Introduction
Alex Wei is a research scientist at OpenAI, primarily focusing on large language models and reasoning. He has previously researched the intersection of machine learning, game theory, and algorithms.
He obtained his Ph.D. in Computer Science from the University of California, Berkeley, under the supervision of Nika Haghtalab, Michael I. Jordan, and Jacob Steinhardt; and received his bachelor's and master's degrees from Harvard University, under the guidance of Jelani Nelson and Scott Kominers.
He was a member of the FAIR team and contributed to building the first artificial intelligence to reach human-level performance in the game of Diplomacy—CICERO. This achievement was published in the journal Science in 2022.
Reference:
https://x.com/alexwei_/status/1946477742855532918
Risk Warning and Disclaimer
The market has risks, and investment requires caution. This article does not constitute personal investment advice and does not take into account the specific investment goals, financial situation, or needs of individual users. Users should consider whether any opinions, views, or conclusions in this article are suitable for their specific circumstances. Investment based on this is at their own risk
