An 80-year mathematical puzzle tied to combinatorial geometry has reportedly been overturned by an AI system, with OpenAI saying its reasoning model has produced an original proof that challenges a long-held assumption in a problem first posed by Paul Erdős in 1946.
The claim places artificial intelligence directly inside active mathematical research, not just as a support tool but as a system capable of generating work that holds up under expert review.
AI System Tackles Erdős Geometry Problem After Decades Of Uncertainty
OpenAI says a general-purpose reasoning model independently produced a proof that disproves a long-standing conjecture in planar unit distance geometry.
For decades, mathematicians believed the strongest configurations resembled square-grid structures.
The model instead identified a different family of constructions that reportedly performs better, overturning that assumption.
The company described the outcome as the first instance of one of its systems autonomously resolving a prominent open problem in mathematics without step-by-step human direction.
It added that the reasoning process combined ideas across fields, including algebraic number theory, to complete a long chain of logic.
Mathematicians Step In To Verify The Result
The proof was externally reviewed by several researchers, including Noga Alon, Melanie Wood and Thomas Bloom, who is the mathematician behind the Erdős problems database.
Bloom had previously criticised earlier AI-related mathematical claims as inaccurate but has now supported the latest result, adding weight to its credibility.
OpenAI said the work was produced by a general reasoning model rather than a system specifically trained for mathematical discovery, stressing that the system was not designed around this type of problem.
Earlier AI Claim Resurfaces Old Doubts
The announcement has revived scrutiny following a previous misstep involving GPT-5.
Seven months ago, former OpenAI executive Kevin Weil claimed on social media that the model had solved ten unsolved Erdős problems and made progress on eleven others.
That claim was later withdrawn after mathematicians showed the system had rediscovered known solutions from existing literature.
Critics including Yann LeCun and Demis Hassabis publicly challenged the accuracy of the announcement at the time.
OpenAI says it has since taken a more cautious approach, publishing supporting commentary from domain experts alongside the latest result.
Why Researchers Say The Finding Matters Beyond Mathematics
OpenAI argues the development signals a shift in how advanced systems handle complex reasoning tasks.
It says the model was able to sustain long chains of logic, combine ideas across disciplines and explore solutions that researchers may not have previously considered.
In a statement, the company said
“This result points to something larger: AI systems are becoming capable of holding together long, difficult chains of reasoning, connecting ideas across distant fields, and surfacing paths researchers may not have explored. We believe those same abilities will soon accelerate work in biology, physics, engineering, and medicine.”
The broader implication, according to the company, is that general-purpose models may increasingly contribute to scientific exploration rather than only assist it.
AI Reshapes How High-Level Research Is Viewed In Industry
The development arrives amid heightened competition across the AI sector.
OpenAI is reportedly preparing an initial public offering filing as soon as this week, shortly after a US jury cleared the company in a lawsuit brought by Elon Musk.
Meanwhile, rival Anthropic is on track to report its first profitable quarter, with projected revenue of 10.9 billion dollars.
The industry’s focus is also shifting towards agentic systems capable of executing tasks rather than simply generating responses.
Investor Ken Griffin recently warned that AI is already performing work once reserved for PhD-level professionals, particularly in finance and analytics.
Within the research community, figures such as Andrej Karpathy have also moved deeper into frontier model development, joining Anthropic to focus on long-horizon reasoning systems.
OpenAI maintains that human oversight remains central, stating that researchers still decide which problems are worth pursuing and how results are interpreted, even as AI systems take on increasingly complex reasoning tasks.