FrontierMath

AI benchmark testbed for mathematical problem solving

Artificial intelligence (AI)
Part of a series on
Major goals Artificial general intelligence Intelligent agent Recursive self-improvement Planning Computer vision General game playing Knowledge representation Natural language processing Robotics AI safety
Approaches Machine learning Symbolic Deep learning Bayesian networks Evolutionary algorithms Hybrid intelligent systems Systems integration Open-source AI data centers
Applications Bioinformatics Deepfake Earth sciences Finance Generative AI Art Audio Music Government Healthcare Industry Software development Translation Military Physics Projects
Philosophy AI alignment Artificial consciousness The bitter lesson Chinese room Friendly AI Ethics Existential risk Turing test Uncanny valley Human–AI interaction
History Timeline Progress AI winter AI boom AI bubble
Controversies Deepfake pornography Taylor Swift deepfake pornography controversy Grok sexual deepfake scandal Google Gemini image generation controversy It's the Most Terrible Time of the Year Pause Giant AI Experiments Removal of Sam Altman from OpenAI Statement on AI Risk Tay (chatbot) Théâtre D'opéra Spatial Voiceverse NFT plagiarism scandal
Glossary Glossary
v t e

FrontierMath is a test bed to benchmark ^[1] various artificial intelligence systems in their attempts to solve 14 bespoke^[2] heretofore unexamined mathematical problems^[3] (none of which are on the scale of the Millennium Problems). It was established by the non-profit research organization Epoch AI in November 2024.^[4] The first such open problem—of the "moderately interesting" rank—to be solved was in hypergraph theory: "A Constant-Factor Lower Bound For H (n)" by GPT-5.4.^[5]

References

[edit ]

↑ Glazer, Elliot; Erdil, Ege; Besiroglu, Tamay; Chicharro, Diego; Chen, Evan; Gunning, Alex; Olsson, Caroline Falkman; Denain, Jean-Stanislas; Ho, Anson (2025年12月23日). "FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI". arXiv:2411.04872 [cs.AI].
↑ Team, MindStudio (April 7, 2026). "What Is the Frontier Math Benchmark? Why Open Research Problems Expose True AI Reasoning". MindStudio.
↑ "FrontierMath: Open Problems - Unsolved Mathematical Challenges". Epoch AI.
↑ "AI Math Benchmarks: AI's Growing Capabilities - IEEE Spectrum". spectrum.ieee.org.
↑ Johnson, Olivia (March 14, 2026). "GPT-5.4 solves its first open math problem from FrontierMath benchmark". remio.

Retrieved from "https://en.wikipedia.org/w/index.php?title=FrontierMath&oldid=1367333052"

See also

References