FrontierMath
FrontierMath is a benchmark[1] that tests how well artificial intelligence systems can solve 14 bespoke,[2] previously unexamined mathematical problems[3] (none of which are on the scale of the Millennium Prize Problems). It was established by the non-profit research organization Epoch AI in November 2024.[4] The first such open problem to be solved, one of the "moderately interesting" rank, was in hypergraph theory: "A Constant-Factor Lower Bound For H(n)", solved by GPT-5.4.[5] The methodology was novel enough that it inspired memes.[6]
References
- Glazer, Elliot; Erdil, Ege; Besiroglu, Tamay; Chicharro, Diego; Chen, Evan; Gunning, Alex; Olsson, Caroline Falkman; Denain, Jean-Stanislas; Ho, Anson (December 23, 2025), FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI, arXiv, doi:10.48550/arXiv.2411.04872, arXiv:2411.04872, retrieved May 16, 2026
- Team, MindStudio (April 7, 2026). "What Is the Frontier Math Benchmark? Why Open Research Problems Expose True AI Reasoning". MindStudio.
- "FrontierMath: Open Problems - Unsolved Mathematical Challenges". Epoch AI.
- "AI Math Benchmarks: AI's Growing Capabilities - IEEE Spectrum". spectrum.ieee.org.
- Johnson, Olivia (March 14, 2026). "GPT-5.4 solves its first open math problem from FrontierMath benchmark". remio.
- https://www.weaving.news/news/019d1dbd-7129-7664-a16e-fd3e4f9454e0