Jump to content
Wikipedia The Free Encyclopedia

FrontierMath

From Wikipedia, the free encyclopedia
Part of a series on
Artificial intelligence (AI)
Glossary

FrontierMath is a test bed to benchmark [1] various artificial intelligences in their attempts to solve 14 bespoke[2] heretofore unexamined mathematical problems[3] (none of which are on the scale of the Millennium Problems). It was established by the non-profit research organization Epoch AI in November 2024.[4] The first such open problem—of the "moderately interesting" rank—to be solved was in hypergraph theory: "A Constant-Factor Lower Bound For H (n)" by GPT-5.4.[5] Such was the novelty of the methodology that memes were generated.[6]

See also

[edit ]

References

[edit ]
  1. ^ Glazer, Elliot; Erdil, Ege; Besiroglu, Tamay; Chicharro, Diego; Chen, Evan; Gunning, Alex; Olsson, Caroline Falkman; Denain, Jean-Stanislas; Ho, Anson (2025年12月23日), FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI, arXiv, doi:10.48550/arXiv.2411.04872, arXiv:2411.04872, retrieved 2026年05月16日
  2. ^ Team, MindStudio (April 7, 2026). "What Is the Frontier Math Benchmark? Why Open Research Problems Expose True AI Reasoning". MindStudio.
  3. ^ "FrontierMath: Open Problems - Unsolved Mathematical Challenges". Epoch AI.
  4. ^ "AI Math Benchmarks: AI's Growing Capabilities - IEEE Spectrum". spectrum.ieee.org.
  5. ^ Johnson, Olivia (March 14, 2026). "GPT-5.4 solves its first open math problem from FrontierMath benchmark". remio.
  6. ^ https://www.weaving.news/news/019d1dbd-7129-7664-a16e-fd3e4f9454e0

AltStyle によって変換されたページ (->オリジナル) /