Compute (machine learning)
In artificial intelligence and cloud computing, compute is the amount of computing power or computational resources required to train machine learning models and large language models. More broadly, compute is the computational power or resources necessary for a computer or computer program to function.
Definition
[edit ]Compute is commonly defined as the amount of computing power or computational resources required to train machine learning and large language models.[2] [1] The term "compute" has also been more broadly applied to cloud computing, referencing processing power, memory, networking, storage, and other resources required for the computation of any program.[3]
Compute is measured in petaflop/s-days and is used to document AI training.[4] A petaflop/s-day (pfs-day) consists of performing 1015 neural net operations per second for one day, or a total of about 1020 operations. The compute-time product serves as a mental convenience, similar to kilowatt-hour for energy. An amount of compute is meant to give an idea of the number of actual operations performed.[4]
History
[edit ]The noun compute was relatively uncommon, though not unheard of, before gaining wider usage in fields such as cloud computing and artificial intelligence in the 2010s.[5] Artificial intelligence company OpenAI popularized the concept of compute as a strategic metric for AI progress in 2018.[4]
OpenAI identified two eras of training AI systems in terms of compute-usage. From 1959 to 2012, compute roughly followed Moore’s law. [4] Between 2012 and 2018, the amount of compute used in the largest AI training runs increased exponentially, growing by more than 300,000 times — roughly doubling every 3.4 months.[4] [2] By comparison, Moore’s Law doubled every two years over the same period.[4] One of the largest models, released in 2020, used 600,000 times more computing power than the 2012 model.[2]
After 2020, compute growth began to slow down,[2] with the compute needed for the largest AI models continuing to slow down in 2023.[6]
Use
[edit ]Cloud computing
[edit ]AI provider SpaceXAI said in 2026 that their AI progress is driven by compute and used it a key metric in the AI training of its supercomputer Colossus, the which contains 1 million GPUs.[7] Anthropic has a contract of 1ドル.25 billion per month with SpaceXAI to buy all the compute capacity at Colossus 1 data center.[8] Since June 2026 ahead of the initial public offering of SpaceX, Google pays SpaceXAI 920ドル million monthly for cloud compute capacity.[9]
AI training
[edit ]A 2022 study found that current large language models are significantly under-trained, a consequence of focusing on scaling language models whilst keeping the amount of training data constant. By training over 400 language models of various parameter and token size, they found that "for compute-optimal training", the model size and the number of training tokens should ideally be scaled equally: for every doubling of model size the number of training tokens should also be doubled.[10]
Compute growth and AI progress
[edit ]Larger AI models trained on more data and using more computational resources, tend to perform better.[11] [10] This happens even if the algorithms themselves remain unchanged.[10]
As early as 2018, OpenAI noted the exponential increase in compute to be have a key role in AI progress.[4] OpenAI considers three factors drive the advance of AI: algorithmic innovation, data, and the amount of compute available for training.[4] AI models with more compute not only improve in the tasks they were trained on but can develop emergent abilities.[12] Incremental improvements can lead to more abrupt leaps in capabilities.[11]
Criticism and policy
[edit ]Increasing, promoting or constraining progress in artificial intelligence has often be done via controlling the amount of compute.[13] Policymarkers have enacted policies and provided support to make compute resources more accessible to domestic AI researchers.[13]
In a January 2022 report, the Center for Security and Emerging Technology (CSET) suggested to institutions that increasingly powerful and generalizable AI (AGI) will likely require other strategies than maximizing compute.[2] [13] Some AI researchers are also concerned that government might exclusively focus on scaling compute instead of other strategies.[13]
The CSET has reported on the various bottlenecks which could explain why deep learning needs for compute have slow down:
- training is expensive and training extremely large models generates traffic jams across many processors that are difficult to manage.[2]
- there is a limited supply of AI chips[2] (see AI chip memory shortage).
- CSET advances that the main resource is human capital, specifically talented researchers — according to a 2023 published survey of more than 400 AI researchers, academic and private sector workers.[13] [6] The survey found that AI researchers are not primarily or exclusively constrained by compute access.[13] However, both academic and industry AI researchers equally report concerns that insufficient compute could prevent them from contributing meaningfully to AI research in the future.[13] High compute users are more concerned about compute access.[13] When asked about which resource provided by the government would be the most useful to them, some AI researchers select compute, other prefer grant funding.[13]
For this goal, CSET advised policymakers to ensure that even researchers with smaller budgets could effectively contribute to AI research.[2] Other proposed strategies include using contemporary AI algorithms, managing modern AI infrastructure or focusing on interdisciplinary work between the AI field and other fields of computer science.[2]
A 2024 study on compute access found that academic-only AI research teams often have less compute intensive research topics, especially foundation models, compared to industry AI labs.[14] As a consequence, academia is likely to play a smaller role in advancing such techniques. The researchers suggest nationally-sponsored computing infrastructure as well as open science initiatives to boost academic compute access.[14]
See also
[edit ]- AI scaling law
- Grok (chatbot)
- ChatGPT
- Claude (language model)
- National Artificial Intelligence Research Institutes
References
[edit ]- ^ a b Samborska, Veronika (2025年01月20日). "Scaling up: how increasing inputs has made artificial intelligence more capable". Our World in Data.
- ^ a b c d e f g h i "AI and Compute". Center for Security and Emerging Technology . 2022年01月01日. Retrieved 2026年05月27日.
- ^ "What is Compute? - Enterprise Cloud Computing Explained - AWS". Amazon Web Services, Inc. Retrieved 2026年05月27日.
- ^ a b c d e f g h "AI and compute". OpenAI . 2018. Retrieved 2026年05月27日.
- ^ "compute". www.merriam-webster.com. 2026年01月08日. Retrieved 2026年05月28日.
- ^ a b "How Important is Compute to the Future of AI?". Center for Security and Emerging Technology. 2023年05月25日. Retrieved 2026年05月27日.
- ^ "Colossus: The World's Largest AI Supercomputer | xAI". x.ai. Retrieved 2026年05月27日.
- ^ Brandom, Russell (2026年05月06日). "Is xAI a neocloud now?". TechCrunch. Retrieved 2026年05月27日.
- ^ "SpaceX lands Google AI compute deal after Anthropic pact ahead of IPO". Reuters . 2026年06月05日.
- ^ a b c Hoffmann, Jordan; Borgeaud, Sebastian; Mensch, Arthur; Buchatskaya, Elena; Cai, Trevor; Rutherford, Eliza; Casas, Diego de Las; Hendricks, Lisa Anne; Welbl, Johannes (2022年03月29日), Training Compute-Optimal Large Language Models, arXiv, doi:10.48550/arXiv.2203.15556, arXiv:2203.15556, retrieved 2026年05月27日
- ^ a b "Artificial intelligence: Performance on knowledge tests vs. training computation". Our World in Data. Archived from the original on 2026年05月09日. Retrieved 2026年05月27日.
- ^ Wei, Jason; Tay, Yi; Bommasani, Rishi; Raffel, Colin; Zoph, Barret; Borgeaud, Sebastian; Yogatama, Dani; Bosma, Maarten; Zhou, Denny (2022年10月26日), Emergent Abilities of Large Language Models, arXiv, doi:10.48550/arXiv.2206.07682, arXiv:2206.07682, retrieved 2026年05月27日
- ^ a b c d e f g h i ""The Main Resource is the Human"". Center for Security and Emerging Technology. 2023年04月01日. Retrieved 2026年05月27日.
- ^ a b Besiroglu, Tamay; Bergerson, Sage Andrus; Michael, Amelia; Heim, Lennart; Luo, Xueyun; Thompson, Neil (2024年01月08日), The Compute Divide in Machine Learning: A Threat to Academic Contribution and Scrutiny?, arXiv, doi:10.48550/arXiv.2401.02452, arXiv:2401.02452, retrieved 2026年05月27日
This artificial neural network-related article is a stub. You can help Wikipedia by adding missing information.