Scaling Laws
Definition
Scaling laws in AI describe empirically observed, predictable relationships between model performance and three key variables: the number of model parameters, the amount of training data, and the amount of compute used for training. Research from OpenAI, DeepMind, and Anthropic has shown that loss on language modeling tasks decreases as a smooth power law as each of these variables increases, enabling researchers to predict performance improvements before training large models.
Scaling laws have profound strategic implications for the AI industry and for enterprise buyers. They partially explain why frontier models have improved so rapidly — increased investment in compute and data has yielded predictable capability gains. However, scaling laws also have limits: they describe average performance on benchmark distributions, not specific capabilities or safety properties. For organizations evaluating AI investments, scaling laws provide a framework for understanding why larger models tend to perform better on general tasks, while also highlighting that domain-specific fine-tuning and architectural innovations can outperform brute-scale improvements for targeted applications.
Related Terms
Source
Last updated: May 12, 2026