April 24, 2026
Scaling Laws for Neural Language Models
Notes
Power-law relationships between compute, data, parameters, and loss. Empirical scaling science.
Browse posts by tag
Power-law relationships between compute, data, parameters, and loss. Empirical scaling science.