April 24, 2026
Training Compute-Optimal Large Language Models (Chinchilla)
Notes
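Showed that most large language models of the time were undertrained for their compute budget, and estimated the compute-optimal ratio of training data to parameters (roughly 20 tokens per parameter).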
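As a rough illustration of what a fixed data-to-parameter ratio implies, the sketch below splits a compute budget between model size and training tokens. It rests on two assumptions that are not stated in this note: the common approximation that training cost is about 6 FLOPs per parameter per token (C ≈ 6·N·D), and a rule-of-thumb ratio of about 20 tokens per parameter. The function name `compute_optimal_split` is hypothetical, and the result is a back-of-the-envelope estimate, not the paper's fitted scaling law.

```python
import math


def compute_optimal_split(compute_flops, tokens_per_param=20.0):
    """Estimate (parameters, tokens) for a given training compute budget.

    Assumptions (rules of thumb, not the paper's fitted law):
      - training compute C ~= 6 * N * D FLOPs
      - compute-optimal data size D ~= tokens_per_param * N
    Substituting D gives C ~= 6 * tokens_per_param * N**2.
    """
    if compute_flops <= 0 or tokens_per_param <= 0:
        raise ValueError("compute_flops and tokens_per_param must be positive")
    n_params = math.sqrt(compute_flops / (6.0 * tokens_per_param))
    n_tokens = tokens_per_param * n_params
    return n_params, n_tokens


# Example: a budget of 5.88e23 FLOPs (6 * 70e9 params * 1.4e12 tokens).
n, d = compute_optimal_split(5.88e23)
print(f"params ~ {n:.2e}, tokens ~ {d:.2e}")
```

Under these assumptions, a budget of 5.88e23 FLOPs comes out at about 70 billion parameters and 1.4 trillion tokens, which matches the scale of the Chinchilla model itself. Changing `tokens_per_param` shows how sensitive the split is to the assumed ratio.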