1 code implementation • 15 Apr 2024 • Tamay Besiroglu, Ege Erdil, Matthew Barnett, Josh You
Hoffmann et al. (2022) propose three methods for estimating a compute-optimal scaling law.