Search Results for author: Ard Louis

Found 2 papers, 0 papers with code

An exactly solvable model for emergence and scaling laws

no code implementations26 Apr 2024 Yoonsoo Nam, Nayara Fonseca, Seok Hyeong Lee, Ard Louis

Deep learning models can exhibit what appears to be a sudden ability to solve a new problem as training time ($T$), training data ($D$), or model size ($N$) increases, a phenomenon known as emergence.

Why flatness does and does not correlate with generalization for deep neural networks

no code implementations10 Mar 2021 Shuofeng Zhang, Isaac Reid, Guillermo Valle Pérez, Ard Louis

As an alternative to flatness measures, we use a function based picture and propose using the log of Bayesian prior upon initialization, $\log P(f)$, as a predictor of the generalization when a DNN converges on function $f$ after training to zero error.

Image Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.