no code implementations • 3 Aug 2022 • Chuhan Xu, Pablo Coen-Pirani, Xia Jiang
We also find some interesting interacting pairs of hyperparameters such as learning rate and momentum, learning rate and decay, and batch size and epochs.