ACL ARR May 2021 • Mitchell A Gordon, Kevin Duh, Jared Kaplan
We observe that the development cross-entropy loss of supervised neural machine translation models scales like a power law with the amount of training data and the number of non-embedding parameters in the model.
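A power-law dependence of loss on data size and parameter count can be sketched as follows. This is a hedged illustration only: the additive functional form `L(N, D) = a*N^(-alpha) + b*D^(-beta) + c` and all coefficients below are assumptions for demonstration, not fitted values from the paper.

```python
import numpy as np

def dev_loss(n_params, n_data, a=1e3, alpha=0.5, b=1e4, beta=0.5, c=1.0):
    """Toy scaling-law predictor for development cross-entropy loss.

    n_params: number of non-embedding parameters (N)
    n_data:   amount of training data (D)
    The coefficients are illustrative placeholders, not measured values.
    """
    return a * n_params ** -alpha + b * n_data ** -beta + c

# Scaling up data and parameters lowers the predicted loss, with
# diminishing returns toward the irreducible term c.
small = dev_loss(1e7, 1e6)
large = dev_loss(4e7, 4e6)
print(f"small model/data: {small:.3f}")
print(f"large model/data: {large:.3f}")
```

Under a form like this, fitting the exponents on a few (N, D, loss) measurements lets one extrapolate how much data or capacity is needed to reach a target loss.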
1 Jan 2021 • Mitchell A Gordon
Approximation bounds for neural network pruning attempt to predict the trade-off between sparsity and fidelity as a network is shrunk.
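The sparsity/fidelity trade-off can be made concrete with a minimal sketch of global magnitude pruning on a random weight matrix, measuring fidelity as the relative Frobenius-norm error of the pruned weights. This is an assumed illustration of the trade-off such bounds aim to predict, not the paper's bound itself.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(256, 256))  # stand-in for a trained weight matrix

def prune(weights, sparsity):
    """Zero out the smallest-magnitude fraction `sparsity` of entries."""
    k = int(sparsity * weights.size)
    if k == 0:
        return weights.copy()
    # k-th smallest absolute value serves as the pruning threshold
    thresh = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    return np.where(np.abs(weights) > thresh, weights, 0.0)

# Fidelity degrades as sparsity grows: the error a bound would try to cap.
for s in (0.5, 0.9, 0.99):
    err = np.linalg.norm(W - prune(W, s)) / np.linalg.norm(W)
    print(f"sparsity={s:.2f}  relative error={err:.3f}")
```

An approximation bound would upper-bound this error curve as a function of sparsity, letting one choose a sparsity level before pruning rather than measuring the degradation afterward.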