no code implementations • 5 Mar 2018 • Mor Shpigel Nacson, Jason D. Lee, Suriya Gunasekar, Pedro H. P. Savarese, Nathan Srebro, Daniel Soudry
We show that for a large family of losses with super-polynomially decaying tails, gradient descent iterates on linear networks of any depth converge in the direction of the $L_2$ maximum-margin solution, while this does not hold for losses with heavier tails.
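For context, here is a minimal formal sketch of the claimed convergence, assuming the standard linearly separable setup; the notation ($w(t)$ for the iterates, $(x_n, y_n)$ for the data, $\hat{w}$ for the max-margin separator) is ours, not from the listing:

```latex
\documentclass{article}
\usepackage{amsmath}
\begin{document}
% Sketch (assumed setup): linearly separable data (x_n, y_n) and a loss whose
% tail decays super-polynomially, e.g. the exponential loss exp(-u).
% The claim is that the gradient descent iterates w(t) converge in direction
% to the L2 maximum-margin separator:
\[
  \lim_{t \to \infty} \frac{w(t)}{\lVert w(t) \rVert_2}
  = \frac{\hat{w}}{\lVert \hat{w} \rVert_2},
  \qquad
  \hat{w} = \operatorname*{arg\,min}_{w} \lVert w \rVert_2
  \quad \text{s.t.} \quad y_n \, w^{\top} x_n \ge 1 \;\; \forall n.
\]
\end{document}
```

For losses with heavier (e.g. polynomially decaying) tails, the abstract states that this directional convergence fails, which is the contrast the paper draws.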
1 code implementation • 22 Nov 2017 • Pedro H. P. Savarese, Mayank Kakodkar, Bruno Ribeiro
We propose a Las Vegas transformation of Markov Chain Monte Carlo (MCMC) estimators of Restricted Boltzmann Machines (RBMs).
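The listing marks one code implementation but reproduces none of it here. As background for what is being transformed, below is a minimal NumPy sketch of a plain block-Gibbs MCMC estimator for a binary RBM's negative-phase statistics; all names, shapes, and the mean-field readout are our assumptions, and the paper's Las Vegas transformation itself is not reproduced.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gibbs_step(v, W, b, c, rng):
    """One block-Gibbs step in a binary RBM: sample hidden units given
    visibles, then visibles given hiddens."""
    h = (rng.random(W.shape[1]) < sigmoid(v @ W + c)).astype(float)
    v_new = (rng.random(W.shape[0]) < sigmoid(W @ h + b)).astype(float)
    return v_new

def mcmc_estimate(v0, W, b, c, n_steps, rng):
    """Crude Monte Carlo estimate of the model expectation E[v h^T],
    the negative-phase term of the RBM log-likelihood gradient."""
    v = v0
    for _ in range(n_steps):
        v = gibbs_step(v, W, b, c, rng)
    # Mean-field hidden probabilities for the final readout (a common choice).
    return np.outer(v, sigmoid(v @ W + c))

# Usage sketch: a small random RBM (sizes are arbitrary).
rng = np.random.default_rng(0)
nv, nh = 6, 4
W = 0.01 * rng.standard_normal((nv, nh))
b, c = np.zeros(nv), np.zeros(nh)
v0 = (rng.random(nv) < 0.5).astype(float)
neg_phase = mcmc_estimate(v0, W, b, c, n_steps=100, rng=rng)
```

A Las Vegas scheme, in the standard sense of the term, runs for a random amount of time but returns an exact answer; per the abstract, the paper's transformation turns estimators like the one above into that form.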
no code implementations • 4 Nov 2016 • Pedro H. P. Savarese, Leonardo O. Mazza, Daniel R. Figueiredo
We evaluate our method on MNIST using fully-connected networks, showing empirical indications that our augmentation facilitates the optimization of deep models, and that it provides high tolerance to full layer removal: the model retains over 90% of its performance even after half of its layers have been randomly removed.
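The abstract does not spell out the augmentation itself, so the sketch below uses an assumed gated residual formulation of a fully-connected block (`GatedResidualBlock` and `remove_random_layers` are hypothetical names) purely to make the layer-removal experiment concrete: driving a block's gate to zero turns it into the identity, mimicking the random removal of half the layers.

```python
import torch
import torch.nn as nn

class GatedResidualBlock(nn.Module):
    """Fully-connected block with a learned scalar gate (assumed form).
    With g = relu(k): output = g * f(x) + (1 - g) * x, so g = 0
    makes the block an exact identity map."""
    def __init__(self, dim):
        super().__init__()
        self.f = nn.Sequential(nn.Linear(dim, dim), nn.ReLU())
        self.k = nn.Parameter(torch.ones(1))  # init at 1 is arbitrary here

    def forward(self, x):
        g = torch.relu(self.k)
        return g * self.f(x) + (1.0 - g) * x

def remove_random_layers(model, fraction):
    """Simulate the robustness test from the abstract: replace a random
    `fraction` of gated blocks with the identity, leave the rest intact."""
    blocks = [m for m in model.modules() if isinstance(m, GatedResidualBlock)]
    drop = torch.randperm(len(blocks))[: int(len(blocks) * fraction)]
    with torch.no_grad():
        for i in drop.tolist():
            blocks[i].k.fill_(0.0)  # gate -> 0: block becomes the identity

# Usage sketch: a 10-block MLP for 784-dim inputs (e.g. flattened MNIST);
# after dropping half the blocks, re-evaluate and compare accuracy.
model = nn.Sequential(nn.Linear(784, 256),
                      *[GatedResidualBlock(256) for _ in range(10)],
                      nn.Linear(256, 10))
remove_random_layers(model, fraction=0.5)
```

Comparing accuracy before and after such a call is the shape of the test behind the reported over-90% retention; the specific gating used in the paper may differ.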