Search Results for author: Luca Herranz-Celotti

Found 3 papers, 0 papers with code

Stabilizing RNN Gradients through Pre-training

no code implementations23 Aug 2023 Luca Herranz-Celotti, Jean Rouat

However, analysing deep recurrent networks, we identify a new additive source of exponential explosion that emerges from counting gradient paths in a rectangular grid in depth and time.

Less is More! A slim architecture for optimal language translation

no code implementations18 May 2023 Luca Herranz-Celotti, Ermal Rrapaj

Our method outperforms existing techniques in terms of test loss while simultaneously halving the number of parameters.

Translation

Stabilizing Spiking Neuron Training

no code implementations1 Feb 2022 Luca Herranz-Celotti, Jean Rouat

We show how it can be used to reduce the need of extensive grid-search of dampening, sharpness and tail-fatness of the SG.

Cannot find the paper you are looking for? You can Submit a new open access paper.