Search Results for author: Liam Madden

Found 4 papers, 0 papers with code

Upper and lower memory capacity bounds of transformers for next-token prediction

no code implementations • 22 May 2024 • Liam Madden, Curtis Fox, Christos Thrampoulidis

Given a sequence of tokens, such as words, the task of next-token prediction is to predict the next-token conditional probability distribution.

Decoder Memorization

Memory capacity of three-layer neural networks with non-polynomial activations

no code implementations • 22 May 2024 • Liam Madden

The minimal number of neurons required for a feedforward neural network to interpolate $n$ generic input-output pairs from $\mathbb{R}^d\times \mathbb{R}$ is $\Theta(\sqrt{n})$.

Memory capacity of two layer neural networks with smooth activations

no code implementations • 3 Aug 2023 • Liam Madden, Christos Thrampoulidis

In order to analyze general activations, we derive the precise generic rank of the network's Jacobian, which can be written in terms of Hadamard powers and the Khatri-Rao product.

Optimization and Learning with Information Streams: Time-varying Algorithms and Applications

no code implementations • 17 Oct 2019 • Emiliano Dall'Anese, Andrea Simonetto, Stephen Becker, Liam Madden

Approaches for the design of time-varying or online first-order optimization methods are discussed, with emphasis on algorithms that can handle errors in the gradient, as may arise when the gradient is estimated.
