Search Results for author: Alex Stein

Found 3 papers, 3 papers with code

Transformers Can Do Arithmetic with the Right Embeddings

1 code implementation • 27 May 2024 • Sean McLeish, Arpit Bansal, Alex Stein, Neel Jain, John Kirchenbauer, Brian R. Bartoldson, Bhavya Kailkhura, Abhinav Bhatele, Jonas Geiping, Avi Schwarzschild, Tom Goldstein

The poor performance of transformers on arithmetic tasks seems to stem in large part from their inability to keep track of the exact position of each digit inside of a large span of digits.

Coercing LLMs to do and reveal (almost) anything

1 code implementation • 21 Feb 2024 • Jonas Geiping, Alex Stein, Manli Shu, Khalid Saifullah, Yuxin Wen, Tom Goldstein

It has recently been shown that adversarial attacks on large language models (LLMs) can "jailbreak" the model into making harmful statements.

Neural Auctions Compromise Bidder Information

1 code implementation • 28 Feb 2023 • Alex Stein, Avi Schwarzschild, Michael Curry, Tom Goldstein, John Dickerson

It has been shown that neural networks can be used to approximate optimal mechanisms while satisfying the constraints that an auction be strategyproof and individually rational.
