Search Results for author: Jacob Mitchell Springer

Found 2 papers, 2 papers with code

Repetition Improves Language Model Embeddings

2 code implementations23 Feb 2024 Jacob Mitchell Springer, Suhas Kotha, Daniel Fried, Graham Neubig, aditi raghunathan

In this work, we address an architectural limitation of autoregressive models: token embeddings cannot contain information from tokens that appear later in the input.

Language Modelling

Understanding Catastrophic Forgetting in Language Models via Implicit Inference

1 code implementation18 Sep 2023 Suhas Kotha, Jacob Mitchell Springer, aditi raghunathan

We lack a systematic understanding of the effects of fine-tuning (via methods such as instruction-tuning or reinforcement learning from human feedback), particularly on tasks outside the narrow fine-tuning distribution.

In-Context Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.