Search Results for author: Sedrick Keh

Found 4 papers, 3 papers with code

Linearizing Large Language Models

1 code implementation10 May 2024 Jean Mercat, Igor Vasiljevic, Sedrick Keh, Kushal Arora, Achal Dave, Adrien Gaidon, Thomas Kollar

Linear transformers have emerged as a subquadratic-time alternative to softmax attention and have garnered significant interest due to their fixed-size recurrent state that lowers inference cost.

In-Context Learning

A Critical Evaluation of AI Feedback for Aligning Large Language Models

1 code implementation19 Feb 2024 Archit Sharma, Sedrick Keh, Eric Mitchell, Chelsea Finn, Kushal Arora, Thomas Kollar

RLAIF first performs supervised fine-tuning (SFT) using demonstrations from a teacher model and then further fine-tunes the model with reinforcement learning (RL), using feedback from a critic model.

Instruction Following reinforcement-learning +1

Asking More Informative Questions for Grounded Retrieval

no code implementations14 Nov 2023 Sedrick Keh, Justin T. Chiu, Daniel Fried

When a model is trying to gather information in an interactive setting, it benefits from asking informative questions.

Question Answering Question Selection +2

Cannot find the paper you are looking for? You can Submit a new open access paper.