Search Results for author: Daniel Zhaohan Guo

Found 1 papers, 0 papers with code

Understanding the performance gap between online and offline alignment algorithms

no code implementations • 14 May 2024 • Yunhao Tang, Daniel Zhaohan Guo, Zeyu Zheng, Daniele Calandriello, Yuan Cao, Eugene Tarassov, Rémi Munos, Bernardo Ávila Pires, Michal Valko, Yong Cheng, Will Dabney

However, rising popularity in offline alignment algorithms challenge the need for on-policy sampling in RLHF.

Language Modelling Large Language Model

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.