Search Results for author: Yongcheng Zeng

Found 1 paper, 1 paper with code

Token-level Direct Preference Optimization

1 code implementation • 18 Apr 2024 • Yongcheng Zeng, Guoqing Liu, Weiyu Ma, Ning Yang, Haifeng Zhang, Jun Wang

Fine-tuning pre-trained Large Language Models (LLMs) is essential to align them with human values and intentions.
