Search Results for author: Zhichen Dong

Found 3 papers, 3 papers with code

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models

1 code implementation29 May 2024 Zhanhui Zhou, Zhixuan Liu, Jie Liu, Zhichen Dong, Chao Yang, Yu Qiao

In this work, we introduce $\textit{weak-to-strong search}$, framing the alignment of a large language model as a test-time greedy search to maximize the log-likelihood difference between small tuned and untuned models while sampling from the frozen large model.

Instruction Following Language Modelling +1

Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey

1 code implementation14 Feb 2024 Zhichen Dong, Zhanhui Zhou, Chao Yang, Jing Shao, Yu Qiao

Large Language Models (LLMs) are now commonplace in conversation applications.

Cannot find the paper you are looking for? You can Submit a new open access paper.