Search Results for author: Shaoyang Xu

Found 3 papers, 1 papers with code

ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation

no code implementations22 May 2024 Weilong Dong, Xinwei Wu, Renren Jin, Shaoyang Xu, Deyi Xiong

From the perspective of representation engineering, ConTrans refines concept vectors in value alignment from a source LLM (usually a weak yet aligned LLM).

Exploring Multilingual Concepts of Human Value in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?

1 code implementation28 Feb 2024 Shaoyang Xu, Weilong Dong, Zishan Guo, Xinwei Wu, Deyi Xiong

Drawing from our findings on multilingual value alignment, we prudently provide suggestions on the composition of multilingual data for LLMs pre-training: including a limited number of dominant languages for cross-lingual alignment transfer while avoiding their excessive prevalence, and keeping a balanced distribution of non-dominant languages.

Cross-Lingual Transfer Philosophy

Cannot find the paper you are looking for? You can Submit a new open access paper.