Search Results for author: Hanxu Hou

Found 2 papers, 0 papers with code

Learning From Correctness Without Prompting Makes LLM Efficient Reasoner

no code implementations • 28 Mar 2024 • Yuxuan Yao, Han Wu, Zhijiang Guo, Biyan Zhou, Jiahui Gao, Sichun Luo, Hanxu Hou, Xiaojin Fu, Linqi Song

Large language models (LLMs) have demonstrated outstanding performance across various tasks, yet they still exhibit limitations such as hallucination, unfaithful reasoning, and toxic content.

Hallucination

Paper
Add Code

Improved Quantization Strategies for Managing Heavy-tailed Gradients in Distributed Learning

no code implementations • 2 Feb 2024 • Guangfeng Yan, Tan Li, Yuanzhang Xiao, Hanxu Hou, Linqi Song

We consider a general family of heavy-tail gradients that follow a power-law distribution, we aim to minimize the error resulting from quantization, thereby determining optimal values for two critical parameters: the truncation threshold and the quantization density.

Quantization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.