Search Results for author: Hanxu Hou

Found 2 papers, 0 papers with code

Learning From Correctness Without Prompting Makes LLM Efficient Reasoner

no code implementations • 28 Mar 2024 • Yuxuan Yao, Han Wu, Zhijiang Guo, Biyan Zhou, Jiahui Gao, Sichun Luo, Hanxu Hou, Xiaojin Fu, Linqi Song

Large language models (LLMs) have demonstrated outstanding performance across various tasks, yet they still exhibit limitations such as hallucination, unfaithful reasoning, and toxic content.

Hallucination

Improved Quantization Strategies for Managing Heavy-tailed Gradients in Distributed Learning

no code implementations • 2 Feb 2024 • Guangfeng Yan, Tan Li, Yuanzhang Xiao, Hanxu Hou, Linqi Song

We consider a general family of heavy-tail gradients that follow a power-law distribution and aim to minimize the error resulting from quantization, thereby determining optimal values for two critical parameters: the truncation threshold and the quantization density.

Quantization
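
To make the two parameters named in the abstract concrete, here is a minimal sketch of a truncated stochastic quantizer. It is not the authors' method: the function name `truncated_quantize`, the evenly spaced levels, and the stochastic rounding rule are all assumptions chosen for illustration. Here `threshold` plays the role of the truncation threshold and `levels` stands in for the quantization density.

```python
import random


def truncated_quantize(grad, threshold, levels, rng=random):
    """Clip each value to [-threshold, threshold], then stochastically round
    it onto `levels` evenly spaced points in that interval.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if threshold <= 0 or levels < 2:
        raise ValueError("threshold must be positive and levels at least 2")
    step = 2.0 * threshold / (levels - 1)  # spacing between adjacent levels
    out = []
    for g in grad:
        # Truncation: heavy-tailed outliers are clipped to the threshold.
        clipped = max(-threshold, min(threshold, g))
        pos = (clipped + threshold) / step   # position measured in steps
        lower = min(int(pos), levels - 1)    # index of the level at or below
        frac = pos - lower                   # distance to that level, in [0, 1)
        # Round up with probability `frac`, so values inside the clipping
        # range are unbiased in expectation.
        idx = lower + (1 if rng.random() < frac else 0)
        idx = min(idx, levels - 1)           # guard against float round-off
        out.append(-threshold + idx * step)
    return out


if __name__ == "__main__":
    rng = random.Random(0)
    print(truncated_quantize([0.3, -0.7, 2.5], threshold=1.0, levels=3, rng=rng))
```

A smaller `threshold` clips more of the tail but shrinks the spacing between levels, while a larger one does the opposite. This is the trade-off the abstract describes as choosing the two parameters to minimize quantization error.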
