Search Results for author: Zouying Cao

Found 3 papers, 2 papers with code

Head-wise Shareable Attention for Large Language Models

1 code implementation19 Feb 2024 Zouying Cao, Yifei Yang, Hai Zhao

In this paper, we present a perspective on $\textit{$\textbf{head-wise shareable attention for large language models}$}$.

LaCo: Large Language Model Pruning via Layer Collapse

1 code implementation17 Feb 2024 Yifei Yang, Zouying Cao, Hai Zhao

Large language models (LLMs) based on transformer are witnessing a notable trend of size expansion, which brings considerable costs to both model training and inference.

Knowledge Distillation Language Modelling +2

AutoHall: Automated Hallucination Dataset Generation for Large Language Models

no code implementations30 Sep 2023 Zouying Cao, Yifei Yang, Hai Zhao

While Large language models (LLMs) have garnered widespread applications across various domains due to their powerful language understanding and generation capabilities, the detection of non-factual or hallucinatory content generated by LLMs remains scarce.

Fact Checking Hallucination

Cannot find the paper you are looking for? You can Submit a new open access paper.