Search Results for author: Zouying Cao

Found 3 papers, 2 papers with code

Head-wise Shareable Attention for Large Language Models

1 code implementation • 19 Feb 2024 • Zouying Cao, Yifei Yang, Hai Zhao

In this paper, we present a perspective on $\textit{$\textbf{head-wise shareable attention for large language models}$}$.

22,381

Paper
Code

LaCo: Large Language Model Pruning via Layer Collapse

1 code implementation • 17 Feb 2024 • Yifei Yang, Zouying Cao, Hai Zhao

Large language models (LLMs) based on transformer are witnessing a notable trend of size expansion, which brings considerable costs to both model training and inference.

Knowledge Distillation Language Modelling +2

22,381

Paper
Code

AutoHall: Automated Hallucination Dataset Generation for Large Language Models

no code implementations • 30 Sep 2023 • Zouying Cao, Yifei Yang, Hai Zhao

While Large language models (LLMs) have garnered widespread applications across various domains due to their powerful language understanding and generation capabilities, the detection of non-factual or hallucinatory content generated by LLMs remains scarce.

Fact Checking Hallucination

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.