Search Results for author: Shixiong Zhao

Found 2 papers, 0 papers with code

XL3M: A Training-free Framework for LLM Length Extension Based on Segment-wise Inference

no code implementations · 28 May 2024 · Shengnan Wang, Youhui Bai, Lin Zhang, Pingyi Zhou, Shixiong Zhao, Gong Zhang, Sen Wang, Renhai Chen, Hua Xu, Hongwei Sun

Under the XL3M framework, the input context is first decomposed into multiple short sub-contexts, where each sub-context contains an independent segment and a common "question", which is a few tokens taken from the end of the original context.
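The decomposition described above can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `decompose_context`, the fixed `segment_len`, the `question_len` parameter, and the choice to exclude the question tokens from the segments are all assumptions made for illustration.

```python
from typing import List


def decompose_context(
    tokens: List[str], segment_len: int, question_len: int
) -> List[List[str]]:
    """Split a long token sequence into short sub-contexts.

    Each sub-context is one segment followed by the shared "question",
    i.e. the last `question_len` tokens of the original context.
    Assumption: the question tokens are not also part of a segment.
    """
    if segment_len <= 0 or question_len <= 0:
        raise ValueError("segment_len and question_len must be positive")
    if question_len >= len(tokens):
        raise ValueError("question_len must be shorter than the context")

    # The common "question": a few tokens from the end of the context.
    question = tokens[-question_len:]
    # Everything before the question is cut into independent segments.
    body = tokens[:-question_len]
    segments = [
        body[i : i + segment_len] for i in range(0, len(body), segment_len)
    ]
    # Every sub-context pairs one segment with the same question.
    return [segment + question for segment in segments]
```

Each returned sub-context is short enough to be processed on its own; how the framework then uses the per-segment results is not stated in this listing and is left out of the sketch.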

Language Modelling, Large Language Model

Neural Architecture Search via Ensemble-based Knowledge Distillation

no code implementations · 29 Sep 2021 · Fanxin Li, Shixiong Zhao, Haowen Pi, Yuhao QING, Yichao Fu, Sen Wang, Heming Cui

Neural Architecture Search (NAS) automatically searches a given search space for well-performing network architectures.

Knowledge Distillation, Neural Architecture Search
