Search Results for author: Luxi Lin

Found 1 paper, 1 paper with code

Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference

1 code implementation • 9 May 2024 • Zhihang Lin, Mingbao Lin, Luxi Lin, Rongrong Ji

Our approach is inspired by two intriguing phenomena we have observed: (1) the attention sink phenomenon that is prevalent in LLMs also persists in MLLMs, suggesting that initial tokens and nearest tokens receive the majority of attention, while middle vision tokens garner minimal attention in deep layers; (2) the presence of information migration, which implies that visual information is transferred to subsequent text tokens within the first few layers of MLLMs.
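The two observations above can be illustrated with a toy sketch. This is not the authors' implementation: the function names `vision_attention_share` and `withdraw_vision_tokens`, the token layout, and the attention values are all hypothetical, chosen only to show the idea that vision tokens receiving little attention in deep layers might be dropped from the sequence.

```python
from typing import List, Sequence


def vision_attention_share(attn_row: Sequence[float],
                           vis_start: int, vis_end: int) -> float:
    """Fraction of one query's attention mass that falls on the
    vision tokens at positions [vis_start, vis_end)."""
    total = sum(attn_row)
    if total == 0:
        return 0.0
    return sum(attn_row[vis_start:vis_end]) / total


def withdraw_vision_tokens(tokens: List[str],
                           vis_start: int, vis_end: int) -> List[str]:
    """Return the sequence with positions [vis_start, vis_end) removed,
    leaving the tokens before and after the vision span."""
    return tokens[:vis_start] + tokens[vis_end:]


# Made-up attention row for the last text token in a deep layer:
# most mass sits on the initial token and the nearest token.
attn_row = [0.50, 0.02, 0.03, 0.05, 0.40]
tokens = ["<bos>", "img_0", "img_1", "img_2", "text_0"]

share = vision_attention_share(attn_row, 1, 4)
kept = withdraw_vision_tokens(tokens, 1, 4)
```

In this toy case the three middle vision tokens hold only about a tenth of the attention mass, so removing them leaves the initial and nearest tokens, which carry most of the weight. How and at which layer tokens are actually withdrawn is specified in the paper and its code release, not here.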
