1 code implementation • 30 Apr 2024 • Yue Li, Baiqiao Yin, Jinfu Liu, Jiajun Wen, Jiaying Lin, Mengyuan Liu
In recent years, Event Sound Source Localization has been widely applied in various fields.
1 code implementation • 29 Apr 2024 • Junhao Cheng, Baiqiao Yin, Kaixin Cai, Minbin Huang, Hanhui Li, Yuxin He, Xi Lu, Yue Li, Yifei Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang
To address this issue, we introduce TheaterGen, a training-free framework that integrates large language models (LLMs) and text-to-image (T2I) models to provide the capability of multi-turn image generation.
1 code implementation • 25 Apr 2024 • Jiaying Lin, Jiajun Wen, Mengyuan Liu, Jinfu Liu, Baiqiao Yin, Yue Li
The task of spatiotemporal action localization in chaotic scenes is a challenging task toward advanced video understanding.
1 code implementation • 24 Apr 2024 • Jinfu Liu, Baiqiao Yin, Jiaying Lin, Jiajun Wen, Yue Li, Mengyuan Liu
Skeleton-based action recognition has gained considerable traction thanks to its utilization of succinct and robust skeletal representations.
Ranked #1 on Skeleton Based Action Recognition on UAV-Human
no code implementations • 25 Aug 2023 • Chao Wu, Peng Chen, Baiqiao Yin, Zijuan Lin, Chen Jiang, Di Yu, Changhong Zou, Chunwang Lui
Social networks exhibit a complex graph-like structure due to the uncertainty surrounding potential collaborations among participants.