no code implementations • 27 Apr 2024 • Wenzhen Yue, Xianghua Ying, Ruohao Guo, Dongdong Chen, Ji Shi, Bowei Xing, Yuqing Zhu, Taiyan Chen
By focusing the attention on the sub-adjacent areas, we make the reconstruction of anomalies more challenging, thereby enhancing their detectability.
no code implementations • 28 Oct 2023 • Ruohao Guo, Yaru Chen, Yanyu Qi, Wenzhen Yue, Dantong Niu, Xianghua Ying
In this paper, we propose a new multi-modal task, namely audio-visual instance segmentation (AVIS), in which the goal is to simultaneously identify, segment, and track individual sounding object instances in audible videos.