no code implementations • 2 Mar 2024 • Xinyi Yu, Ling Yan, PengTao Jiang, Hao Chen, Bo Li, Lin Yuanbo Wu, Linlin Ou
This innovative approach empowers the network to simultaneously predict masks and depth, enhancing its ability to capture nuanced depth-related information during the instance segmentation process.
no code implementations • 18 Oct 2023 • Weian Mao, Muzhi Zhu, Zheng Sun, Shuaike Shen, Lin Yuanbo Wu, Hao Chen, Chunhua Shen
Most prior encoders rely on atom-wise features, such as angles and distances between atoms, which are not available in this context.
1 code implementation • ICCV 2023 • Kaining Ying, Qing Zhong, Weian Mao, Zhenhua Wang, Hao Chen, Lin Yuanbo Wu, Yifan Liu, Chengxiang Fan, Yunzhi Zhuge, Chunhua Shen
The discrimination of instance embeddings plays a vital role in associating instances across time for online video instance segmentation (VIS).
Ranked #2 on Video Instance Segmentation on Youtube-VIS 2022 Validation (using extra training data)
1 code implementation • 26 Nov 2022 • Zhong Ji, Junhua Hu, Deyin Liu, Lin Yuanbo Wu, Ye Zhao
To implement this task, one needs to extract multi-scale features from both image and text domains, and then perform the cross-modal alignment.
1 code implementation • 18 Aug 2022 • Deyin Liu, Lin Yuanbo Wu, Bo Li, ZongYuan Ge
Our architecture is orthogonal to StackGAN++ , and focuses on person image generation, with all of them together to enrich the spectrum of GANs for the image generation task.