1 code implementation • 2 May 2024 • Xiaoqi Zhao, Youwei Pang, Wei Ji, Baicheng Sheng, Jiaming Zuo, Lihe Zhang, Huchuan Lu
Different from the context-independent (CI) concepts such as human, car, and airplane, context-dependent (CD) concepts require higher visual understanding ability, such as camouflaged object and medical lesion.
2 code implementations • 11 Apr 2024 • Qian Yu, Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu
Dichotomous Image Segmentation (DIS) has recently emerged towards high-precision object segmentation from high-resolution natural images.
Ranked #1 on Dichotomous Image Segmentation on DIS-VD
1 code implementation • 5 Dec 2023 • Xiaoqi Zhao, Youwei Pang, Zhenyu Chen, Qian Yu, Lihe Zhang, Hanqi Liu, Jiaming Zuo, Huchuan Lu
We conduct a comprehensive study on a new task named power battery detection (PBD), which aims to localize the dense cathode and anode plates endpoints from X-ray images to evaluate the quality of power batteries.
no code implementations • 19 Nov 2023 • Youwei Pang, Xiaoqi Zhao, Jiaming Zuo, Lihe Zhang, Huchuan Lu
With the proposed dataset and baseline, we hope that this new task with more practical value can further expand the research on open-vocabulary dense prediction tasks.
1 code implementation • 31 Oct 2023 • Youwei Pang, Xiaoqi Zhao, Tian-Zhu Xiang, Lihe Zhang, Huchuan Lu
Apart from the high intrinsic similarity between camouflaged objects and their background, objects are usually diverse in scale, fuzzy in appearance, and even severely occluded.
Ranked #1 on Camouflaged Object Segmentation on Camouflaged Animal Dataset (using extra training data)
1 code implementation • 23 Jul 2023 • Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu
Specifically, unlike existing methods that over-specialize in a single task or a subset of tasks, ComPtr starts from the more general concept of bi-source dense prediction.
Ranked #14 on Semantic Segmentation on NYU Depth v2
2 code implementations • 20 Mar 2023 • Xiaoqi Zhao, Hongpeng Jia, Youwei Pang, Long Lv, Feng Tian, Lihe Zhang, Weibing Sun, Huchuan Lu
Next, we expand the single-scale SU to the intra-layer multi-scale SU, which can provide the decoder with both pixel-level and structure-level difference information.
1 code implementation • 18 Mar 2023 • Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Lei Zhang
They ignore two key problems when the encoder exchanges information with the decoder: one is the lack of interference control mechanism between them, the other is without considering the disparity of the contributions from different encoder levels.
1 code implementation • 18 Mar 2023 • Xiaoqi Zhao, Shijie Chang, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu
In the static object predictor, the RGB source is converted to depth and static saliency sources, simultaneously.
1 code implementation • ICCV 2023 • Jiayu Sun, Ke Xu, Youwei Pang, Lihe Zhang, Huchuan Lu, Gerhard Hancke, Rynson Lau
In this paper, we propose a novel method to detect shadows from raw images.
no code implementations • 3 Oct 2022 • Hongsheng Wang, Xiaoqi Zhao, Youwei Pang, Jinqing Qi
In this research, we propose a rich prototype generation module (RPGM) and a recurrent prediction enhancement module (RPEM) to reinforce the prototype learning paradigm and build a unified memory-augmented decoder for few-shot segmentation, respectively.
1 code implementation • 9 Mar 2022 • Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu
In this paper, we propose a novel multi-task and multi-modal filtered transformer (MMFT) network for RGB-D salient object detection (SOD).
1 code implementation • 4 Dec 2021 • Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu
Most of the existing bi-modal (RGB-D and RGB-T) salient object detection methods utilize the convolution operation and construct complex interweave fusion structures to achieve cross-modal information integration.
1 code implementation • 11 Aug 2021 • Xiaoqi Zhao, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu
In this paper, we propose a novel multi-source fusion network for zero-shot video object segmentation.
Ranked #1 on Video Object Segmentation on FBMS (Jaccard (Mean) metric)
1 code implementation • 29 Jan 2021 • Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Xiang Ruan
Existing CNNs-Based RGB-D salient object detection (SOD) networks are all required to be pretrained on the ImageNet to learn the hierarchy features which helps provide a good initialization.
1 code implementation • CVPR 2020 • Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu
To obtain more efficient multi-scale features from the integrated features, the self-interaction modules are embedded in each decoder unit.
3 code implementations • ECCV 2020 • Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Lei Zhang
With the help of multilevel gate units, the valuable context information from the encoder can be optimally transmitted to the decoder.
Ranked #15 on Dichotomous Image Segmentation on DIS-TE4
1 code implementation • ECCV 2020 • Xiaoqi Zhao, Lihe Zhang, Youwei Pang, Huchuan Lu, Lei Zhang
In this work, we design a single stream network to directly use the depth map to guide early fusion and middle fusion between RGB and depth, which saves the feature encoder of the depth stream and achieves a lightweight and real-time model.
Ranked #15 on Thermal Image Segmentation on RGB-T-Glass-Segmentation
1 code implementation • ECCV 2020 • Youwei Pang, Lihe Zhang, Xiaoqi Zhao, Huchuan Lu
The main purpose of RGB-D salient object detection (SOD) is how to better integrate and utilize cross-modal fusion information.
Ranked #5 on RGB-D Salient Object Detection on NJU2K