ACNet: Attention Based Network to Exploit Complementary Features for RGBD Semantic Segmentation

24 May 2019 · Xinxin Hu, Kailun Yang, Lei Fei, Kaiwei Wang

Compared to RGB semantic segmentation, RGBD semantic segmentation can achieve better performance by taking depth information into consideration. However, it remains problematic for contemporary segmenters to effectively exploit RGBD information, since the feature distributions of RGB and depth (D) images vary significantly across scenes. In this paper, we propose an Attention Complementary Network (ACNet) that selectively gathers features from the RGB and depth branches. The main contributions lie in the Attention Complementary Module (ACM) and the architecture with three parallel branches. More precisely, ACM is a channel attention-based module that extracts weighted features from the RGB and depth branches. The architecture preserves the inference of the original RGB and depth branches while enabling the fusion branch at the same time. Based on these structures, ACNet is capable of exploiting more high-quality features from different channels. We evaluate our model on the SUN-RGBD and NYUDv2 datasets and show that it outperforms state-of-the-art methods. In particular, a mIoU score of 48.3% on the NYUDv2 test set is achieved with ResNet50. We will release our source code based on PyTorch and the trained segmentation model at https://github.com/anheidelonghu/ACNet.
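Since the released code is PyTorch, the channel-attention fusion the abstract describes can be sketched as below. This is a minimal illustration assuming an SE-style channel attention (global average pooling, a 1x1 convolution, and a sigmoid gate); the class names ChannelAttention and ACMFusion and the specific layer choices are assumptions for illustration, not the authors' exact implementation.

```python
import torch
import torch.nn as nn


class ChannelAttention(nn.Module):
    """SE-style channel attention: global average pooling followed by a
    1x1 convolution and a sigmoid yields one weight per channel, which
    reweights the input feature map channel-wise."""

    def __init__(self, channels):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Conv2d(channels, channels, kernel_size=1)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x):
        w = self.sigmoid(self.fc(self.pool(x)))  # shape (N, C, 1, 1)
        return x * w  # broadcast multiply over spatial dimensions


class ACMFusion(nn.Module):
    """Illustrative fusion step: each branch's features are reweighted by
    its own channel attention, then summed into the fusion branch, so the
    network can emphasize whichever modality is more informative."""

    def __init__(self, channels):
        super().__init__()
        self.att_rgb = ChannelAttention(channels)
        self.att_depth = ChannelAttention(channels)

    def forward(self, feat_rgb, feat_depth):
        return self.att_rgb(feat_rgb) + self.att_depth(feat_depth)


if __name__ == "__main__":
    fuse = ACMFusion(channels=64)
    rgb = torch.randn(2, 64, 120, 160)    # RGB branch features
    depth = torch.randn(2, 64, 120, 160)  # depth branch features
    out = fuse(rgb, depth)
    print(out.shape)  # torch.Size([2, 64, 120, 160])
```

Because each branch is reweighted before the sum, the fusion branch can lean on RGB channels where appearance is informative and on depth channels where geometry dominates, which is the complementarity the abstract claims.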


Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|------|---------|-------|--------|-------|-------------|
| Semantic Segmentation | KITTI-360 | ACNet (ResNet50) | mIoU | 61.57 | #4 |
| Thermal Image Segmentation | MFN Dataset | ACNet | mIoU | 46.3 | #38 |
| Semantic Segmentation | NYU Depth v2 | ACNet | Mean IoU | 48.3% | #66 |
| Thermal Image Segmentation | PST900 | ACNet | mIoU | 71.81 | #13 |
| Semantic Segmentation | SUN-RGBD | ACNet | Mean IoU | 48.1% | #25 |

Methods

No methods listed for this paper.