TY - JOUR
T1 - When Visual Disparity Generation Meets Semantic Segmentation
T2 - A Mutual Encouragement Approach
AU - Zhang, Xiaohong
AU - Chen, Yi
AU - Zhang, Haofeng
AU - Wang, Shuihua
AU - Lu, Jianfeng
AU - Yang, Jingyu
N1 - Publisher Copyright:
© 2000-2011 IEEE.
PY - 2021/3
Y1 - 2021/3
N2 - Semantic segmentation and depth estimation play important roles in the field of autonomous driving. In recent years, the advantages of Convolutional Neural Networks (CNNs) have allowed these two topics to flourish. However, people always solve these two tasks separately and rarely solve them in a united model. In this paper, we propose a Mutual Encouragement Network (MENet), which includes a semantic segmentation branch and a disparity regression branch, and simultaneously generates semantic map and visual disparity. In the cost volume construction phase, the depth information is embedded in the semantic segmentation branch to increase contextual understanding. Similarly, the semantic information is also included in the disparity regression branch to generate more accurate disparity. Two branches mutually promote each other during training phase and inference phase. We conducted our method on the popular dataset KITTI, and the experimental results show that our method can outperform the state-of-the-art methods on both visual disparity generation and semantic segmentation. In addition, extensive ablation studies also demonstrate that the two tasks in our method can facilitate each other significantly with the proposed approach.
AB - Semantic segmentation and depth estimation play important roles in the field of autonomous driving. In recent years, the advantages of Convolutional Neural Networks (CNNs) have allowed these two topics to flourish. However, people always solve these two tasks separately and rarely solve them in a united model. In this paper, we propose a Mutual Encouragement Network (MENet), which includes a semantic segmentation branch and a disparity regression branch, and simultaneously generates semantic map and visual disparity. In the cost volume construction phase, the depth information is embedded in the semantic segmentation branch to increase contextual understanding. Similarly, the semantic information is also included in the disparity regression branch to generate more accurate disparity. Two branches mutually promote each other during training phase and inference phase. We conducted our method on the popular dataset KITTI, and the experimental results show that our method can outperform the state-of-the-art methods on both visual disparity generation and semantic segmentation. In addition, extensive ablation studies also demonstrate that the two tasks in our method can facilitate each other significantly with the proposed approach.
KW - Scene parsing
KW - mutual encouragement network (MENet)
KW - semantic segmentation
KW - stereo matching
UR - http://www.scopus.com/inward/record.url?scp=85102440932&partnerID=8YFLogxK
U2 - 10.1109/TITS.2020.3027556
DO - 10.1109/TITS.2020.3027556
M3 - Article
AN - SCOPUS:85102440932
SN - 1524-9050
VL - 22
SP - 1853
EP - 1867
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 3
M1 - 9244074
ER -