When Visual Disparity Generation Meets Semantic Segmentation: A Mutual Encouragement Approach

Xiaohong Zhang; Yi Chen; Haofeng Zhang; Shuihua Wang; Jianfeng Lu; Jingyu Yang

doi:10.1109/TITS.2020.3027556

When Visual Disparity Generation Meets Semantic Segmentation: A Mutual Encouragement Approach

Xiaohong Zhang, Yi Chen, Haofeng Zhang^*, Shuihua Wang^*, Jianfeng Lu, Jingyu Yang

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

13 Citations (Scopus)

Abstract

Semantic segmentation and depth estimation play important roles in the field of autonomous driving. In recent years, the advantages of Convolutional Neural Networks (CNNs) have allowed these two topics to flourish. However, people always solve these two tasks separately and rarely solve them in a united model. In this paper, we propose a Mutual Encouragement Network (MENet), which includes a semantic segmentation branch and a disparity regression branch, and simultaneously generates semantic map and visual disparity. In the cost volume construction phase, the depth information is embedded in the semantic segmentation branch to increase contextual understanding. Similarly, the semantic information is also included in the disparity regression branch to generate more accurate disparity. Two branches mutually promote each other during training phase and inference phase. We conducted our method on the popular dataset KITTI, and the experimental results show that our method can outperform the state-of-the-art methods on both visual disparity generation and semantic segmentation. In addition, extensive ablation studies also demonstrate that the two tasks in our method can facilitate each other significantly with the proposed approach.

Original language	English
Article number	9244074
Pages (from-to)	1853-1867
Number of pages	15
Journal	IEEE Transactions on Intelligent Transportation Systems
Volume	22
Issue number	3
DOIs	https://doi.org/10.1109/TITS.2020.3027556
Publication status	Published - Mar 2021
Externally published	Yes

Keywords

Scene parsing
mutual encouragement network (MENet)
semantic segmentation
stereo matching

Access to Document

10.1109/TITS.2020.3027556

Cite this

@article{597f31b0e02d4fa8975749710714c490,

title = "When Visual Disparity Generation Meets Semantic Segmentation: A Mutual Encouragement Approach",

abstract = "Semantic segmentation and depth estimation play important roles in the field of autonomous driving. In recent years, the advantages of Convolutional Neural Networks (CNNs) have allowed these two topics to flourish. However, people always solve these two tasks separately and rarely solve them in a united model. In this paper, we propose a Mutual Encouragement Network (MENet), which includes a semantic segmentation branch and a disparity regression branch, and simultaneously generates semantic map and visual disparity. In the cost volume construction phase, the depth information is embedded in the semantic segmentation branch to increase contextual understanding. Similarly, the semantic information is also included in the disparity regression branch to generate more accurate disparity. Two branches mutually promote each other during training phase and inference phase. We conducted our method on the popular dataset KITTI, and the experimental results show that our method can outperform the state-of-the-art methods on both visual disparity generation and semantic segmentation. In addition, extensive ablation studies also demonstrate that the two tasks in our method can facilitate each other significantly with the proposed approach.",

keywords = "Scene parsing, mutual encouragement network (MENet), semantic segmentation, stereo matching",

author = "Xiaohong Zhang and Yi Chen and Haofeng Zhang and Shuihua Wang and Jianfeng Lu and Jingyu Yang",

note = "Publisher Copyright: {\textcopyright} 2000-2011 IEEE.",

year = "2021",

month = mar,

doi = "10.1109/TITS.2020.3027556",

language = "English",

volume = "22",

pages = "1853--1867",

journal = "IEEE Transactions on Intelligent Transportation Systems",

issn = "1524-9050",

number = "3",

}

TY - JOUR

T1 - When Visual Disparity Generation Meets Semantic Segmentation

T2 - A Mutual Encouragement Approach

AU - Zhang, Xiaohong

AU - Chen, Yi

AU - Zhang, Haofeng

AU - Wang, Shuihua

AU - Lu, Jianfeng

AU - Yang, Jingyu

PY - 2021/3

Y1 - 2021/3

N2 - Semantic segmentation and depth estimation play important roles in the field of autonomous driving. In recent years, the advantages of Convolutional Neural Networks (CNNs) have allowed these two topics to flourish. However, people always solve these two tasks separately and rarely solve them in a united model. In this paper, we propose a Mutual Encouragement Network (MENet), which includes a semantic segmentation branch and a disparity regression branch, and simultaneously generates semantic map and visual disparity. In the cost volume construction phase, the depth information is embedded in the semantic segmentation branch to increase contextual understanding. Similarly, the semantic information is also included in the disparity regression branch to generate more accurate disparity. Two branches mutually promote each other during training phase and inference phase. We conducted our method on the popular dataset KITTI, and the experimental results show that our method can outperform the state-of-the-art methods on both visual disparity generation and semantic segmentation. In addition, extensive ablation studies also demonstrate that the two tasks in our method can facilitate each other significantly with the proposed approach.

AB - Semantic segmentation and depth estimation play important roles in the field of autonomous driving. In recent years, the advantages of Convolutional Neural Networks (CNNs) have allowed these two topics to flourish. However, people always solve these two tasks separately and rarely solve them in a united model. In this paper, we propose a Mutual Encouragement Network (MENet), which includes a semantic segmentation branch and a disparity regression branch, and simultaneously generates semantic map and visual disparity. In the cost volume construction phase, the depth information is embedded in the semantic segmentation branch to increase contextual understanding. Similarly, the semantic information is also included in the disparity regression branch to generate more accurate disparity. Two branches mutually promote each other during training phase and inference phase. We conducted our method on the popular dataset KITTI, and the experimental results show that our method can outperform the state-of-the-art methods on both visual disparity generation and semantic segmentation. In addition, extensive ablation studies also demonstrate that the two tasks in our method can facilitate each other significantly with the proposed approach.

KW - Scene parsing

KW - mutual encouragement network (MENet)

KW - semantic segmentation

KW - stereo matching

UR - http://www.scopus.com/inward/record.url?scp=85102440932&partnerID=8YFLogxK

U2 - 10.1109/TITS.2020.3027556

DO - 10.1109/TITS.2020.3027556

M3 - Article

AN - SCOPUS:85102440932

SN - 1524-9050

VL - 22

SP - 1853

EP - 1867

JO - IEEE Transactions on Intelligent Transportation Systems

JF - IEEE Transactions on Intelligent Transportation Systems

IS - 3

M1 - 9244074

ER -

When Visual Disparity Generation Meets Semantic Segmentation: A Mutual Encouragement Approach

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this