Selecting optimal combination of data channels for semantic segmentation in city information modelling (CIM)

Yuanzhi Cai, Hong Huang, Kaiyang Wang, Cheng Zhang*, Lei Fan, Fangyu Guo

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

20 Citations (Scopus)

Abstract

Over the last decade, a 3D reconstruction technique has been developed to present the latest as-is information for various objects and build the city information models. Meanwhile, deep learning based approaches are employed to add semantic information to the models. Studies have proved that the accuracy of the model could be improved by combining multiple data channels (e.g., XYZ, Intensity, D, and RGB). Nevertheless, the redundant data channels in large-scale datasets may cause high computation cost and time during data processing. Few researchers have addressed the question of which combination of channels is optimal in terms of overall accuracy (OA) and mean intersection over union (mIoU). Therefore, a framework is proposed to explore an efficient data fusion approach for semantic segmentation by selecting an optimal combination of data channels. In the framework, a total of 13 channel combinations are investigated to pre-process data and the encoder-to-decoder structure is utilized for network permutations. A case study is carried out to investigate the efficiency of the proposed approach by adopting a city-level benchmark dataset and applying nine networks. It is found that the combination of IRGB channels provide the best OA performance, while IRGBD channels provide the best mIoU performance.

Original languageEnglish
Article number1367
JournalRemote Sensing
Volume13
Issue number7
DOIs
Publication statusPublished - 1 Apr 2021

Keywords

  • 3D reconstruction
  • City information modelling
  • Data channels
  • Data fusion
  • Point cloud
  • Semantic segmentation

Cite this