Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations

Yuanzhi Cai; Lei Fan; Peter Atkinson; Cheng Zhang

doi:10.1109/TGRS.2022.3161982

Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations

Yuanzhi Cai, Lei Fan^*, Peter Atkinson, Cheng Zhang

^*Corresponding author for this work

Department of Civil Engineering

Research output: Contribution to journal › Article › peer-review

13 Citations (Scopus)

Abstract

Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination $IZ_{e}D_{e}$ (i.e., intensity, enhanced $Z$ -coordinate, and enhanced range images) outperformed the conventional $I$ RGB and $I$ RGB $D$ channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.

Original language	English
Article number	5702815
Journal	IEEE Transactions on Geoscience and Remote Sensing
Volume	60
DOIs	https://doi.org/10.1109/TGRS.2022.3161982
Publication status	Published - 24 Mar 2022

Keywords

Deep learning
point cloud
semantic segmentation
terrestrial laser scanning (TLS)
transfer learning

Access to Document

10.1109/TGRS.2022.3161982

Cite this

@article{4b76c22a968a4fca92624338aa397e06,

title = "Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations",

abstract = "Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination $IZ_{e}D_{e}$ (i.e., intensity, enhanced $Z$ -coordinate, and enhanced range images) outperformed the conventional $I$ RGB and $I$ RGB $D$ channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.",

keywords = "Deep learning, point cloud, semantic segmentation, terrestrial laser scanning (TLS), transfer learning",

author = "Yuanzhi Cai and Lei Fan and Peter Atkinson and Cheng Zhang",

note = "Funding Information: This work was supported in part by the XJTLU Key Program Special Fund under Grant KSF-E-40, in part by the XJTLU Research Development Fund under Grant RDF-18-01-40, and in part by the XJTLU Research Enhancement Funding under Grant REF-21-01-003 Publisher Copyright: {\textcopyright} 1980-2012 IEEE.",

year = "2022",

month = mar,

day = "24",

doi = "10.1109/TGRS.2022.3161982",

language = "English",

volume = "60",

journal = "IEEE Transactions on Geoscience and Remote Sensing",

issn = "0196-2892",

}

TY - JOUR

T1 - Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations

AU - Cai, Yuanzhi

AU - Fan, Lei

AU - Atkinson, Peter

AU - Zhang, Cheng

N1 - Funding Information: This work was supported in part by the XJTLU Key Program Special Fund under Grant KSF-E-40, in part by the XJTLU Research Development Fund under Grant RDF-18-01-40, and in part by the XJTLU Research Enhancement Funding under Grant REF-21-01-003 Publisher Copyright: © 1980-2012 IEEE.

PY - 2022/3/24

Y1 - 2022/3/24

N2 - Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination $IZ_{e}D_{e}$ (i.e., intensity, enhanced $Z$ -coordinate, and enhanced range images) outperformed the conventional $I$ RGB and $I$ RGB $D$ channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.

AB - Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination $IZ_{e}D_{e}$ (i.e., intensity, enhanced $Z$ -coordinate, and enhanced range images) outperformed the conventional $I$ RGB and $I$ RGB $D$ channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.

KW - Deep learning

KW - point cloud

KW - semantic segmentation

KW - terrestrial laser scanning (TLS)

KW - transfer learning

UR - http://www.scopus.com/inward/record.url?scp=85128824373&partnerID=8YFLogxK

U2 - 10.1109/TGRS.2022.3161982

DO - 10.1109/TGRS.2022.3161982

M3 - Article

AN - SCOPUS:85128824373

SN - 0196-2892

VL - 60

JO - IEEE Transactions on Geoscience and Remote Sensing

JF - IEEE Transactions on Geoscience and Remote Sensing

M1 - 5702815

ER -

Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this