TY - JOUR
T1 - Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations
AU - Cai, Yuanzhi
AU - Fan, Lei
AU - Atkinson, Peter
AU - Zhang, Cheng
N1 - Funding Information:
This work was supported in part by the XJTLU Key Program Special Fund under Grant KSF-E-40, in part by the XJTLU Research Development Fund under Grant RDF-18-01-40, and in part by the XJTLU Research Enhancement Funding under Grant REF-21-01-003
Publisher Copyright:
© 1980-2012 IEEE.
PY - 2022/3/24
Y1 - 2022/3/24
N2 - Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination $IZ_{e}D_{e}$ (i.e., intensity, enhanced $Z$ -coordinate, and enhanced range images) outperformed the conventional $I$ RGB and $I$ RGB $D$ channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.
AB - Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination $IZ_{e}D_{e}$ (i.e., intensity, enhanced $Z$ -coordinate, and enhanced range images) outperformed the conventional $I$ RGB and $I$ RGB $D$ channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.
KW - Deep learning
KW - point cloud
KW - semantic segmentation
KW - terrestrial laser scanning (TLS)
KW - transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85128824373&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2022.3161982
DO - 10.1109/TGRS.2022.3161982
M3 - Article
AN - SCOPUS:85128824373
SN - 0196-2892
VL - 60
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5702815
ER -