TY - GEN
T1 - Enhanced 3D Semantic Segmentation via Local Polar Encoding and Attention Fusion
AU - Wei, Yujia
AU - Li, Yushi
AU - Wang, Yunzhe
AU - Ji, Chengtao
AU - Jin, Xiaobo
AU - Xu, Chao
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - As a fundamental task in fields such as remote sensing, autonomous vehicles, augmented reality, and robotic navigation, Point-cloud semantic segmentation is critical for interpreting 3D environments, as it involves classifying point cloud data to assign semantic labels to points within a 3D scene. Leveraging spatial positions and other attributes, 3D semantic segmentation enables accurate object representation and categorization. While deep learning has advanced feature extraction from point clouds, developing efficient and accurate segmentation networks remains challenging. To address this, we propose an encoder-decoder framework that integrates local polar embedding and attention fusion. The method enhances local geometric feature extraction and reduces the semantic gap between encoded and decoded features. For enhancing the network's ability to segment complex-shaped point clouds, we first employ polar encoding and offset updating to redefine neighborhood coordinates. Then, a hybrid pooling module is introduced to improve local feature sensing. Finally, we integrate attention feature fusion between encoding and decoding layers to minimize semantic discrepancies and optimize feature mapping. Both qualitative and quantitative experiments demonstrate the effectiveness of our approach, showcasing its competitiveness with state-of-the-art methods in 3D semantic segmentation.
AB - As a fundamental task in fields such as remote sensing, autonomous vehicles, augmented reality, and robotic navigation, Point-cloud semantic segmentation is critical for interpreting 3D environments, as it involves classifying point cloud data to assign semantic labels to points within a 3D scene. Leveraging spatial positions and other attributes, 3D semantic segmentation enables accurate object representation and categorization. While deep learning has advanced feature extraction from point clouds, developing efficient and accurate segmentation networks remains challenging. To address this, we propose an encoder-decoder framework that integrates local polar embedding and attention fusion. The method enhances local geometric feature extraction and reduces the semantic gap between encoded and decoded features. For enhancing the network's ability to segment complex-shaped point clouds, we first employ polar encoding and offset updating to redefine neighborhood coordinates. Then, a hybrid pooling module is introduced to improve local feature sensing. Finally, we integrate attention feature fusion between encoding and decoding layers to minimize semantic discrepancies and optimize feature mapping. Both qualitative and quantitative experiments demonstrate the effectiveness of our approach, showcasing its competitiveness with state-of-the-art methods in 3D semantic segmentation.
KW - 3D semantic segmentation
KW - autoencoder
KW - point cloud learning
UR - https://www.scopus.com/pages/publications/105012128752
U2 - 10.1109/ICAISISAS64483.2025.11051540
DO - 10.1109/ICAISISAS64483.2025.11051540
M3 - Conference Proceeding
AN - SCOPUS:105012128752
T3 - 2025 Joint International Conference on Automation-Intelligence-Safety, ICAIS 2025 and International Symposium on Autonomous Systems, ISAS 2025
BT - 2025 Joint International Conference on Automation-Intelligence-Safety, ICAIS 2025 and International Symposium on Autonomous Systems, ISAS 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 Joint International Conference on Automation-Intelligence-Safety, ICAIS 2025 and International Symposium on Autonomous Systems, ISAS 2025
Y2 - 23 May 2025 through 25 May 2025
ER -