TY - GEN
T1 - A Lightweight Convolutional Dual-Channel Transformer Framework for Efficient Part Segmentation of Point Cloud Data
AU - Li, Jianhua
AU - Xu, Yuanping
AU - Zhang, Chaolong
AU - Kong, Chao
AU - Jin, Jin
AU - Wang, Weiye
AU - Xu, Zhijie
AU - Guo, Benjun
AU - Tang, Dan
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - In recent years, transformer-based models have made significant breakthroughs in natural language processing and computer vision. However, these models have encountered challenges when dealing with point cloud data because the irregular and disordered structure of point cloud data leads to a huge computational and memory burden. To address this problem, this paper proposes a Multidimensional Convolution-Dual Channel Transformer Network for efficient processing of point cloud data. The MCDTN framework consists of two branches: the main channel enhances the modeling of cross-channel features through dynamic attention to optimize feature representation; the auxiliary channel further improves fine-grained segmentation capabilities through encoders, multi-scale information interaction, and spatial attention. Experimental results show that MCDTN performs excellently in shape classification and part segmentation tasks, effectively reducing computational costs.
AB - In recent years, transformer-based models have made significant breakthroughs in natural language processing and computer vision. However, these models have encountered challenges when dealing with point cloud data because the irregular and disordered structure of point cloud data leads to a huge computational and memory burden. To address this problem, this paper proposes a Multidimensional Convolution-Dual Channel Transformer Network for efficient processing of point cloud data. The MCDTN framework consists of two branches: the main channel enhances the modeling of cross-channel features through dynamic attention to optimize feature representation; the auxiliary channel further improves fine-grained segmentation capabilities through encoders, multi-scale information interaction, and spatial attention. Experimental results show that MCDTN performs excellently in shape classification and part segmentation tasks, effectively reducing computational costs.
KW - deep learning
KW - point cloud classification
KW - selfattention mechanism
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=105002236838&partnerID=8YFLogxK
U2 - 10.1109/AIIM64537.2024.10934656
DO - 10.1109/AIIM64537.2024.10934656
M3 - Conference Proceeding
AN - SCOPUS:105002236838
T3 - 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024
SP - 100
EP - 104
BT - 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024
Y2 - 20 December 2024 through 22 December 2024
ER -