A Lightweight Convolutional Dual-Channel Transformer Framework for Efficient Part Segmentation of Point Cloud Data

Jianhua Li; Yuanping Xu; Chaolong Zhang; Chao Kong; Jin Jin; Weiye Wang; Zhijie Xu; Benjun Guo; Dan Tang

doi:10.1109/AIIM64537.2024.10934656

Jianhua Li, Yuanping Xu*, Chaolong Zhang, Chao Kong, Jin Jin, Weiye Wang, Zhijie Xu, Benjun Guo, Dan Tang

*Corresponding author for this work

Department of Computing

Chengdu University of Information Technology

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

In recent years, transformer-based models have achieved significant breakthroughs in natural language processing and computer vision. However, they remain difficult to apply to point cloud data, whose irregular, unordered structure imposes a heavy computational and memory burden. To address this problem, this paper proposes a Multidimensional Convolution-Dual Channel Transformer Network (MCDTN) for efficient processing of point cloud data. The MCDTN framework consists of two branches: the main channel uses dynamic attention to strengthen cross-channel feature modeling and optimize the feature representation, while the auxiliary channel improves fine-grained segmentation through encoders, multi-scale information interaction, and spatial attention. Experimental results show that MCDTN performs well on shape classification and part segmentation tasks while reducing computational cost.
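
As a rough illustration of the memory burden the abstract mentions, the sketch below compares the size of one dense attention map computed across points with one computed across feature channels. The point count (2048), feature width (128), and 4-byte values are assumptions chosen for illustration and are not taken from the paper; the helper `attention_map_bytes` is hypothetical and not part of MCDTN.

```python
def attention_map_bytes(n_tokens: int, bytes_per_value: int = 4) -> int:
    """Bytes needed to store one dense n_tokens x n_tokens attention map."""
    return n_tokens * n_tokens * bytes_per_value


# Assumed sizes, for illustration only (not reported in the abstract).
N_POINTS = 2048    # points per shape
N_CHANNELS = 128   # feature channels per point

# Attention across points grows quadratically with the number of points.
point_map = attention_map_bytes(N_POINTS)

# Attention across channels depends only on the feature width.
channel_map = attention_map_bytes(N_CHANNELS)

print(point_map)                 # 16777216 bytes (16 MiB)
print(channel_map)               # 65536 bytes (64 KiB)
print(point_map // channel_map)  # 256
```

Under these assumed sizes the point-wise map is 256 times larger than the channel-wise one, which hints at why attention over channels can be attractive for point clouds. Whether MCDTN's dynamic attention is built this way is not stated in the abstract.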

Original language	English
Title of host publication	2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	100-104
Number of pages	5
ISBN (Electronic)	9798331541729
DOIs	https://doi.org/10.1109/AIIM64537.2024.10934656
Publication status	Published - 2024
Event	4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024 - Chengdu, China
Duration	20 Dec 2024 → 22 Dec 2024

Publication series

Name	2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024

Conference

Conference	4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024
Country/Territory	China
City	Chengdu
Period	20/12/24 → 22/12/24

Keywords

deep learning
point cloud classification
self-attention mechanism
Transformer

Access to Document

10.1109/AIIM64537.2024.10934656

Cite this

Li, J., Xu, Y., Zhang, C., Kong, C., Jin, J., Wang, W., Xu, Z., Guo, B., & Tang, D. (2024). A Lightweight Convolutional Dual-Channel Transformer Framework for Efficient Part Segmentation of Point Cloud Data. In 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024 (pp. 100-104). (2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/AIIM64537.2024.10934656

Li, Jianhua ; Xu, Yuanping ; Zhang, Chaolong et al. / A Lightweight Convolutional Dual-Channel Transformer Framework for Efficient Part Segmentation of Point Cloud Data. 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 100-104 (2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024).

@inproceedings{c165bfdbdf174f9280f9f3893a8aa1a6,

title = "A Lightweight Convolutional Dual-Channel Transformer Framework for Efficient Part Segmentation of Point Cloud Data",

abstract = "In recent years, transformer-based models have made significant breakthroughs in natural language processing and computer vision. However, these models have encountered challenges when dealing with point cloud data because the irregular and disordered structure of point cloud data leads to a huge computational and memory burden. To address this problem, this paper proposes a Multidimensional Convolution-Dual Channel Transformer Network for efficient processing of point cloud data. The MCDTN framework consists of two branches: the main channel enhances the modeling of cross-channel features through dynamic attention to optimize feature representation; the auxiliary channel further improves fine-grained segmentation capabilities through encoders, multi-scale information interaction, and spatial attention. Experimental results show that MCDTN performs excellently in shape classification and part segmentation tasks, effectively reducing computational costs.",

keywords = "deep learning, point cloud classification, self-attention mechanism, Transformer",

author = "Jianhua Li and Yuanping Xu and Chaolong Zhang and Chao Kong and Jin Jin and Weiye Wang and Zhijie Xu and Benjun Guo and Dan Tang",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024 ; Conference date: 20-12-2024 Through 22-12-2024",

year = "2024",

doi = "10.1109/AIIM64537.2024.10934656",

language = "English",

series = "2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "100--104",

booktitle = "2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024",

}

Li, J, Xu, Y, Zhang, C, Kong, C, Jin, J, Wang, W, Xu, Z, Guo, B & Tang, D 2024, A Lightweight Convolutional Dual-Channel Transformer Framework for Efficient Part Segmentation of Point Cloud Data. in 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024. 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024, Institute of Electrical and Electronics Engineers Inc., pp. 100-104, 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024, Chengdu, China, 20/12/24. https://doi.org/10.1109/AIIM64537.2024.10934656

A Lightweight Convolutional Dual-Channel Transformer Framework for Efficient Part Segmentation of Point Cloud Data. / Li, Jianhua; Xu, Yuanping; Zhang, Chaolong et al.
2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024. Institute of Electrical and Electronics Engineers Inc., 2024. p. 100-104 (2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024).

TY - GEN

T1 - A Lightweight Convolutional Dual-Channel Transformer Framework for Efficient Part Segmentation of Point Cloud Data

AU - Li, Jianhua

AU - Xu, Yuanping

AU - Zhang, Chaolong

AU - Kong, Chao

AU - Jin, Jin

AU - Wang, Weiye

AU - Xu, Zhijie

AU - Guo, Benjun

AU - Tang, Dan

PY - 2024

Y1 - 2024

AB - In recent years, transformer-based models have made significant breakthroughs in natural language processing and computer vision. However, these models have encountered challenges when dealing with point cloud data because the irregular and disordered structure of point cloud data leads to a huge computational and memory burden. To address this problem, this paper proposes a Multidimensional Convolution-Dual Channel Transformer Network for efficient processing of point cloud data. The MCDTN framework consists of two branches: the main channel enhances the modeling of cross-channel features through dynamic attention to optimize feature representation; the auxiliary channel further improves fine-grained segmentation capabilities through encoders, multi-scale information interaction, and spatial attention. Experimental results show that MCDTN performs excellently in shape classification and part segmentation tasks, effectively reducing computational costs.

KW - deep learning

KW - point cloud classification

KW - self-attention mechanism

KW - Transformer

UR - http://www.scopus.com/inward/record.url?scp=105002236838&partnerID=8YFLogxK

U2 - 10.1109/AIIM64537.2024.10934656

DO - 10.1109/AIIM64537.2024.10934656

M3 - Conference Proceeding

AN - SCOPUS:105002236838

T3 - 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024

SP - 100

EP - 104

BT - 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024

Y2 - 20 December 2024 through 22 December 2024

ER -

Li J, Xu Y, Zhang C, Kong C, Jin J, Wang W et al. A Lightweight Convolutional Dual-Channel Transformer Framework for Efficient Part Segmentation of Point Cloud Data. In 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024. Institute of Electrical and Electronics Engineers Inc. 2024. p. 100-104. (2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2024). doi: 10.1109/AIIM64537.2024.10934656
