TY - GEN
T1 - Self-Supervised Learning for Point Clouds through Multi-crop Mutual Prediction
AU - Zeng, Changyu
AU - Wang, Wei
AU - Xiao, Jimin
AU - Nguyen, Anh
AU - Yue, Yutao
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - As the demand for high-level autonomous driving continues to grow, efficient processing of large amount of point cloud data from LiDAR sensors has become a severe issue. Despite its great success, it is extremely difficult, if not impossible, to apply supervised learning in many real-world applications with natural scenes lacking annotations. In addition, removing label dependency and improving algorithm generalization ability are the current key research challenges in 3D deep learning. We propose a crop-based self-supervised learning framework for point clouds called Multi-Crop Mutual Prediction (MCMP). By cropping the original point clouds into different number of multiscale patches and making patches from the same object as close as possible through dynamic updates of online prototypes, the encoder is well able to maintain consistency between both local and global features. MCMP framework is model-agnostic so it is convenient to integrate it into encoders of any architectures. We pre-train three commonly used point cloud feature extraction networks on a single object-level dataset MondelNet40 and evaluate MCMP on three downstream tasks, namely, object classification, partial segmentation and semantic segmentation. Experimental results show that MCMP outperforms the previous crop-based self-supervised approaches in the object classification and semantic segmentation tasks and produces comparable results with the supervised benchmark methods.
AB - As the demand for high-level autonomous driving continues to grow, efficient processing of large amount of point cloud data from LiDAR sensors has become a severe issue. Despite its great success, it is extremely difficult, if not impossible, to apply supervised learning in many real-world applications with natural scenes lacking annotations. In addition, removing label dependency and improving algorithm generalization ability are the current key research challenges in 3D deep learning. We propose a crop-based self-supervised learning framework for point clouds called Multi-Crop Mutual Prediction (MCMP). By cropping the original point clouds into different number of multiscale patches and making patches from the same object as close as possible through dynamic updates of online prototypes, the encoder is well able to maintain consistency between both local and global features. MCMP framework is model-agnostic so it is convenient to integrate it into encoders of any architectures. We pre-train three commonly used point cloud feature extraction networks on a single object-level dataset MondelNet40 and evaluate MCMP on three downstream tasks, namely, object classification, partial segmentation and semantic segmentation. Experimental results show that MCMP outperforms the previous crop-based self-supervised approaches in the object classification and semantic segmentation tasks and produces comparable results with the supervised benchmark methods.
KW - Object Classification
KW - Part Segmentation
KW - Point Cloud
KW - Self-Supervised Learning
KW - Semantic Segmentation
UR - http://www.scopus.com/inward/record.url?scp=85182023259&partnerID=8YFLogxK
U2 - 10.1109/PRML59573.2023.10348385
DO - 10.1109/PRML59573.2023.10348385
M3 - Conference Proceeding
AN - SCOPUS:85182023259
T3 - 2023 IEEE 4th International Conference on Pattern Recognition and Machine Learning, PRML 2023
SP - 160
EP - 167
BT - 2023 IEEE 4th International Conference on Pattern Recognition and Machine Learning, PRML 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th IEEE International Conference on Pattern Recognition and Machine Learning, PRML 2023
Y2 - 4 August 2023 through 6 August 2023
ER -