Self-supervised learning for point cloud data: A survey

Changyu Zeng, Wei Wang, Anh Nguyen, Jimin Xiao, Yutao Yue*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

15 Citations (Scopus)

Abstract

3D point clouds are a crucial type of data collected by LiDAR sensors and widely used in transportation applications due to its concise descriptions and accurate localization. Deep neural networks (DNNs) have achieved remarkable success in processing large amount of disordered and sparse 3D point clouds, especially in various computer vision tasks, such as pedestrian detection and vehicle recognition. Among all the learning paradigms, Self-Supervised Learning (SSL), an unsupervised training paradigm that mines effective information from the data itself, is considered as an essential solution to solve the time-consuming and labor-intensive data labeling problems via smart pre-training task design. This paper provides a comprehensive survey of recent advances on SSL for point clouds. We first present an innovative taxonomy, categorizing the existing SSL methods into four broad categories based on the pretexts’ characteristics. Under each category, we then further categorize the methods into more fine-grained groups and summarize the strength and limitations of the representative methods. We also compare the performance of the notable SSL methods in literature on multiple downstream tasks on benchmark datasets both quantitatively and qualitatively. Finally, we propose a number of future research directions based on the identified limitations of existing SSL research on point clouds.

Original languageEnglish
Article number121354
JournalExpert Systems with Applications
Volume237
Early online dateSept 2023
DOIs
Publication statusPublished - 1 Mar 2024

Keywords

  • Computer vision
  • Point clouds
  • Pretext task
  • Representation learning
  • Self-supervised learning
  • Transfer learning

Fingerprint

Dive into the research topics of 'Self-supervised learning for point cloud data: A survey'. Together they form a unique fingerprint.

Cite this