Self-supervised learning for point cloud data: A survey

Changyu Zeng; Wei Wang; Anh Nguyen; Jimin Xiao; Yutao Yue

doi:10.1016/j.eswa.2023.121354

Self-supervised learning for point cloud data: A survey

Changyu Zeng, Wei Wang, Anh Nguyen, Jimin Xiao, Yutao Yue^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

21 Citations (Scopus)

Abstract

3D point clouds are a crucial type of data collected by LiDAR sensors and widely used in transportation applications due to its concise descriptions and accurate localization. Deep neural networks (DNNs) have achieved remarkable success in processing large amount of disordered and sparse 3D point clouds, especially in various computer vision tasks, such as pedestrian detection and vehicle recognition. Among all the learning paradigms, Self-Supervised Learning (SSL), an unsupervised training paradigm that mines effective information from the data itself, is considered as an essential solution to solve the time-consuming and labor-intensive data labeling problems via smart pre-training task design. This paper provides a comprehensive survey of recent advances on SSL for point clouds. We first present an innovative taxonomy, categorizing the existing SSL methods into four broad categories based on the pretexts’ characteristics. Under each category, we then further categorize the methods into more fine-grained groups and summarize the strength and limitations of the representative methods. We also compare the performance of the notable SSL methods in literature on multiple downstream tasks on benchmark datasets both quantitatively and qualitatively. Finally, we propose a number of future research directions based on the identified limitations of existing SSL research on point clouds.

Original language	English
Article number	121354
Journal	Expert Systems with Applications
Volume	237
Early online date	Sept 2023
DOIs	https://doi.org/10.1016/j.eswa.2023.121354 https://doi.org/10.1016/j.eswa.2023.121354
Publication status	Published - 1 Mar 2024

Keywords

Computer vision
Point clouds
Pretext task
Representation learning
Self-supervised learning
Transfer learning

Access to Document

https://www.sciencedirect.com/science/article/pii/S0957417423018560

Cite this

@article{9999f7152b634f41a4baec63c5c803d7,

title = "Self-supervised learning for point cloud data: A survey",

abstract = "3D point clouds are a crucial type of data collected by LiDAR sensors and widely used in transportation applications due to its concise descriptions and accurate localization. Deep neural networks (DNNs) have achieved remarkable success in processing large amount of disordered and sparse 3D point clouds, especially in various computer vision tasks, such as pedestrian detection and vehicle recognition. Among all the learning paradigms, Self-Supervised Learning (SSL), an unsupervised training paradigm that mines effective information from the data itself, is considered as an essential solution to solve the time-consuming and labor-intensive data labeling problems via smart pre-training task design. This paper provides a comprehensive survey of recent advances on SSL for point clouds. We first present an innovative taxonomy, categorizing the existing SSL methods into four broad categories based on the pretexts{\textquoteright} characteristics. Under each category, we then further categorize the methods into more fine-grained groups and summarize the strength and limitations of the representative methods. We also compare the performance of the notable SSL methods in literature on multiple downstream tasks on benchmark datasets both quantitatively and qualitatively. Finally, we propose a number of future research directions based on the identified limitations of existing SSL research on point clouds.",

keywords = "Computer vision, Point clouds, Pretext task, Representation learning, Self-supervised learning, Transfer learning",

author = "Changyu Zeng and Wei Wang and Anh Nguyen and Jimin Xiao and Yutao Yue",

note = "Publisher Copyright: {\textcopyright} 2023 The Author(s)",

year = "2024",

month = mar,

day = "1",

doi = "10.1016/j.eswa.2023.121354",

language = "English",

volume = "237",

journal = "Expert Systems with Applications",

issn = "0957-4174",

publisher = "Elsevier",

}

TY - JOUR

T1 - Self-supervised learning for point cloud data

T2 - A survey

AU - Zeng, Changyu

AU - Wang, Wei

AU - Nguyen, Anh

AU - Xiao, Jimin

AU - Yue, Yutao

PY - 2024/3/1

Y1 - 2024/3/1

N2 - 3D point clouds are a crucial type of data collected by LiDAR sensors and widely used in transportation applications due to its concise descriptions and accurate localization. Deep neural networks (DNNs) have achieved remarkable success in processing large amount of disordered and sparse 3D point clouds, especially in various computer vision tasks, such as pedestrian detection and vehicle recognition. Among all the learning paradigms, Self-Supervised Learning (SSL), an unsupervised training paradigm that mines effective information from the data itself, is considered as an essential solution to solve the time-consuming and labor-intensive data labeling problems via smart pre-training task design. This paper provides a comprehensive survey of recent advances on SSL for point clouds. We first present an innovative taxonomy, categorizing the existing SSL methods into four broad categories based on the pretexts’ characteristics. Under each category, we then further categorize the methods into more fine-grained groups and summarize the strength and limitations of the representative methods. We also compare the performance of the notable SSL methods in literature on multiple downstream tasks on benchmark datasets both quantitatively and qualitatively. Finally, we propose a number of future research directions based on the identified limitations of existing SSL research on point clouds.

AB - 3D point clouds are a crucial type of data collected by LiDAR sensors and widely used in transportation applications due to its concise descriptions and accurate localization. Deep neural networks (DNNs) have achieved remarkable success in processing large amount of disordered and sparse 3D point clouds, especially in various computer vision tasks, such as pedestrian detection and vehicle recognition. Among all the learning paradigms, Self-Supervised Learning (SSL), an unsupervised training paradigm that mines effective information from the data itself, is considered as an essential solution to solve the time-consuming and labor-intensive data labeling problems via smart pre-training task design. This paper provides a comprehensive survey of recent advances on SSL for point clouds. We first present an innovative taxonomy, categorizing the existing SSL methods into four broad categories based on the pretexts’ characteristics. Under each category, we then further categorize the methods into more fine-grained groups and summarize the strength and limitations of the representative methods. We also compare the performance of the notable SSL methods in literature on multiple downstream tasks on benchmark datasets both quantitatively and qualitatively. Finally, we propose a number of future research directions based on the identified limitations of existing SSL research on point clouds.

KW - Computer vision

KW - Point clouds

KW - Pretext task

KW - Representation learning

KW - Self-supervised learning

KW - Transfer learning

UR - http://www.scopus.com/inward/record.url?scp=85171613702&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2023.121354

DO - 10.1016/j.eswa.2023.121354

M3 - Article

AN - SCOPUS:85171613702

SN - 0957-4174

VL - 237

JO - Expert Systems with Applications

JF - Expert Systems with Applications

M1 - 121354

ER -

Self-supervised learning for point cloud data: A survey

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this