TY - GEN
T1 - Contrastive Cycle Consistency Learning for Unsupervised Visual Tracking
AU - Zhu, Jiajun
AU - Ma, Chao
AU - Jia, Shuai
AU - Xu, Shugong
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Unsupervised visual tracking has received increasing attention recently. Existing unsupervised visual tracking methods mainly exploit the cycle consistency of sequential images to learn an unsupervised representation for target objects. Due to the small appearance changes between consecutive images, existing unsupervised deep trackers compute the cycle consistency loss over a temporal span to reduce data correlation. However, this causes the learned unsupervised representation not robust to abrupt motion changes as the rich motion dynamics between consecutive frames are not exploited. To address this problem, we propose to contrastively learn cycle consistency over consecutive frames with data augmentation. Specifically, we first use a skipping frame scheme to perform step-by-step cycle tracking for learning unsupervised representation. We then perform unsupervised tracking by computing the contrastive cycle consistency over the augmented consecutive frames, which simulates the challenging scenarios of large appearance changes in visual tracking. This helps us make full use of the valuable temporal motion information for learning robust unsupervised representation. Extensive experiments on large-scale benchmark datasets demonstrate that our proposed tracker significantly advances the state-of-the-art unsupervised visual tracking algorithms by large margins.
AB - Unsupervised visual tracking has received increasing attention recently. Existing unsupervised visual tracking methods mainly exploit the cycle consistency of sequential images to learn an unsupervised representation for target objects. Due to the small appearance changes between consecutive images, existing unsupervised deep trackers compute the cycle consistency loss over a temporal span to reduce data correlation. However, this causes the learned unsupervised representation not robust to abrupt motion changes as the rich motion dynamics between consecutive frames are not exploited. To address this problem, we propose to contrastively learn cycle consistency over consecutive frames with data augmentation. Specifically, we first use a skipping frame scheme to perform step-by-step cycle tracking for learning unsupervised representation. We then perform unsupervised tracking by computing the contrastive cycle consistency over the augmented consecutive frames, which simulates the challenging scenarios of large appearance changes in visual tracking. This helps us make full use of the valuable temporal motion information for learning robust unsupervised representation. Extensive experiments on large-scale benchmark datasets demonstrate that our proposed tracker significantly advances the state-of-the-art unsupervised visual tracking algorithms by large margins.
KW - Contrastive learning
KW - Cycle consistency
KW - Unsupervised visual tracking
UR - http://www.scopus.com/inward/record.url?scp=85118191489&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-88004-0_46
DO - 10.1007/978-3-030-88004-0_46
M3 - Conference Proceeding
AN - SCOPUS:85118191489
SN - 9783030880033
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 564
EP - 576
BT - Pattern Recognition and Computer Vision - 4th Chinese Conference, PRCV 2021, Proceedings
A2 - Ma, Huimin
A2 - Wang, Liang
A2 - Zhang, Changshui
A2 - Wu, Fei
A2 - Tan, Tieniu
A2 - Wang, Yaonan
A2 - Lai, Jianhuang
A2 - Zhao, Yao
PB - Springer Science and Business Media Deutschland GmbH
T2 - 4th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2021
Y2 - 29 October 2021 through 1 November 2021
ER -