TY - GEN
T1 - Distortion-Disentangled Contrastive Learning
AU - Wang, Jinfeng
AU - Song, Sifan
AU - Su, Jionglong
AU - Zhou, S. Kevin
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024/1/3
Y1 - 2024/1/3
N2 - Self-supervised learning is well known for its remarkable performance in representation learning and various downstream computer vision tasks. Recently, Positive-pair-Only Contrastive Learning (POCL) has achieved reliable performance without the need to construct positive-negative training sets, and it reduces memory requirements by lessening the dependency on batch size. POCL methods typically use a single objective function to extract the distortion invariant representation (DIR), which describes the proximity of positive-pair representations affected by different distortions. This objective function implicitly enables the model to filter out or ignore the distortion variant representation (DVR) affected by different distortions. However, some recent studies have shown that proper use of the DVR in contrastive learning can improve model performance in some downstream domain-specific tasks. In addition, these POCL methods have been observed to be sensitive to augmentation strategies. To address these limitations, we propose a novel POCL framework named Distortion-Disentangled Contrastive Learning (DDCL) and a Distortion-Disentangled Loss (DDL). Our approach is the first to explicitly and adaptively disentangle and exploit the DVR inside the model and feature stream to improve representation utilization efficiency, robustness, and representation ability. Experiments demonstrate our framework's superiority to Barlow Twins and SimSiam in terms of convergence, representation quality (including transferability and generalization), and robustness on several datasets.
AB - Self-supervised learning is well known for its remarkable performance in representation learning and various downstream computer vision tasks. Recently, Positive-pair-Only Contrastive Learning (POCL) has achieved reliable performance without the need to construct positive-negative training sets, and it reduces memory requirements by lessening the dependency on batch size. POCL methods typically use a single objective function to extract the distortion invariant representation (DIR), which describes the proximity of positive-pair representations affected by different distortions. This objective function implicitly enables the model to filter out or ignore the distortion variant representation (DVR) affected by different distortions. However, some recent studies have shown that proper use of the DVR in contrastive learning can improve model performance in some downstream domain-specific tasks. In addition, these POCL methods have been observed to be sensitive to augmentation strategies. To address these limitations, we propose a novel POCL framework named Distortion-Disentangled Contrastive Learning (DDCL) and a Distortion-Disentangled Loss (DDL). Our approach is the first to explicitly and adaptively disentangle and exploit the DVR inside the model and feature stream to improve representation utilization efficiency, robustness, and representation ability. Experiments demonstrate our framework's superiority to Barlow Twins and SimSiam in terms of convergence, representation quality (including transferability and generalization), and robustness on several datasets.
KW - Algorithms
KW - Machine learning architectures, formulations, and algorithms
UR - http://www.scopus.com/inward/record.url?scp=85192003966&partnerID=8YFLogxK
U2 - 10.1109/WACV57701.2024.00015
DO - 10.1109/WACV57701.2024.00015
M3 - Conference Proceeding
AN - SCOPUS:85192003966
T3 - Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024
SP - 75
EP - 85
BT - Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024
Y2 - 4 January 2024 through 8 January 2024
ER -