TY - JOUR
T1 - SFT: Few-Shot Learning via Self-Supervised Feature Fusion With Transformer
T2 - IEEE Access
AU - Lim, Jit Yan
AU - Lim, Kian Ming
AU - Lee, Chin Poo
AU - Tan, Yong Xuan
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024
Y1 - 2024
N2 - The few-shot learning paradigm aims to generalize to unseen tasks with limited samples. However, a focus solely on class-level discrimination may fall short of robust generalization, especially when instance diversity and discriminability are neglected. This study introduces a metric-based few-shot approach, named Self-supervised Feature Fusion with Transformer (SFT), which integrates self-supervised learning with a transformer. SFT addresses the limitations of previous approaches by employing two distinct self-supervised tasks in separate models during pre-training, thereby enhancing both instance diversity and discriminability in the feature space. The training process unfolds in two stages: pre-training and transfer learning. In pre-training, each model is trained with its own self-supervised task to harness the benefits of an enhanced feature space. In the subsequent transfer learning stage, the weights of both models are frozen and the models serve as feature extractors. The features from both models are combined through a feature fusion technique and transformed into task-specific features by a transformer, boosting discrimination on unseen tasks. The fused features enable the model to learn a well-generalized representation, effectively tackling the challenges posed by few-shot tasks. The proposed SFT method achieves state-of-the-art results on three benchmark datasets for few-shot image classification.
AB - The few-shot learning paradigm aims to generalize to unseen tasks with limited samples. However, a focus solely on class-level discrimination may fall short of robust generalization, especially when instance diversity and discriminability are neglected. This study introduces a metric-based few-shot approach, named Self-supervised Feature Fusion with Transformer (SFT), which integrates self-supervised learning with a transformer. SFT addresses the limitations of previous approaches by employing two distinct self-supervised tasks in separate models during pre-training, thereby enhancing both instance diversity and discriminability in the feature space. The training process unfolds in two stages: pre-training and transfer learning. In pre-training, each model is trained with its own self-supervised task to harness the benefits of an enhanced feature space. In the subsequent transfer learning stage, the weights of both models are frozen and the models serve as feature extractors. The features from both models are combined through a feature fusion technique and transformed into task-specific features by a transformer, boosting discrimination on unseen tasks. The fused features enable the model to learn a well-generalized representation, effectively tackling the challenges posed by few-shot tasks. The proposed SFT method achieves state-of-the-art results on three benchmark datasets for few-shot image classification.
KW - contrastive learning
KW - feature fusion
KW - Few-shot learning
KW - self-supervised learning
KW - transformer
UR - http://www.scopus.com/inward/record.url?scp=85196520468&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3416327
DO - 10.1109/ACCESS.2024.3416327
M3 - Article
AN - SCOPUS:85196520468
SN - 2169-3536
VL - 12
SP - 86690
EP - 86703
JO - IEEE Access
JF - IEEE Access
ER -