ViTMed: Vision Transformer for Medical Image Analysis

Yu Jie Lim*, Kian Ming Lim, Roy Kwang Yang Chang, Chin Poo Lee, Jit Yan Lim

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

The COVID-19 global health crisis has presented daunting challenges to medical professionals, making accurate and efficient diagnosis more important than ever. In view of this, this paper proposes ViTMed, a Vision Transformer model with an attention mechanism that classifies CT scan images for more effective diagnosis of COVID-19. Each input CT scan image is represented as a sequence of tokens, and a transformer captures global and local dependencies between features through self-attention. The core element of ViTMed is the transformer encoder, which combines a multi-head attention (MHA) mechanism with a feed-forward network. This enables the model to learn a hierarchical representation of the image and make more informed predictions. The proposed ViTMed achieves promising performance with fewer parameters and computations than conventional Convolutional Neural Networks. In the experiments, ViTMed outperforms state-of-the-art approaches on all three public COVID-19 benchmark datasets, achieving 98.38%, 90.48%, and 99.17% accuracy on the SARS-CoV-2-CT, COVID-CT, and iCTCF datasets, respectively. These datasets contain 2482, 746, and 19685 samples, respectively, and consist of two to three classes: Covid, Non-Covid, and Non-informative cases.
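
The tokenization and self-attention steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes non-overlapping square patches, a single attention head, and identity query/key/value projections, whereas the abstract does not state ViTMed's patch size, embedding dimension, number of heads, or learned weights. The helper names `patchify` and `self_attention` are hypothetical and chosen only for illustration.

```python
import math


def patchify(image, patch):
    """Split a 2-D image (list of rows) into flattened, non-overlapping patches.

    Each patch becomes one token. The patch size is an assumption of this
    sketch, not a value taken from the paper.
    """
    height, width = len(image), len(image[0])
    tokens = []
    for r in range(0, height - patch + 1, patch):
        for c in range(0, width - patch + 1, patch):
            tokens.append(
                [image[r + i][c + j] for i in range(patch) for j in range(patch)]
            )
    return tokens


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: Q = K = V = tokens (identity projections). A trained model
    would apply learned projection matrices and several heads instead.
    """
    d = len(tokens[0])
    outputs = []
    for q in tokens:
        # Similarity of this token to every token, scaled by sqrt(d).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        # Each output is a weighted average of all tokens, so every patch
        # can draw on information from every other patch.
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)]
        )
    return outputs


# Toy 4x4 "image" with values in [0, 1); real CT slices are far larger.
image = [[float(r * 4 + c) / 16 for c in range(4)] for r in range(4)]
tokens = patchify(image, 2)
attended = self_attention(tokens)
```

Because every output token is a weighted average over all input tokens, a single layer already mixes information across the whole image, which is the global dependency modelling the abstract refers to. A full encoder block would add learned projections, multiple heads, residual connections, normalization, and the feed-forward network.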

Original language: English
Title of host publication: 2023 11th International Conference on Information and Communication Technology, ICoICT 2023
Pages: 277-282
Number of pages: 6
ISBN (Electronic): 9798350321982
DOIs
Publication status: Published - 2023
Externally published: Yes
Event: 11th International Conference on Information and Communication Technology, ICoICT 2023 - Melaka, Malaysia
Duration: 23 Aug 2023 to 24 Aug 2023

Publication series

Name: 2023 11th International Conference on Information and Communication Technology, ICoICT 2023
Volume: 2023-August

Conference

Conference: 11th International Conference on Information and Communication Technology, ICoICT 2023
Country/Territory: Malaysia
City: Melaka
Period: 23/08/23 to 24/08/23

Keywords

  • Attention
  • COVID-19
  • CT-Scan
  • Medical Image Analysis
  • Vision Transformer
