Bilateral-ViT For Robust Fovea Localization

Sifan Song, Kang Dang, Qinji Yu, Zilong Wang, Frans Coenen, Jionglong Su*, Xiaowei Ding*

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding; Conference Proceeding; peer-review

2 Citations (Scopus)

Abstract

The fovea is an important anatomical landmark of the retina. Detecting the location of the fovea is essential for the analysis of many retinal diseases. However, robust fovea localization remains a challenging problem, as the fovea region often appears fuzzy, and retinal diseases may further obscure its appearance. This paper proposes a novel Vision Transformer (ViT) approach that integrates information both inside and outside the fovea region to achieve robust fovea localization. Our proposed network, named Bilateral-Vision-Transformer (Bilateral-ViT), consists of two network branches: a transformer-based main network branch for integrating global context across the entire fundus image and a vessel branch for explicitly incorporating the structure of blood vessels. The encoded features from both network branches are subsequently merged with a customized Multi-scale Feature Fusion (MFF) module. Our comprehensive experiments demonstrate that the proposed approach is significantly more robust for diseased images and establishes a new state of the art on the Messidor and PALM datasets.
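
The abstract does not describe the internals of the MFF module, so the following is only a minimal sketch of one generic way two branches' features could be merged at several scales: channel-wise concatenation followed by a 1x1 projection. The function names (`fuse_scale`, `multi_scale_fusion`), the feature shapes, and the random weights are all hypothetical and are not taken from the paper.

```python
import numpy as np


def fuse_scale(main_feat, vessel_feat, weight):
    """Fuse one scale: concatenate along channels, then project with a 1x1 map.

    main_feat:   (C_main, H, W) features from a hypothetical main branch
    vessel_feat: (C_vessel, H, W) features from a hypothetical vessel branch
    weight:      (C_out, C_main + C_vessel) projection matrix (assumed, not from the paper)
    """
    stacked = np.concatenate([main_feat, vessel_feat], axis=0)
    # A 1x1 convolution is a per-pixel linear map over the channel axis.
    return np.einsum("oc,chw->ohw", weight, stacked)


def multi_scale_fusion(main_feats, vessel_feats, weights):
    """Apply the same fusion rule independently at every scale."""
    return [fuse_scale(m, v, w) for m, v, w in zip(main_feats, vessel_feats, weights)]


# Illustrative (channels, spatial size) pairs; these values are assumptions.
rng = np.random.default_rng(0)
scales = [(8, 32), (16, 16), (32, 8)]
main_feats = [rng.standard_normal((c, s, s)) for c, s in scales]
vessel_feats = [rng.standard_normal((c, s, s)) for c, s in scales]
weights = [rng.standard_normal((c, 2 * c)) for c, _ in scales]

fused = multi_scale_fusion(main_feats, vessel_feats, weights)
fused_shapes = [f.shape for f in fused]
```

In this sketch each fused map keeps the spatial size of its inputs, so a decoder could consume the fused features scale by scale. The actual MFF design in the paper may differ substantially.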

Original language: English
Title of host publication: ISBI 2022 - Proceedings
Subtitle of host publication: 2022 IEEE International Symposium on Biomedical Imaging
Publisher: IEEE Computer Society
ISBN (Electronic): 9781665429238
Publication status: Published - 2022
Event: 19th IEEE International Symposium on Biomedical Imaging, ISBI 2022 - Kolkata, India
Duration: 28 Mar 2022 to 31 Mar 2022

Publication series

Name: Proceedings - International Symposium on Biomedical Imaging
Volume: 2022-March
ISSN (Print): 1945-7928
ISSN (Electronic): 1945-8452

Conference

Conference: 19th IEEE International Symposium on Biomedical Imaging, ISBI 2022
Country/Territory: India
City: Kolkata
Period: 28/03/22 to 31/03/22

Keywords

  • Bilateral Neural Network
  • Feature Fusion
  • Fovea Localization
  • Vision Transformer
