Bilateral-ViT For Robust Fovea Localization

Sifan Song; Kang Dang; Qinji Yu; Zilong Wang; Frans Coenen; Jionglong Su; Xiaowei Ding

doi:10.1109/ISBI52829.2022.9761523

Bilateral-ViT For Robust Fovea Localization

Sifan Song, Kang Dang, Qinji Yu, Zilong Wang, Frans Coenen, Jionglong Su^*, Xiaowei Ding^*

^*Corresponding author for this work

School of AI and Advanced Computing

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

5 Citations (Scopus)

Abstract

The fovea is an important anatomical landmark of the retina. Detecting the location of the fovea is essential for the analysis of many retinal diseases. However, robust fovea localization remains a challenging problem, as the fovea region often appears fuzzy, and retina diseases may further obscure its appearance. This paper proposes a novel Vision Transformer (ViT) approach that integrates information both inside and outside the fovea region to achieve robust fovea localization. Our proposed network, named Bilateral-Vision-Transformer (Bilateral-ViT), consists of two network branches: a transformer-based main network branch for integrating global context across the entire fundus image and a vessel branch for explicitly incorporating the structure of blood vessels. The encoded features from both network branches are subsequently merged with a customized Multi-scale Feature Fusion (MFF) module. Our comprehensive experiments demonstrate that the proposed approach is significantly more robust for diseased images and establishes the new state of the arts using the Messidor and PALM datasets.

Original language	English
Title of host publication	ISBI 2022 - Proceedings
Subtitle of host publication	2022 IEEE International Symposium on Biomedical Imaging
Publisher	IEEE Computer Society
ISBN (Electronic)	9781665429238
DOIs	https://doi.org/10.1109/ISBI52829.2022.9761523
Publication status	Published - 2022
Event	19th IEEE International Symposium on Biomedical Imaging, ISBI 2022 - Kolkata, India Duration: 28 Mar 2022 → 31 Mar 2022

Publication series

Name	Proceedings - International Symposium on Biomedical Imaging
Volume	2022-March
ISSN (Print)	1945-7928
ISSN (Electronic)	1945-8452

Conference

Conference	19th IEEE International Symposium on Biomedical Imaging, ISBI 2022
Country/Territory	India
City	Kolkata
Period	28/03/22 → 31/03/22

Keywords

Bilateral Neural Network
Feature Fusion
Fovea Localization
Vision Transformer

Access to Document

10.1109/ISBI52829.2022.9761523

Cite this

@inproceedings{28b3a9fdf9c449f5b43c1cb84fa95e7b,

title = "Bilateral-ViT For Robust Fovea Localization",

abstract = "The fovea is an important anatomical landmark of the retina. Detecting the location of the fovea is essential for the analysis of many retinal diseases. However, robust fovea localization remains a challenging problem, as the fovea region often appears fuzzy, and retina diseases may further obscure its appearance. This paper proposes a novel Vision Transformer (ViT) approach that integrates information both inside and outside the fovea region to achieve robust fovea localization. Our proposed network, named Bilateral-Vision-Transformer (Bilateral-ViT), consists of two network branches: a transformer-based main network branch for integrating global context across the entire fundus image and a vessel branch for explicitly incorporating the structure of blood vessels. The encoded features from both network branches are subsequently merged with a customized Multi-scale Feature Fusion (MFF) module. Our comprehensive experiments demonstrate that the proposed approach is significantly more robust for diseased images and establishes the new state of the arts using the Messidor and PALM datasets.",

keywords = "Bilateral Neural Network, Feature Fusion, Fovea Localization, Vision Transformer",

author = "Sifan Song and Kang Dang and Qinji Yu and Zilong Wang and Frans Coenen and Jionglong Su and Xiaowei Ding",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 19th IEEE International Symposium on Biomedical Imaging, ISBI 2022 ; Conference date: 28-03-2022 Through 31-03-2022",

year = "2022",

doi = "10.1109/ISBI52829.2022.9761523",

language = "English",

series = "Proceedings - International Symposium on Biomedical Imaging",

publisher = "IEEE Computer Society",

booktitle = "ISBI 2022 - Proceedings",

}

Song, S, Dang, K, Yu, Q, Wang, Z, Coenen, F, Su, J & Ding, X 2022, Bilateral-ViT For Robust Fovea Localization. in ISBI 2022 - Proceedings: 2022 IEEE International Symposium on Biomedical Imaging. Proceedings - International Symposium on Biomedical Imaging, vol. 2022-March, IEEE Computer Society, 19th IEEE International Symposium on Biomedical Imaging, ISBI 2022, Kolkata, India, 28/03/22. https://doi.org/10.1109/ISBI52829.2022.9761523

Bilateral-ViT For Robust Fovea Localization. / Song, Sifan; Dang, Kang; Yu, Qinji et al.
ISBI 2022 - Proceedings: 2022 IEEE International Symposium on Biomedical Imaging. IEEE Computer Society, 2022. (Proceedings - International Symposium on Biomedical Imaging; Vol. 2022-March).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Bilateral-ViT For Robust Fovea Localization

AU - Song, Sifan

AU - Dang, Kang

AU - Yu, Qinji

AU - Wang, Zilong

AU - Coenen, Frans

AU - Su, Jionglong

AU - Ding, Xiaowei

PY - 2022

Y1 - 2022

N2 - The fovea is an important anatomical landmark of the retina. Detecting the location of the fovea is essential for the analysis of many retinal diseases. However, robust fovea localization remains a challenging problem, as the fovea region often appears fuzzy, and retina diseases may further obscure its appearance. This paper proposes a novel Vision Transformer (ViT) approach that integrates information both inside and outside the fovea region to achieve robust fovea localization. Our proposed network, named Bilateral-Vision-Transformer (Bilateral-ViT), consists of two network branches: a transformer-based main network branch for integrating global context across the entire fundus image and a vessel branch for explicitly incorporating the structure of blood vessels. The encoded features from both network branches are subsequently merged with a customized Multi-scale Feature Fusion (MFF) module. Our comprehensive experiments demonstrate that the proposed approach is significantly more robust for diseased images and establishes the new state of the arts using the Messidor and PALM datasets.

AB - The fovea is an important anatomical landmark of the retina. Detecting the location of the fovea is essential for the analysis of many retinal diseases. However, robust fovea localization remains a challenging problem, as the fovea region often appears fuzzy, and retina diseases may further obscure its appearance. This paper proposes a novel Vision Transformer (ViT) approach that integrates information both inside and outside the fovea region to achieve robust fovea localization. Our proposed network, named Bilateral-Vision-Transformer (Bilateral-ViT), consists of two network branches: a transformer-based main network branch for integrating global context across the entire fundus image and a vessel branch for explicitly incorporating the structure of blood vessels. The encoded features from both network branches are subsequently merged with a customized Multi-scale Feature Fusion (MFF) module. Our comprehensive experiments demonstrate that the proposed approach is significantly more robust for diseased images and establishes the new state of the arts using the Messidor and PALM datasets.

KW - Bilateral Neural Network

KW - Feature Fusion

KW - Fovea Localization

KW - Vision Transformer

UR - http://www.scopus.com/inward/record.url?scp=85129647739&partnerID=8YFLogxK

U2 - 10.1109/ISBI52829.2022.9761523

DO - 10.1109/ISBI52829.2022.9761523

M3 - Conference Proceeding

AN - SCOPUS:85129647739

T3 - Proceedings - International Symposium on Biomedical Imaging

BT - ISBI 2022 - Proceedings

PB - IEEE Computer Society

T2 - 19th IEEE International Symposium on Biomedical Imaging, ISBI 2022

Y2 - 28 March 2022 through 31 March 2022

ER -

Bilateral-ViT For Robust Fovea Localization

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this