Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling

Haochuan Jiang, Chengjia Wang*, Agisilaos Chartsias, Sotirios A. Tsaftaris

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

8 Citations (Scopus)

Abstract

Automatic segmentation of multi-sequence (multi-modal) cardiac MR (CMR) images plays a significant role in diagnosis and management for a variety of cardiac diseases. However, the performance of relevant algorithms is significantly affected by the proper fusion of the multi-modal information. Furthermore, particular diseases, such as myocardial infarction, display irregular shapes on images and occupy small regions at random locations. These facts make pathology segmentation of multi-modal CMR images a challenging task. In this paper, we present the Max-Fusion U-Net that achieves improved pathology segmentation performance given aligned multi-modal images of LGE, T2-weighted, and bSSFP modalities. Specifically, modality-specific features are extracted by dedicated encoders. Then they are fused with the pixel-wise maximum operator. Together with the corresponding encoding features, these representations are propagated to decoding layers with U-Net skip-connections. Furthermore, a spatial-attention module is applied in the last decoding layer to encourage the network to focus on those small semantically meaningful pathological regions that trigger relatively high responses by the network neurons. We also use a simple image patch extraction strategy to dynamically resample training examples with varying spacial and batch sizes. With limited GPU memory, this strategy reduces the imbalance of classes and forces the model to focus on regions around the interested pathology. It further improves segmentation accuracy and reduces the mis-classification of pathology. We evaluate our methods using the Myocardial pathology segmentation (MyoPS) combining the multi-sequence CMR dataset which involves three modalities. Extensive experiments demonstrate the effectiveness of the proposed model which outperforms the related baselines. The code is available at https://github.com/falconjhc/MFU-Net.

Original languageEnglish
Title of host publicationMyocardial Pathology Segmentation Combining Multi-Sequence Cardiac Magnetic Resonance Images - First Challenge, MyoPS 2020, Held in Conjunction with MICCAI 2020, Proceedings
EditorsXiahai Zhuang, Lei Li
PublisherSpringer Science and Business Media Deutschland GmbH
Pages68-81
Number of pages14
ISBN (Print)9783030656508
DOIs
Publication statusPublished - 2020
Externally publishedYes
Event1st Myocardial Pathology Segmentation Combining Multi-Sequence CMR Challenge, MyoPS 2020 held in conjunction with 23rd International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2020 - Lima, Peru
Duration: 4 Oct 20204 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12554 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st Myocardial Pathology Segmentation Combining Multi-Sequence CMR Challenge, MyoPS 2020 held in conjunction with 23rd International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2020
Country/TerritoryPeru
CityLima
Period4/10/204/10/20

Keywords

  • Dynamic resample
  • Max-fusion
  • Multi-modal
  • Pathology segmentation

Cite this