Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling

Haochuan Jiang; Chengjia Wang; Agisilaos Chartsias; Sotirios A. Tsaftaris

doi:10.1007/978-3-030-65651-5_7

Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling

Haochuan Jiang, Chengjia Wang^*, Agisilaos Chartsias, Sotirios A. Tsaftaris

^*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

8 Citations (Scopus)

Abstract

Automatic segmentation of multi-sequence (multi-modal) cardiac MR (CMR) images plays a significant role in diagnosis and management for a variety of cardiac diseases. However, the performance of relevant algorithms is significantly affected by the proper fusion of the multi-modal information. Furthermore, particular diseases, such as myocardial infarction, display irregular shapes on images and occupy small regions at random locations. These facts make pathology segmentation of multi-modal CMR images a challenging task. In this paper, we present the Max-Fusion U-Net that achieves improved pathology segmentation performance given aligned multi-modal images of LGE, T2-weighted, and bSSFP modalities. Specifically, modality-specific features are extracted by dedicated encoders. Then they are fused with the pixel-wise maximum operator. Together with the corresponding encoding features, these representations are propagated to decoding layers with U-Net skip-connections. Furthermore, a spatial-attention module is applied in the last decoding layer to encourage the network to focus on those small semantically meaningful pathological regions that trigger relatively high responses by the network neurons. We also use a simple image patch extraction strategy to dynamically resample training examples with varying spacial and batch sizes. With limited GPU memory, this strategy reduces the imbalance of classes and forces the model to focus on regions around the interested pathology. It further improves segmentation accuracy and reduces the mis-classification of pathology. We evaluate our methods using the Myocardial pathology segmentation (MyoPS) combining the multi-sequence CMR dataset which involves three modalities. Extensive experiments demonstrate the effectiveness of the proposed model which outperforms the related baselines. The code is available at https://github.com/falconjhc/MFU-Net.

Original language	English
Title of host publication	Myocardial Pathology Segmentation Combining Multi-Sequence Cardiac Magnetic Resonance Images - First Challenge, MyoPS 2020, Held in Conjunction with MICCAI 2020, Proceedings
Editors	Xiahai Zhuang, Lei Li
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	68-81
Number of pages	14
ISBN (Print)	9783030656508
DOIs	https://doi.org/10.1007/978-3-030-65651-5_7
Publication status	Published - 2020
Externally published	Yes
Event	1st Myocardial Pathology Segmentation Combining Multi-Sequence CMR Challenge, MyoPS 2020 held in conjunction with 23rd International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2020 - Lima, Peru Duration: 4 Oct 2020 → 4 Oct 2020

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	12554 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	1st Myocardial Pathology Segmentation Combining Multi-Sequence CMR Challenge, MyoPS 2020 held in conjunction with 23rd International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2020
Country/Territory	Peru
City	Lima
Period	4/10/20 → 4/10/20

Keywords

Dynamic resample
Max-fusion
Multi-modal
Pathology segmentation

Access to Document

10.1007/978-3-030-65651-5_7

Cite this

Jiang, H., Wang, C., Chartsias, A., & Tsaftaris, S. A. (2020). Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling. In X. Zhuang, & L. Li (Eds.), Myocardial Pathology Segmentation Combining Multi-Sequence Cardiac Magnetic Resonance Images - First Challenge, MyoPS 2020, Held in Conjunction with MICCAI 2020, Proceedings (pp. 68-81). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12554 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-65651-5_7

Jiang, Haochuan ; Wang, Chengjia ; Chartsias, Agisilaos et al. / Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling. Myocardial Pathology Segmentation Combining Multi-Sequence Cardiac Magnetic Resonance Images - First Challenge, MyoPS 2020, Held in Conjunction with MICCAI 2020, Proceedings. editor / Xiahai Zhuang ; Lei Li. Springer Science and Business Media Deutschland GmbH, 2020. pp. 68-81 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{447154bbb155412fb055629cd5d91086,

title = "Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling",

abstract = "Automatic segmentation of multi-sequence (multi-modal) cardiac MR (CMR) images plays a significant role in diagnosis and management for a variety of cardiac diseases. However, the performance of relevant algorithms is significantly affected by the proper fusion of the multi-modal information. Furthermore, particular diseases, such as myocardial infarction, display irregular shapes on images and occupy small regions at random locations. These facts make pathology segmentation of multi-modal CMR images a challenging task. In this paper, we present the Max-Fusion U-Net that achieves improved pathology segmentation performance given aligned multi-modal images of LGE, T2-weighted, and bSSFP modalities. Specifically, modality-specific features are extracted by dedicated encoders. Then they are fused with the pixel-wise maximum operator. Together with the corresponding encoding features, these representations are propagated to decoding layers with U-Net skip-connections. Furthermore, a spatial-attention module is applied in the last decoding layer to encourage the network to focus on those small semantically meaningful pathological regions that trigger relatively high responses by the network neurons. We also use a simple image patch extraction strategy to dynamically resample training examples with varying spacial and batch sizes. With limited GPU memory, this strategy reduces the imbalance of classes and forces the model to focus on regions around the interested pathology. It further improves segmentation accuracy and reduces the mis-classification of pathology. We evaluate our methods using the Myocardial pathology segmentation (MyoPS) combining the multi-sequence CMR dataset which involves three modalities. Extensive experiments demonstrate the effectiveness of the proposed model which outperforms the related baselines. The code is available at https://github.com/falconjhc/MFU-Net.",

keywords = "Dynamic resample, Max-fusion, Multi-modal, Pathology segmentation",

author = "Haochuan Jiang and Chengjia Wang and Agisilaos Chartsias and Tsaftaris, {Sotirios A.}",

note = "Publisher Copyright: {\textcopyright} 2020, Springer Nature Switzerland AG.; 1st Myocardial Pathology Segmentation Combining Multi-Sequence CMR Challenge, MyoPS 2020 held in conjunction with 23rd International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2020 ; Conference date: 04-10-2020 Through 04-10-2020",

year = "2020",

doi = "10.1007/978-3-030-65651-5_7",

language = "English",

isbn = "9783030656508",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "68--81",

editor = "Xiahai Zhuang and Lei Li",

booktitle = "Myocardial Pathology Segmentation Combining Multi-Sequence Cardiac Magnetic Resonance Images - First Challenge, MyoPS 2020, Held in Conjunction with MICCAI 2020, Proceedings",

}

Jiang, H, Wang, C, Chartsias, A & Tsaftaris, SA 2020, Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling. in X Zhuang & L Li (eds), Myocardial Pathology Segmentation Combining Multi-Sequence Cardiac Magnetic Resonance Images - First Challenge, MyoPS 2020, Held in Conjunction with MICCAI 2020, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12554 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 68-81, 1st Myocardial Pathology Segmentation Combining Multi-Sequence CMR Challenge, MyoPS 2020 held in conjunction with 23rd International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2020, Lima, Peru, 4/10/20. https://doi.org/10.1007/978-3-030-65651-5_7

Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling. / Jiang, Haochuan; Wang, Chengjia; Chartsias, Agisilaos et al.
Myocardial Pathology Segmentation Combining Multi-Sequence Cardiac Magnetic Resonance Images - First Challenge, MyoPS 2020, Held in Conjunction with MICCAI 2020, Proceedings. ed. / Xiahai Zhuang; Lei Li. Springer Science and Business Media Deutschland GmbH, 2020. p. 68-81 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12554 LNCS).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling

AU - Jiang, Haochuan

AU - Wang, Chengjia

AU - Chartsias, Agisilaos

AU - Tsaftaris, Sotirios A.

PY - 2020

Y1 - 2020

N2 - Automatic segmentation of multi-sequence (multi-modal) cardiac MR (CMR) images plays a significant role in diagnosis and management for a variety of cardiac diseases. However, the performance of relevant algorithms is significantly affected by the proper fusion of the multi-modal information. Furthermore, particular diseases, such as myocardial infarction, display irregular shapes on images and occupy small regions at random locations. These facts make pathology segmentation of multi-modal CMR images a challenging task. In this paper, we present the Max-Fusion U-Net that achieves improved pathology segmentation performance given aligned multi-modal images of LGE, T2-weighted, and bSSFP modalities. Specifically, modality-specific features are extracted by dedicated encoders. Then they are fused with the pixel-wise maximum operator. Together with the corresponding encoding features, these representations are propagated to decoding layers with U-Net skip-connections. Furthermore, a spatial-attention module is applied in the last decoding layer to encourage the network to focus on those small semantically meaningful pathological regions that trigger relatively high responses by the network neurons. We also use a simple image patch extraction strategy to dynamically resample training examples with varying spacial and batch sizes. With limited GPU memory, this strategy reduces the imbalance of classes and forces the model to focus on regions around the interested pathology. It further improves segmentation accuracy and reduces the mis-classification of pathology. We evaluate our methods using the Myocardial pathology segmentation (MyoPS) combining the multi-sequence CMR dataset which involves three modalities. Extensive experiments demonstrate the effectiveness of the proposed model which outperforms the related baselines. The code is available at https://github.com/falconjhc/MFU-Net.

AB - Automatic segmentation of multi-sequence (multi-modal) cardiac MR (CMR) images plays a significant role in diagnosis and management for a variety of cardiac diseases. However, the performance of relevant algorithms is significantly affected by the proper fusion of the multi-modal information. Furthermore, particular diseases, such as myocardial infarction, display irregular shapes on images and occupy small regions at random locations. These facts make pathology segmentation of multi-modal CMR images a challenging task. In this paper, we present the Max-Fusion U-Net that achieves improved pathology segmentation performance given aligned multi-modal images of LGE, T2-weighted, and bSSFP modalities. Specifically, modality-specific features are extracted by dedicated encoders. Then they are fused with the pixel-wise maximum operator. Together with the corresponding encoding features, these representations are propagated to decoding layers with U-Net skip-connections. Furthermore, a spatial-attention module is applied in the last decoding layer to encourage the network to focus on those small semantically meaningful pathological regions that trigger relatively high responses by the network neurons. We also use a simple image patch extraction strategy to dynamically resample training examples with varying spacial and batch sizes. With limited GPU memory, this strategy reduces the imbalance of classes and forces the model to focus on regions around the interested pathology. It further improves segmentation accuracy and reduces the mis-classification of pathology. We evaluate our methods using the Myocardial pathology segmentation (MyoPS) combining the multi-sequence CMR dataset which involves three modalities. Extensive experiments demonstrate the effectiveness of the proposed model which outperforms the related baselines. The code is available at https://github.com/falconjhc/MFU-Net.

KW - Dynamic resample

KW - Max-fusion

KW - Multi-modal

KW - Pathology segmentation

UR - http://www.scopus.com/inward/record.url?scp=85098263944&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-65651-5_7

DO - 10.1007/978-3-030-65651-5_7

M3 - Conference Proceeding

AN - SCOPUS:85098263944

SN - 9783030656508

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 68

EP - 81

BT - Myocardial Pathology Segmentation Combining Multi-Sequence Cardiac Magnetic Resonance Images - First Challenge, MyoPS 2020, Held in Conjunction with MICCAI 2020, Proceedings

A2 - Zhuang, Xiahai

A2 - Li, Lei

PB - Springer Science and Business Media Deutschland GmbH

T2 - 1st Myocardial Pathology Segmentation Combining Multi-Sequence CMR Challenge, MyoPS 2020 held in conjunction with 23rd International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2020

Y2 - 4 October 2020 through 4 October 2020

ER -

Jiang H, Wang C, Chartsias A, Tsaftaris SA. Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling. In Zhuang X, Li L, editors, Myocardial Pathology Segmentation Combining Multi-Sequence Cardiac Magnetic Resonance Images - First Challenge, MyoPS 2020, Held in Conjunction with MICCAI 2020, Proceedings. Springer Science and Business Media Deutschland GmbH. 2020. p. 68-81. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-65651-5_7

Max-Fusion U-Net for Multi-modal Pathology Segmentation with Attention and Dynamic Resampling

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Cite this