DSGA-Net: Deeply separable gated transformer and attention strategy for medical image segmentation network

Junding Sun; Jiuqiang Zhao; Xiaosheng Wu; Chaosheng Tang; Shuihua Wang; Yudong Zhang

doi:10.1016/j.jksuci.2023.04.006

DSGA-Net: Deeply separable gated transformer and attention strategy for medical image segmentation network

Junding Sun, Jiuqiang Zhao, Xiaosheng Wu, Chaosheng Tang, Shuihua Wang, Yudong Zhang^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

15 Citations (Scopus)

Abstract

To address the problems of under-segmentation and over-segmentation of small organs in medical image segmentation. We present a novel medical image segmentation network model with Depth Separable Gating Transformer and a Three-branch Attention module (DSGA-Net). Firstly, the model adds a Depth Separable Gated Visual Transformer (DSG-ViT) module into its Encoder to enhance (i) the contextual links among global, local, and channels and (ii) the sensitivity to location information. Secondly, a Mixed Three-branch Attention (MTA) module is proposed to increase the number of features in the up-sampling process. Meanwhile, the loss of feature information is reduced when restoring the feature image to the original image size. By validating Synapse, BraTs2020, and ACDC public datasets, the Dice Similarity Coefficient (DSC) of the results of DSGA-Net reached 81.24%,85.82%, and 91.34%, respectively. Moreover, the Hausdorff Score (HD) decreased to 20.91% and 5.27% on the Synapse and BraTs2020. There are 10.78% and 0.69% decreases compared to the Baseline TransUNet. The experimental results indicate that DSGA-Net achieves better segmentation than most advanced methods.

Original language	English
Article number	101553
Journal	Journal of King Saud University - Computer and Information Sciences
Volume	35
Issue number	5
DOIs	https://doi.org/10.1016/j.jksuci.2023.04.006
Publication status	Published - May 2023
Externally published	Yes

Keywords

Depth separable
Gated attention mechanism
Medical image segmentation
Transformer

Access to Document

10.1016/j.jksuci.2023.04.006

Cite this

@article{63b6a88c6ab141279c271587ad76b54a,

title = "DSGA-Net: Deeply separable gated transformer and attention strategy for medical image segmentation network",

abstract = "To address the problems of under-segmentation and over-segmentation of small organs in medical image segmentation. We present a novel medical image segmentation network model with Depth Separable Gating Transformer and a Three-branch Attention module (DSGA-Net). Firstly, the model adds a Depth Separable Gated Visual Transformer (DSG-ViT) module into its Encoder to enhance (i) the contextual links among global, local, and channels and (ii) the sensitivity to location information. Secondly, a Mixed Three-branch Attention (MTA) module is proposed to increase the number of features in the up-sampling process. Meanwhile, the loss of feature information is reduced when restoring the feature image to the original image size. By validating Synapse, BraTs2020, and ACDC public datasets, the Dice Similarity Coefficient (DSC) of the results of DSGA-Net reached 81.24%,85.82%, and 91.34%, respectively. Moreover, the Hausdorff Score (HD) decreased to 20.91% and 5.27% on the Synapse and BraTs2020. There are 10.78% and 0.69% decreases compared to the Baseline TransUNet. The experimental results indicate that DSGA-Net achieves better segmentation than most advanced methods.",

keywords = "Depth separable, Gated attention mechanism, Medical image segmentation, Transformer",

author = "Junding Sun and Jiuqiang Zhao and Xiaosheng Wu and Chaosheng Tang and Shuihua Wang and Yudong Zhang",

note = "Publisher Copyright: {\textcopyright} 2023 The Author(s)",

year = "2023",

month = may,

doi = "10.1016/j.jksuci.2023.04.006",

language = "English",

volume = "35",

journal = "Journal of King Saud University - Computer and Information Sciences",

issn = "1319-1578",

number = "5",

}

TY - JOUR

T1 - DSGA-Net

T2 - Deeply separable gated transformer and attention strategy for medical image segmentation network

AU - Sun, Junding

AU - Zhao, Jiuqiang

AU - Wu, Xiaosheng

AU - Tang, Chaosheng

AU - Wang, Shuihua

AU - Zhang, Yudong

PY - 2023/5

Y1 - 2023/5

N2 - To address the problems of under-segmentation and over-segmentation of small organs in medical image segmentation. We present a novel medical image segmentation network model with Depth Separable Gating Transformer and a Three-branch Attention module (DSGA-Net). Firstly, the model adds a Depth Separable Gated Visual Transformer (DSG-ViT) module into its Encoder to enhance (i) the contextual links among global, local, and channels and (ii) the sensitivity to location information. Secondly, a Mixed Three-branch Attention (MTA) module is proposed to increase the number of features in the up-sampling process. Meanwhile, the loss of feature information is reduced when restoring the feature image to the original image size. By validating Synapse, BraTs2020, and ACDC public datasets, the Dice Similarity Coefficient (DSC) of the results of DSGA-Net reached 81.24%,85.82%, and 91.34%, respectively. Moreover, the Hausdorff Score (HD) decreased to 20.91% and 5.27% on the Synapse and BraTs2020. There are 10.78% and 0.69% decreases compared to the Baseline TransUNet. The experimental results indicate that DSGA-Net achieves better segmentation than most advanced methods.

AB - To address the problems of under-segmentation and over-segmentation of small organs in medical image segmentation. We present a novel medical image segmentation network model with Depth Separable Gating Transformer and a Three-branch Attention module (DSGA-Net). Firstly, the model adds a Depth Separable Gated Visual Transformer (DSG-ViT) module into its Encoder to enhance (i) the contextual links among global, local, and channels and (ii) the sensitivity to location information. Secondly, a Mixed Three-branch Attention (MTA) module is proposed to increase the number of features in the up-sampling process. Meanwhile, the loss of feature information is reduced when restoring the feature image to the original image size. By validating Synapse, BraTs2020, and ACDC public datasets, the Dice Similarity Coefficient (DSC) of the results of DSGA-Net reached 81.24%,85.82%, and 91.34%, respectively. Moreover, the Hausdorff Score (HD) decreased to 20.91% and 5.27% on the Synapse and BraTs2020. There are 10.78% and 0.69% decreases compared to the Baseline TransUNet. The experimental results indicate that DSGA-Net achieves better segmentation than most advanced methods.

KW - Depth separable

KW - Gated attention mechanism

KW - Medical image segmentation

KW - Transformer

UR - http://www.scopus.com/inward/record.url?scp=85152746846&partnerID=8YFLogxK

U2 - 10.1016/j.jksuci.2023.04.006

DO - 10.1016/j.jksuci.2023.04.006

M3 - Article

AN - SCOPUS:85152746846

SN - 1319-1578

VL - 35

JO - Journal of King Saud University - Computer and Information Sciences

JF - Journal of King Saud University - Computer and Information Sciences

IS - 5

M1 - 101553

ER -

DSGA-Net: Deeply separable gated transformer and attention strategy for medical image segmentation network

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this