TY - JOUR
T1 - MSM-UNet
T2 - A medical image segmentation method based on wavelet transform and multi-scale Mamba-UNet
AU - Sun, Junding
AU - Chen, Kaixin
AU - Wu, Xiaosheng
AU - Xu, Zhaozhao
AU - Wang, Shuihua
AU - Zhang, Yudong
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/9/1
Y1 - 2025/9/1
N2 - In the field of medical image processing, combining global and local relationship modeling is an effective method for achieving precise image segmentation. Previous studies have demonstrated the remarkable performance of Convolutional Neural Networks (CNNs) in local relationship modeling, while Transformer can directly establish interactions between any two points in an image, thereby effectively capturing global contextual information. However, the application of Transformer to address the shortcomings of Convolutional Neural Networks (CNNs) in modeling global relationships is hindered by their substantial computational complexity and substantial memory demands, posing significant challenges in practice. To address this issue, this paper introduces the Mamba model, a State Space Model (SSM) that exhibits notable advantages in modeling long-range dependencies in sequential data. Inspired by the success of the Mamba model, a two-dimensional medical image segmentation model named MSM-UNet is designed. This model employs a Multi-Scale Mamba feature extraction block (MSMamba), a Wavelet Transform Feature Enhancement Attention Block (WTFEAB), a Feature Enhancement Merge Block (FEMB), and a Fusion Output Layer (FOL), aiming to accurately capture and integrate long-range and local dependencies among multi-scale features. Compared to Transformer-based methods, MSM-UNet exhibits superior performance in holistic feature modeling, significantly improving segmentation accuracy. Tests conducted on the Automatic Cardiac Diagnosis Challenge (ACDC) dataset, the Synapse multi-organ CT abdominal segmentation dataset, and the Colorectal Cancer-Clinic (CVC-ClinicDB) dataset demonstrated that the MSM-UNet achieved Dice coefficients of 92.02, 83.10, and 94.03, respectively. These results comprehensively validate the efficacy and practicality of the proposed MSM-UNet architecture.
AB - In the field of medical image processing, combining global and local relationship modeling is an effective method for achieving precise image segmentation. Previous studies have demonstrated the remarkable performance of Convolutional Neural Networks (CNNs) in local relationship modeling, while Transformer can directly establish interactions between any two points in an image, thereby effectively capturing global contextual information. However, the application of Transformer to address the shortcomings of Convolutional Neural Networks (CNNs) in modeling global relationships is hindered by their substantial computational complexity and substantial memory demands, posing significant challenges in practice. To address this issue, this paper introduces the Mamba model, a State Space Model (SSM) that exhibits notable advantages in modeling long-range dependencies in sequential data. Inspired by the success of the Mamba model, a two-dimensional medical image segmentation model named MSM-UNet is designed. This model employs a Multi-Scale Mamba feature extraction block (MSMamba), a Wavelet Transform Feature Enhancement Attention Block (WTFEAB), a Feature Enhancement Merge Block (FEMB), and a Fusion Output Layer (FOL), aiming to accurately capture and integrate long-range and local dependencies among multi-scale features. Compared to Transformer-based methods, MSM-UNet exhibits superior performance in holistic feature modeling, significantly improving segmentation accuracy. Tests conducted on the Automatic Cardiac Diagnosis Challenge (ACDC) dataset, the Synapse multi-organ CT abdominal segmentation dataset, and the Colorectal Cancer-Clinic (CVC-ClinicDB) dataset demonstrated that the MSM-UNet achieved Dice coefficients of 92.02, 83.10, and 94.03, respectively. These results comprehensively validate the efficacy and practicality of the proposed MSM-UNet architecture.
KW - Boundary enhancement
KW - CNN
KW - Mamba
KW - Medical image segmentation
KW - Multi-scale fusion
UR - http://www.scopus.com/inward/record.url?scp=105005941805&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2025.128241
DO - 10.1016/j.eswa.2025.128241
M3 - Article
AN - SCOPUS:105005941805
SN - 0957-4174
VL - 288
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 128241
ER -