Skip to main navigation Skip to search Skip to main content

MSVM-UNet: Multi-Scale Spatial Attention Enhanced Vision Mamba U-Net for Agricultural Disease Segmentation

  • Lin Shi
  • , Xinyu Liu
  • , Li Zhao
  • , Haiyang Zhang*
  • , Zhanlin Ji*
  • *Corresponding author for this work
  • North China University of Science and Technology
  • Tsinghua National Laboratory for Information Science and Technology
  • Zhejiang Agriculture and Forestry University

Research output: Contribution to journalArticlepeer-review

Abstract

Agricultural diseased leaf image segmentation is a critical technology for precision agriculture and intelligent crop protection. To overcome the limitations of current segmentation methods-such as imprecise leaf edge extraction, difficulty in detecting small disease lesions, and insufficient robustness in complex backgrounds-this paper proposes an agricultural diseased leaf image segmentation method based on an enhanced visual state space model, named MSVM-UNet (Multi-Scale Spatial Attention Vision Mamba U-Net). This method employs an encoder-decoder framework and integrates improved Visual State Space (VSS) modules in both the encoder and decoder, enhancing long-range dependency modeling and local-global feature fusion. Simultaneously, a Multi-Scale Spatial Attention (MSSA) module is introduced in the skip connections to enhance cross-scale feature representation and capture fine boundary details of disease spots. To simulate real field imaging conditions, we perform random horizontal or vertical flips on the images and randomly adjust hue, saturation, and brightness before training. Experimental results demonstrate that, compared with mainstream methods, MSVM-UNet achieves significant performance improvement in agricultural diseased leaf segmentation tasks, reaching 80.44% mIoU and 92.56% Dice on the validation set, providing our solution for intelligent agricultural disease monitoring.

Original languageEnglish
JournalIEEE Signal Processing Letters
DOIs
Publication statusAccepted/In press - 2026

Keywords

  • Agricultural diseased leaf segmentation
  • Mamba U-Net
  • multi-scale attention
  • visual state-space model

Cite this