Abstract
Plant diseases pose critical threats to global agricultural productivity and food security, incurring substantial economic losses annually. While traditional detection methods depend on labor-intensive manual observation, current deep learning approaches remain limited by reliance on unimodal visual data and struggle to provide interpret or actionable agricultural insights. To bridge this gap, we present AgriGPT, a comprehensive Visual-Language framework that synergistically integrates image analysis with textual context to enable precise and explainable detection of plant diseases. AgriGPT leverages ImageBind-Huge for robust cross-modal encoding and Vicuna-7B for generating human-readable diagnostic reports with domain-specific knowledge. A key innovation is our Learnable Clustering Module (LCM), which dynamically models normal plant feature distributions using Adaptive and Progressive Updating Mechanisms, enabling precise anomaly detection under data scarcity or class imbalance. By integrating an Encoding-Decoding Module, an LCM, and a Language Module, AgriGPT achieves state-of-the-art performance in localized anomaly localization, disease classification, and actionable treatment recommendations. Extensive experiments across multiple plant categories demonstrate improvements in Image-AUC ranging from 2.45% to 17.81% and in Pixel-AUC from 0.17% to 14.67% compared to existing models. To accelerate research in agricultural AI, we open source our code and a curated multifaceted dataset (covering nine plant and thirteen disease categories) at https://github.com/zzz123nnn/AgriGPT.
| Original language | English |
|---|---|
| Title of host publication | Advanced Intelligent Computing Technology and Applications - 21st International Conference, ICIC 2025, Proceedings |
| Editors | De-Shuang Huang, Wei Chen, Yijie Pan, Haiming Chen |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 239-250 |
| Number of pages | 12 |
| ISBN (Print) | 9789819699209 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 21st International Conference on Intelligent Computing, ICIC 2025 - Ningbo, China Duration: 26 Jul 2025 → 29 Jul 2025 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 15857 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 21st International Conference on Intelligent Computing, ICIC 2025 |
|---|---|
| Country/Territory | China |
| City | Ningbo |
| Period | 26/07/25 → 29/07/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Agricultural Large Model
- Feature Cluster
- Multimodal Learning
- Plant Disease Detection
- Smart Agriculture
- Visual-Language Model
Fingerprint
Dive into the research topics of 'AgriGPT: A Strong Plant Disease Detection Model via Visual-Language Model'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver