AgriGPT: A Strong Plant Disease Detection Model via Visual-Language Model

Jing Zheng, Jia Wang*

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

Plant diseases pose critical threats to global agricultural productivity and food security, incurring substantial economic losses annually. While traditional detection methods depend on labor-intensive manual observation, current deep learning approaches remain limited by reliance on unimodal visual data and struggle to provide interpret or actionable agricultural insights. To bridge this gap, we present AgriGPT, a comprehensive Visual-Language framework that synergistically integrates image analysis with textual context to enable precise and explainable detection of plant diseases. AgriGPT leverages ImageBind-Huge for robust cross-modal encoding and Vicuna-7B for generating human-readable diagnostic reports with domain-specific knowledge. A key innovation is our Learnable Clustering Module (LCM), which dynamically models normal plant feature distributions using Adaptive and Progressive Updating Mechanisms, enabling precise anomaly detection under data scarcity or class imbalance. By integrating an Encoding-Decoding Module, an LCM, and a Language Module, AgriGPT achieves state-of-the-art performance in localized anomaly localization, disease classification, and actionable treatment recommendations. Extensive experiments across multiple plant categories demonstrate improvements in Image-AUC ranging from 2.45% to 17.81% and in Pixel-AUC from 0.17% to 14.67% compared to existing models. To accelerate research in agricultural AI, we open source our code and a curated multifaceted dataset (covering nine plant and thirteen disease categories) at https://github.com/zzz123nnn/AgriGPT.

Original languageEnglish
Title of host publicationAdvanced Intelligent Computing Technology and Applications - 21st International Conference, ICIC 2025, Proceedings
EditorsDe-Shuang Huang, Wei Chen, Yijie Pan, Haiming Chen
PublisherSpringer Science and Business Media Deutschland GmbH
Pages239-250
Number of pages12
ISBN (Print)9789819699209
DOIs
Publication statusPublished - 2025
Event21st International Conference on Intelligent Computing, ICIC 2025 - Ningbo, China
Duration: 26 Jul 202529 Jul 2025

Publication series

NameLecture Notes in Computer Science
Volume15857 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Intelligent Computing, ICIC 2025
Country/TerritoryChina
CityNingbo
Period26/07/2529/07/25

Keywords

  • Agricultural Large Model
  • Feature Cluster
  • Multimodal Learning
  • Plant Disease Detection
  • Smart Agriculture
  • Visual-Language Model

Fingerprint

Dive into the research topics of 'AgriGPT: A Strong Plant Disease Detection Model via Visual-Language Model'. Together they form a unique fingerprint.

Cite this