Skin Disease Classification with LVLMs: An Empirical Study

Xinyi Zeng, Zimu Wang, Haiyang Zhang*, Yiming Luo, Wei Wang

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

Skin diseases pose significant challenges to accurate and efficient diagnosis, often due to their diverse and complex representations. This study investigates the capabilities and limitations of Large Vision-Language Models (LVLMs) in addressing these challenges through skin disease classification tasks. We evaluated LVLMs in zero-shot, few-shot, and finetuning scenarios, exploring their performance, bias, and potential for improvement. Results show that LVLMs lack perceptual granularity in skin disease, though positive signals are also observed. Our findings underscore the necessity for domain- specific optimisation and highlight opportunities for advancing LVLMs in medical diagnostics through innovative strategies and collaborative efforts.

Original languageEnglish
Title of host publicationProceedings of the 2025 28th International Conference on Computer Supported Cooperative Work in Design, CSCWD 2025
EditorsWeiming Shen, Weiming Shen, Marie-Helene Abel, Nada Matta, Jean-Paul Barthes, Junzhou Luo, Jinghui Zhang, Haibin Zhu, Kunkun Peng
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2304-2309
Number of pages6
Edition2025
ISBN (Electronic)9798331513054
DOIs
Publication statusPublished - 2025
Event28th International Conference on Computer Supported Cooperative Work in Design, CSCWD 2025 - Compiegne, France
Duration: 5 May 20257 May 2025

Conference

Conference28th International Conference on Computer Supported Cooperative Work in Design, CSCWD 2025
Country/TerritoryFrance
CityCompiegne
Period5/05/257/05/25

Keywords

  • large vision-language models
  • medical intelligent diagnostics
  • skin disease classification

Fingerprint

Dive into the research topics of 'Skin Disease Classification with LVLMs: An Empirical Study'. Together they form a unique fingerprint.

Cite this