Innovative application of large language model in the prediction of potential depression in the elderly: based on CHARLS dataset

Research output: Contribution to journalArticlepeer-review

Abstract

Screening for late-life depression remains difficult in rapidly ageing societies due to heterogeneous symptoms, uneven resources across regions, and the lack of simple tools that work at scale and over time. We address this by rendering each respondent’s CHARLS record into a concise, templated natural-language summary that a large language model can comprehend, unifying structured health, functional, socioeconomic, and social-support fields (and, where available, interview text) in a single representation. We then fine-tune instruction models (DeepSeek and Qwen) with parameter-efficient adapters (LoRA) on attention projections, alongside explicit handling of missingness and class imbalance, and with de-identification for governance. In our setup, LoRA drastically reduces the number of trainable parameters, helping to curb compute and overfitting while keeping accuracy. On a held-out split of CHARLS, DeepSeek variants achieve macro-F1 around 80% (weighted-F1 83%) and outperform Qwen variants of comparable size, with little additional gain from scaling 1.5B to 7B. Class-wise results show the depression-positive class remains harder than the non-depression class, consistent with residual imbalance and symptom heterogeneity. To our knowledge, this is the first study to treat complete survey records as text for geriatric depression screening on CHARLS and to demonstrate that compact LLMs adapted with LoRA can achieve strong performance without requiring table-specific architectures, within a transparent, reproducible, and structured-to-text pipeline. The approach is readily applicable to other survey-based screening tasks. We also state the current limitations and outline how these will be addressed in future validation and deployment.

Original languageEnglish
Article number42
JournalMultimedia Systems
Volume32
Issue number1
DOIs
Publication statusPublished - Feb 2026

Keywords

  • CHARLS
  • DeepSeek
  • Depression
  • Large language model
  • Mental health
  • Public health
  • Qwen

Cite this