Enhancing stock timing predictions based on multimodal architecture: Leveraging large language models (LLMs) for text quality improvement

Mingming Chen; Yifan Tang; Qi Qi; Hongyi Dai; Yi Lin; Chengxiu Ling; Tenglong Li

doi:10.1371/journal.pone.0326034

Enhancing stock timing predictions based on multimodal architecture: Leveraging large language models (LLMs) for text quality improvement

Mingming Chen, Yifan Tang, Qi Qi, Hongyi Dai, Yi Lin, Chengxiu Ling, Tenglong Li^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

This study aims to enhance stock timing predictions by leveraging large language models (LLMs), specifically GPT-4, to filter and analyze online investor comment data. Recognizing challenges such as variable comment quality, redundancy, and authenticity issues, we propose a multimodal architecture that integrates filtered comment data with stock price dynamics and technical indicators. Using data from nine Chinese banks, we compare four filtering models and demonstrate that employing GPT-4 significantly improves financial metrics like profit-loss ratio, win rate, and excess return rate. The multimodal architecture outperforms baseline models by effectively preprocessing comment data and combining it with quantitative financial data. While focused on Chinese banks, the approach can be adapted to broader markets by modifying the prompts of large language models. Our findings highlight the potential of LLMs in financial forecasting and provide more reliable decision support for investors.

Original language	English
Article number	e0326034
Journal	PLoS ONE
Volume	20
Issue number	6 June
DOIs	https://doi.org/10.1371/journal.pone.0326034
Publication status	Published - Jun 2025

Access to Document

10.1371/journal.pone.0326034

Cite this

@article{166be1b517ac41f0913bded616d2612b,

title = "Enhancing stock timing predictions based on multimodal architecture: Leveraging large language models (LLMs) for text quality improvement",

abstract = "This study aims to enhance stock timing predictions by leveraging large language models (LLMs), specifically GPT-4, to filter and analyze online investor comment data. Recognizing challenges such as variable comment quality, redundancy, and authenticity issues, we propose a multimodal architecture that integrates filtered comment data with stock price dynamics and technical indicators. Using data from nine Chinese banks, we compare four filtering models and demonstrate that employing GPT-4 significantly improves financial metrics like profit-loss ratio, win rate, and excess return rate. The multimodal architecture outperforms baseline models by effectively preprocessing comment data and combining it with quantitative financial data. While focused on Chinese banks, the approach can be adapted to broader markets by modifying the prompts of large language models. Our findings highlight the potential of LLMs in financial forecasting and provide more reliable decision support for investors.",

author = "Mingming Chen and Yifan Tang and Qi Qi and Hongyi Dai and Yi Lin and Chengxiu Ling and Tenglong Li",

note = "Publisher Copyright: {\textcopyright} 2025 Chen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.",

year = "2025",

month = jun,

doi = "10.1371/journal.pone.0326034",

language = "English",

volume = "20",

journal = "PLoS ONE",

issn = "1932-6203",

number = "6 June",

}

TY - JOUR

T1 - Enhancing stock timing predictions based on multimodal architecture

T2 - Leveraging large language models (LLMs) for text quality improvement

AU - Chen, Mingming

AU - Tang, Yifan

AU - Qi, Qi

AU - Dai, Hongyi

AU - Lin, Yi

AU - Ling, Chengxiu

AU - Li, Tenglong

N1 - Publisher Copyright: © 2025 Chen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PY - 2025/6

Y1 - 2025/6

N2 - This study aims to enhance stock timing predictions by leveraging large language models (LLMs), specifically GPT-4, to filter and analyze online investor comment data. Recognizing challenges such as variable comment quality, redundancy, and authenticity issues, we propose a multimodal architecture that integrates filtered comment data with stock price dynamics and technical indicators. Using data from nine Chinese banks, we compare four filtering models and demonstrate that employing GPT-4 significantly improves financial metrics like profit-loss ratio, win rate, and excess return rate. The multimodal architecture outperforms baseline models by effectively preprocessing comment data and combining it with quantitative financial data. While focused on Chinese banks, the approach can be adapted to broader markets by modifying the prompts of large language models. Our findings highlight the potential of LLMs in financial forecasting and provide more reliable decision support for investors.

AB - This study aims to enhance stock timing predictions by leveraging large language models (LLMs), specifically GPT-4, to filter and analyze online investor comment data. Recognizing challenges such as variable comment quality, redundancy, and authenticity issues, we propose a multimodal architecture that integrates filtered comment data with stock price dynamics and technical indicators. Using data from nine Chinese banks, we compare four filtering models and demonstrate that employing GPT-4 significantly improves financial metrics like profit-loss ratio, win rate, and excess return rate. The multimodal architecture outperforms baseline models by effectively preprocessing comment data and combining it with quantitative financial data. While focused on Chinese banks, the approach can be adapted to broader markets by modifying the prompts of large language models. Our findings highlight the potential of LLMs in financial forecasting and provide more reliable decision support for investors.

UR - http://www.scopus.com/inward/record.url?scp=105008471640&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0326034

DO - 10.1371/journal.pone.0326034

M3 - Article

C2 - 40531828

AN - SCOPUS:105008471640

SN - 1932-6203

VL - 20

JO - PLoS ONE

JF - PLoS ONE

IS - 6 June

M1 - e0326034

ER -

Enhancing stock timing predictions based on multimodal architecture: Leveraging large language models (LLMs) for text quality improvement

Abstract

Access to Document

Other files and links

Cite this