TY - JOUR
T1 - Enhancing stock timing predictions based on multimodal architecture
T2 - Leveraging large language models (LLMs) for text quality improvement
AU - Chen, Mingming
AU - Tang, Yifan
AU - Qi, Qi
AU - Dai, Hongyi
AU - Lin, Yi
AU - Ling, Chengxiu
AU - Li, Tenglong
N1 - Publisher Copyright:
© 2025 Chen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2025/6
Y1 - 2025/6
N2 - This study aims to enhance stock timing predictions by leveraging large language models (LLMs), specifically GPT-4, to filter and analyze online investor comment data. Recognizing challenges such as variable comment quality, redundancy, and authenticity issues, we propose a multimodal architecture that integrates filtered comment data with stock price dynamics and technical indicators. Using data from nine Chinese banks, we compare four filtering models and demonstrate that employing GPT-4 significantly improves financial metrics like profit-loss ratio, win rate, and excess return rate. The multimodal architecture outperforms baseline models by effectively preprocessing comment data and combining it with quantitative financial data. While focused on Chinese banks, the approach can be adapted to broader markets by modifying the prompts of large language models. Our findings highlight the potential of LLMs in financial forecasting and provide more reliable decision support for investors.
AB - This study aims to enhance stock timing predictions by leveraging large language models (LLMs), specifically GPT-4, to filter and analyze online investor comment data. Recognizing challenges such as variable comment quality, redundancy, and authenticity issues, we propose a multimodal architecture that integrates filtered comment data with stock price dynamics and technical indicators. Using data from nine Chinese banks, we compare four filtering models and demonstrate that employing GPT-4 significantly improves financial metrics like profit-loss ratio, win rate, and excess return rate. The multimodal architecture outperforms baseline models by effectively preprocessing comment data and combining it with quantitative financial data. While focused on Chinese banks, the approach can be adapted to broader markets by modifying the prompts of large language models. Our findings highlight the potential of LLMs in financial forecasting and provide more reliable decision support for investors.
UR - http://www.scopus.com/inward/record.url?scp=105008471640&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0326034
DO - 10.1371/journal.pone.0326034
M3 - Article
C2 - 40531828
AN - SCOPUS:105008471640
SN - 1932-6203
VL - 20
JO - PLoS ONE
JF - PLoS ONE
IS - 6 June
M1 - e0326034
ER -