Multi-level Adversarial Training for Stock Sentiment Prediction

Zimu Wang; Hong Seng Gan

doi:10.1109/CCAI57533.2023.10201295

Zimu Wang*, Hong Seng Gan

*Corresponding author for this work

School of AI and Advanced Computing

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

4 Citations (Scopus)

Abstract

Stock sentiment prediction is the task of evaluating whether investors expect or are gaining a positive or negative return from a stock, which correlates strongly with investors' sentiment towards the business. However, owing to the nature of social media, text posted by ordinary users is usually noisy, inconsistent, and even grammatically incorrect, leading models to generate unsatisfactory predictions. In this paper, we improve the performance of stock sentiment prediction by applying and comparing adversarial training at the character, word, and sentence levels, using three novel adversarial attack models: DeepWordBug, BAE, and a Generative Adversarial Network (GAN). We also propose an effective pre-processing technique and a novel method for incorporating adversarial examples to improve the prediction results. For an objective evaluation, we select three backbone models (Embedding Bag, BERT, and RoBERTa-Twitter) and validate them before and after adversarial training on the TweetFinSent dataset. Experimental results demonstrate remarkable improvements in the models after adversarial training, and the RoBERTa-Twitter model with word-level adversarial training performs best among the models tested. We conclude that sentence-level and word-level adversarial training are the most appropriate for deep learning and pre-trained language models, respectively. We further conduct ablation studies to highlight the usefulness of our data pre-processing and adversarial example incorporation approaches, and a case study to display the adversarial examples generated by the proposed adversarial attack models.
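To make the idea of character-level adversarial training concrete, the following is a minimal sketch, not the authors' implementation. It assumes a simplified perturbation (randomly swapping two adjacent interior characters of a word, one of several transformations that character-level attacks in the style of DeepWordBug use) and assumes that adversarial examples are simply appended to the training set with their original labels. The function names, the `rate` parameter, and the sample tweets are all hypothetical; the paper's actual attack models and incorporation method are not described in this record.

```python
import random


def swap_adjacent_chars(word, rng):
    """Swap two adjacent interior characters of a word.

    Words shorter than 4 characters are returned unchanged, and the first
    and last characters always stay in place. If the two swapped characters
    are identical, the word comes back unchanged.
    """
    if len(word) < 4:
        return word
    i = rng.randrange(1, len(word) - 2)  # interior index, so i + 1 is also interior
    chars = list(word)
    chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)


def perturb_text(text, rate, rng):
    """Perturb each whitespace-separated word with probability `rate`."""
    return " ".join(
        swap_adjacent_chars(w, rng) if rng.random() < rate else w
        for w in text.split()
    )


def augment(examples, rate=0.3, seed=0):
    """Return the original (text, label) pairs followed by one perturbed copy of each.

    Assumption: labels are preserved under perturbation, which is the usual
    premise of adversarial data augmentation.
    """
    rng = random.Random(seed)
    augmented = list(examples)
    for text, label in examples:
        augmented.append((perturb_text(text, rate, rng), label))
    return augmented


if __name__ == "__main__":
    # Hypothetical tweets, not taken from TweetFinSent.
    data = [
        ("stock is going to the moon", "positive"),
        ("terrible earnings report today", "negative"),
    ]
    for text, label in augment(data, rate=1.0):
        print(label, "|", text)
```

A real character-level attack would typically score tokens by their influence on the classifier and perturb only the most important ones; the uniform random choice here stands in for that step purely for illustration.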

Original language	English
Title of host publication	2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	127-134
Number of pages	8
ISBN (Electronic)	9798350335262
DOIs	https://doi.org/10.1109/CCAI57533.2023.10201295
Publication status	Published - 2023
Event	3rd IEEE International Conference on Computer Communication and Artificial Intelligence, CCAI 2023 - Taiyuan, China
Duration	26 May 2023 → 28 May 2023

Publication series

Name	2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023

Conference

Conference	3rd IEEE International Conference on Computer Communication and Artificial Intelligence, CCAI 2023
Country/Territory	China
City	Taiyuan
Period	26/05/23 → 28/05/23

Keywords

Adversarial Training
Natural Language Processing
Sentiment Analysis
Stock Sentiment
Textual Adversarial Attack

Access to Document

10.1109/CCAI57533.2023.10201295

Cite this

Wang, Z., & Gan, H. S. (2023). Multi-level Adversarial Training for Stock Sentiment Prediction. In 2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023 (pp. 127-134). (2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/CCAI57533.2023.10201295

Wang, Zimu ; Gan, Hong Seng. / Multi-level Adversarial Training for Stock Sentiment Prediction. 2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023. Institute of Electrical and Electronics Engineers Inc., 2023. pp. 127-134 (2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023).

@inproceedings{6598c97e825f425598fb52c331ba1a59,

title = "Multi-level Adversarial Training for Stock Sentiment Prediction",

abstract = "Stock sentiment prediction is a task to evaluate whether the investors are expecting or gaining a positive or negative return from a stock, which has a high correlation with investors' sentiments towards the business. However, as the nature of social media, the textual information posted by ordinary people is usually noisy, inconsistent, and even grammatically incorrect, leading the model to generate unsatisfied predictions. In this paper, we improve the performance of stock sentiment prediction by applying and comparing adversarial training at multiple levels, including character, word, and sentence levels, with the utilization of three novel adversarial attack models: DeepWordBug, BAE, and Generative Adversarial Network (GAN). We also propose an effective pre-processing technique and a novel adversarial examples incorporation method to improve the prediction results. To make an objective evaluation, we select three backbone models: Embedding Bag, BERT, and RoBERTa-Twitter, and validate the models before and after adversarial training on the TweetFinSent dataset. Experimental results demonstrate remarkable improvements in the models after adversarial training, and the RoBERTa-Twitter model with word-level adversarial training performs optimally among the experimented models. We conclude that sentence-level and word-level adversarial training are the most appropriate for deep learning and pre-trained language models, respectively, and we further conduct ablation studies to highlight the usefulness of our data pre-processing and adversarial examples incorporation approaches and a case study to display the adversarial examples generated by the proposed adversarial attack models.",

keywords = "Adversarial Training, Natural Language Processing, Sentiment Analysis, Stock Sentiment, Textual Adversarial Attack",

author = "Wang, Zimu and Gan, {Hong Seng}",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 3rd IEEE International Conference on Computer Communication and Artificial Intelligence, CCAI 2023 ; Conference date: 26-05-2023 Through 28-05-2023",

year = "2023",

doi = "10.1109/CCAI57533.2023.10201295",

language = "English",

series = "2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "127--134",

booktitle = "2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023",

}

Wang, Z & Gan, HS 2023, Multi-level Adversarial Training for Stock Sentiment Prediction. in 2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023. 2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023, Institute of Electrical and Electronics Engineers Inc., pp. 127-134, 3rd IEEE International Conference on Computer Communication and Artificial Intelligence, CCAI 2023, Taiyuan, China, 26/05/23. https://doi.org/10.1109/CCAI57533.2023.10201295

Multi-level Adversarial Training for Stock Sentiment Prediction. / Wang, Zimu ; Gan, Hong Seng.
2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023. Institute of Electrical and Electronics Engineers Inc., 2023. p. 127-134 (2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023).

TY - GEN

T1 - Multi-level Adversarial Training for Stock Sentiment Prediction

AU - Wang, Zimu

AU - Gan, Hong Seng

PY - 2023

Y1 - 2023

N2 - Stock sentiment prediction is a task to evaluate whether the investors are expecting or gaining a positive or negative return from a stock, which has a high correlation with investors' sentiments towards the business. However, as the nature of social media, the textual information posted by ordinary people is usually noisy, inconsistent, and even grammatically incorrect, leading the model to generate unsatisfied predictions. In this paper, we improve the performance of stock sentiment prediction by applying and comparing adversarial training at multiple levels, including character, word, and sentence levels, with the utilization of three novel adversarial attack models: DeepWordBug, BAE, and Generative Adversarial Network (GAN). We also propose an effective pre-processing technique and a novel adversarial examples incorporation method to improve the prediction results. To make an objective evaluation, we select three backbone models: Embedding Bag, BERT, and RoBERTa-Twitter, and validate the models before and after adversarial training on the TweetFinSent dataset. Experimental results demonstrate remarkable improvements in the models after adversarial training, and the RoBERTa-Twitter model with word-level adversarial training performs optimally among the experimented models. We conclude that sentence-level and word-level adversarial training are the most appropriate for deep learning and pre-trained language models, respectively, and we further conduct ablation studies to highlight the usefulness of our data pre-processing and adversarial examples incorporation approaches and a case study to display the adversarial examples generated by the proposed adversarial attack models.

AB - Stock sentiment prediction is a task to evaluate whether the investors are expecting or gaining a positive or negative return from a stock, which has a high correlation with investors' sentiments towards the business. However, as the nature of social media, the textual information posted by ordinary people is usually noisy, inconsistent, and even grammatically incorrect, leading the model to generate unsatisfied predictions. In this paper, we improve the performance of stock sentiment prediction by applying and comparing adversarial training at multiple levels, including character, word, and sentence levels, with the utilization of three novel adversarial attack models: DeepWordBug, BAE, and Generative Adversarial Network (GAN). We also propose an effective pre-processing technique and a novel adversarial examples incorporation method to improve the prediction results. To make an objective evaluation, we select three backbone models: Embedding Bag, BERT, and RoBERTa-Twitter, and validate the models before and after adversarial training on the TweetFinSent dataset. Experimental results demonstrate remarkable improvements in the models after adversarial training, and the RoBERTa-Twitter model with word-level adversarial training performs optimally among the experimented models. We conclude that sentence-level and word-level adversarial training are the most appropriate for deep learning and pre-trained language models, respectively, and we further conduct ablation studies to highlight the usefulness of our data pre-processing and adversarial examples incorporation approaches and a case study to display the adversarial examples generated by the proposed adversarial attack models.

KW - Adversarial Training

KW - Natural Language Processing

KW - Sentiment Analysis

KW - Stock Sentiment

KW - Textual Adversarial Attack

UR - http://www.scopus.com/inward/record.url?scp=85169294596&partnerID=8YFLogxK

U2 - 10.1109/CCAI57533.2023.10201295

DO - 10.1109/CCAI57533.2023.10201295

M3 - Conference Proceeding

AN - SCOPUS:85169294596

T3 - 2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023

SP - 127

EP - 134

BT - 2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 3rd IEEE International Conference on Computer Communication and Artificial Intelligence, CCAI 2023

Y2 - 26 May 2023 through 28 May 2023

ER -

Wang Z, Gan HS. Multi-level Adversarial Training for Stock Sentiment Prediction. In 2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023. Institute of Electrical and Electronics Engineers Inc. 2023. p. 127-134. (2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence, CCAI 2023). doi: 10.1109/CCAI57533.2023.10201295