A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management

Huanming Zhang; Zhengyong Jiang; Jionglong Su

doi:10.1109/ICBDA51983.2021.9403049

A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management

Huanming Zhang, Zhengyong Jiang, Jionglong Su

School of AI and Advanced Computing

University of Liverpool

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

11 Citations (Scopus)

Abstract

With the improvement of computer performance and the development of GPU-Accelerated technology, trading with machine learning algorithms has attracted the attention of many researchers and practitioners. In this research, we propose a novel portfolio management strategy based on the framework of Deep Deterministic Policy Gradient, a policy-based reinforcement learning framework, and compare its performance to that of other trading strategies. In our framework, two Long Short-Term Memory neural networks and two fully connected neural networks are constructed. We also investigate the performance of our strategy with and without transaction costs. Experimentally, we choose eight US stocks consisting of four low-volatility stocks and four high-volatility stocks. We compare the compound annual return rate of our strategy against seven other strategies, e.g., Uniform Buy and Hold, Exponential Gradient and Universal Portfolios. In our case, the compound annual return rate is 14.12%, outperforming all other strategies. Furthermore, in terms of Sharpe Ratio (0.5988), our strategy is nearly 33% higher than that of the second-best performing strategy.

Original language	English
Title of host publication	2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	230-238
Number of pages	9
ISBN (Electronic)	9780738131672
DOIs	https://doi.org/10.1109/ICBDA51983.2021.9403049
Publication status	Published - 5 Mar 2021
Event	6th IEEE International Conference on Big Data Analytics, ICBDA 2021 - Xiamen, China Duration: 5 Mar 2021 → 8 Mar 2021

Publication series

Name	2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021

Conference

Conference	6th IEEE International Conference on Big Data Analytics, ICBDA 2021
Country/Territory	China
City	Xiamen
Period	5/03/21 → 8/03/21

Keywords

Deep Learning
Portfolio Management
Reinforcement Learning

Access to Document

10.1109/ICBDA51983.2021.9403049

Cite this

Zhang, H., Jiang, Z., & Su, J. (2021). A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management. In 2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021 (pp. 230-238). Article 9403049 (2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICBDA51983.2021.9403049

@inproceedings{0acea9c68b984fcab280d01465520c61,

title = "A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management",

abstract = "With the improvement of computer performance and the development of GPU-Accelerated technology, trading with machine learning algorithms has attracted the attention of many researchers and practitioners. In this research, we propose a novel portfolio management strategy based on the framework of Deep Deterministic Policy Gradient, a policy-based reinforcement learning framework, and compare its performance to that of other trading strategies. In our framework, two Long Short-Term Memory neural networks and two fully connected neural networks are constructed. We also investigate the performance of our strategy with and without transaction costs. Experimentally, we choose eight US stocks consisting of four low-volatility stocks and four high-volatility stocks. We compare the compound annual return rate of our strategy against seven other strategies, e.g., Uniform Buy and Hold, Exponential Gradient and Universal Portfolios. In our case, the compound annual return rate is 14.12%, outperforming all other strategies. Furthermore, in terms of Sharpe Ratio (0.5988), our strategy is nearly 33% higher than that of the second-best performing strategy.",

keywords = "Deep Learning, Portfolio Management, Reinforcement Learning",

author = "Huanming Zhang and Zhengyong Jiang and Jionglong Su",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 6th IEEE International Conference on Big Data Analytics, ICBDA 2021 ; Conference date: 05-03-2021 Through 08-03-2021",

year = "2021",

month = mar,

day = "5",

doi = "10.1109/ICBDA51983.2021.9403049",

language = "English",

series = "2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "230--238",

booktitle = "2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021",

}

Zhang, H, Jiang, Z & Su, J 2021, A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management. in 2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021., 9403049, 2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021, Institute of Electrical and Electronics Engineers Inc., pp. 230-238, 6th IEEE International Conference on Big Data Analytics, ICBDA 2021, Xiamen, China, 5/03/21. https://doi.org/10.1109/ICBDA51983.2021.9403049

A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management. / Zhang, Huanming; Jiang, Zhengyong ; Su, Jionglong.
2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021. Institute of Electrical and Electronics Engineers Inc., 2021. p. 230-238 9403049 (2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management

AU - Zhang, Huanming

AU - Jiang, Zhengyong

AU - Su, Jionglong

PY - 2021/3/5

Y1 - 2021/3/5

N2 - With the improvement of computer performance and the development of GPU-Accelerated technology, trading with machine learning algorithms has attracted the attention of many researchers and practitioners. In this research, we propose a novel portfolio management strategy based on the framework of Deep Deterministic Policy Gradient, a policy-based reinforcement learning framework, and compare its performance to that of other trading strategies. In our framework, two Long Short-Term Memory neural networks and two fully connected neural networks are constructed. We also investigate the performance of our strategy with and without transaction costs. Experimentally, we choose eight US stocks consisting of four low-volatility stocks and four high-volatility stocks. We compare the compound annual return rate of our strategy against seven other strategies, e.g., Uniform Buy and Hold, Exponential Gradient and Universal Portfolios. In our case, the compound annual return rate is 14.12%, outperforming all other strategies. Furthermore, in terms of Sharpe Ratio (0.5988), our strategy is nearly 33% higher than that of the second-best performing strategy.

AB - With the improvement of computer performance and the development of GPU-Accelerated technology, trading with machine learning algorithms has attracted the attention of many researchers and practitioners. In this research, we propose a novel portfolio management strategy based on the framework of Deep Deterministic Policy Gradient, a policy-based reinforcement learning framework, and compare its performance to that of other trading strategies. In our framework, two Long Short-Term Memory neural networks and two fully connected neural networks are constructed. We also investigate the performance of our strategy with and without transaction costs. Experimentally, we choose eight US stocks consisting of four low-volatility stocks and four high-volatility stocks. We compare the compound annual return rate of our strategy against seven other strategies, e.g., Uniform Buy and Hold, Exponential Gradient and Universal Portfolios. In our case, the compound annual return rate is 14.12%, outperforming all other strategies. Furthermore, in terms of Sharpe Ratio (0.5988), our strategy is nearly 33% higher than that of the second-best performing strategy.

KW - Deep Learning

KW - Portfolio Management

KW - Reinforcement Learning

UR - http://www.scopus.com/inward/record.url?scp=85105302545&partnerID=8YFLogxK

U2 - 10.1109/ICBDA51983.2021.9403049

DO - 10.1109/ICBDA51983.2021.9403049

M3 - Conference Proceeding

AN - SCOPUS:85105302545

T3 - 2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021

SP - 230

EP - 238

BT - 2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 6th IEEE International Conference on Big Data Analytics, ICBDA 2021

Y2 - 5 March 2021 through 8 March 2021

ER -

Zhang H, Jiang Z , Su J. A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management. In 2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021. Institute of Electrical and Electronics Engineers Inc. 2021. p. 230-238. 9403049. (2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021). doi: 10.1109/ICBDA51983.2021.9403049

A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this