TY - GEN
T1 - A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management
AU - Zhang, Huanming
AU - Jiang, Zhengyong
AU - Su, Jionglong
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/3/5
Y1 - 2021/3/5
N2 - With the improvement of computer performance and the development of GPU-Accelerated technology, trading with machine learning algorithms has attracted the attention of many researchers and practitioners. In this research, we propose a novel portfolio management strategy based on the framework of Deep Deterministic Policy Gradient, a policy-based reinforcement learning framework, and compare its performance to that of other trading strategies. In our framework, two Long Short-Term Memory neural networks and two fully connected neural networks are constructed. We also investigate the performance of our strategy with and without transaction costs. Experimentally, we choose eight US stocks consisting of four low-volatility stocks and four high-volatility stocks. We compare the compound annual return rate of our strategy against seven other strategies, e.g., Uniform Buy and Hold, Exponential Gradient and Universal Portfolios. In our case, the compound annual return rate is 14.12%, outperforming all other strategies. Furthermore, in terms of Sharpe Ratio (0.5988), our strategy is nearly 33% higher than that of the second-best performing strategy.
AB - With the improvement of computer performance and the development of GPU-Accelerated technology, trading with machine learning algorithms has attracted the attention of many researchers and practitioners. In this research, we propose a novel portfolio management strategy based on the framework of Deep Deterministic Policy Gradient, a policy-based reinforcement learning framework, and compare its performance to that of other trading strategies. In our framework, two Long Short-Term Memory neural networks and two fully connected neural networks are constructed. We also investigate the performance of our strategy with and without transaction costs. Experimentally, we choose eight US stocks consisting of four low-volatility stocks and four high-volatility stocks. We compare the compound annual return rate of our strategy against seven other strategies, e.g., Uniform Buy and Hold, Exponential Gradient and Universal Portfolios. In our case, the compound annual return rate is 14.12%, outperforming all other strategies. Furthermore, in terms of Sharpe Ratio (0.5988), our strategy is nearly 33% higher than that of the second-best performing strategy.
KW - Deep Learning
KW - Portfolio Management
KW - Reinforcement Learning
UR - http://www.scopus.com/inward/record.url?scp=85105302545&partnerID=8YFLogxK
U2 - 10.1109/ICBDA51983.2021.9403049
DO - 10.1109/ICBDA51983.2021.9403049
M3 - Conference Proceeding
AN - SCOPUS:85105302545
T3 - 2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021
SP - 230
EP - 238
BT - 2021 IEEE 6th International Conference on Big Data Analytics, ICBDA 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th IEEE International Conference on Big Data Analytics, ICBDA 2021
Y2 - 5 March 2021 through 8 March 2021
ER -