TY - GEN
T1 - Dual Core Portfolio Strategy
T2 - 2024 International Conference on Mathematics and Machine Learning, ICMML 2024
AU - Cui, Xiangyu
AU - Sun, Ruoyu
AU - Zhou, Mian
AU - Su, Jionglong
AU - Wang, Chengyu
AU - Jiang, Zhengyong
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2025/1/13
Y1 - 2025/1/13
N2 - Reinforcement learning is gaining increasing popularity in portfolio management. However, in complex stock trading environments, agent-based algorithms often face challenges such as slow convergence and inadequate cooperation between agents. These challenges lead to learning inefficiencies, increased risk, and higher transaction costs, ultimately reducing the generalizability of the trading strategy. To address these issues, we propose a novel multi-agent algorithm, the Dual Core Portfolio Strategy (DC-PS), which integrates deterministic and stochastic policies to capitalize on their complementary strengths. In this strategy, the Deep Deterministic Policy Gradient agent is proficient in deterministic policy learning, while the Soft Actor-Critic agent enhances exploration and generalization through a stochastic policy. The agents collaborate by making decisions and interacting with the environment, sharing a centralized critic network and their interaction trajectories. This approach strengthens the robustness and adaptability of the portfolio strategy, improving its generalizability. Experiments demonstrate that the DC-PS model consistently outperforms traditional deep reinforcement learning models. Its effectiveness is evaluated using data from 2018 to 2020 and from 2020 to 2022 for all constituent stocks in the DJIA. The DC-PS model achieves state-of-the-art results, with a minimum increase of 15.7% (from 0.213 to 0.247) in accumulated returns in 2021 and 2023, underlining its generalizability in out-of-sample environments.
AB - Reinforcement learning is gaining increasing popularity in portfolio management. However, in complex stock trading environments, agent-based algorithms often face challenges such as slow convergence and inadequate cooperation between agents. These challenges lead to learning inefficiencies, increased risk, and higher transaction costs, ultimately reducing the generalizability of the trading strategy. To address these issues, we propose a novel multi-agent algorithm, the Dual Core Portfolio Strategy (DC-PS), which integrates deterministic and stochastic policies to capitalize on their complementary strengths. In this strategy, the Deep Deterministic Policy Gradient agent is proficient in deterministic policy learning, while the Soft Actor-Critic agent enhances exploration and generalization through a stochastic policy. The agents collaborate by making decisions and interacting with the environment, sharing a centralized critic network and their interaction trajectories. This approach strengthens the robustness and adaptability of the portfolio strategy, improving its generalizability. Experiments demonstrate that the DC-PS model consistently outperforms traditional deep reinforcement learning models. Its effectiveness is evaluated using data from 2018 to 2020 and from 2020 to 2022 for all constituent stocks in the DJIA. The DC-PS model achieves state-of-the-art results, with a minimum increase of 15.7% (from 0.213 to 0.247) in accumulated returns in 2021 and 2023, underlining its generalizability in out-of-sample environments.
KW - Centralized Critic Network
KW - Decision Support
KW - Deep Reinforcement Learning
KW - Economics
KW - Multi-Agent Algorithm
UR - http://www.scopus.com/inward/record.url?scp=105005725931&partnerID=8YFLogxK
U2 - 10.1145/3708360.3708384
DO - 10.1145/3708360.3708384
M3 - Conference Proceeding
AN - SCOPUS:105005725931
T3 - Proceedings of 2024 International Conference on Mathematics and Machine Learning, ICMML 2024
SP - 147
EP - 153
BT - Proceedings of 2024 International Conference on Mathematics and Machine Learning, ICMML 2024
PB - Association for Computing Machinery, Inc
Y2 - 8 November 2024 through 10 November 2024
ER -