DouRN: Improving DouZero by Residual Neural Networks

Yiquan Chen; Yingchao Lyu; Di Zhang

doi:10.1109/CyberC58899.2023.00026

DouRN: Improving DouZero by Residual Neural Networks

Yiquan Chen, Yingchao Lyu, Di Zhang^*

^*Corresponding author for this work

School of AI and Advanced Computing

Xi'an Jiaotong-Liverpool University

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

Deep reinforcement learning has made significant progress in games with imperfect information, but its performance in the card game Doudizhu (Chinese Poker/Fight the Landlord) remains unsatisfactory. Doudizhu is different from conventional games as it involves three players and combines elements of cooperation and confrontation, resulting in a large state and action space. In 2021, a Doudizhu program called DouZero [8] surpassed previous models without prior knowledge by utilizing traditional Monte Carlo methods and multilayer perceptrons. Building on this work, our study incorporates residual networks into the model, explores different architectural designs, and conducts multi-role testing. Our findings demonstrate that this model significantly improves the winning rate within the same training time. Additionally, we introduce a call scoring system to assist the agent in deciding whether to become a landlord. With these enhancements, our model consistently outperforms the existing version of DouZero and even experienced human players.¹¹The source code is available at https://github.com/Yingchaol/Douzero_Resnet.git.

Original language	English
Title of host publication	Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	96-99
Number of pages	4
ISBN (Electronic)	9798350308693
DOIs	https://doi.org/10.1109/CyberC58899.2023.00026
Publication status	Published - 2023
Event	15th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023 - Jiangsu, China Duration: 2 Nov 2023 → 4 Nov 2023

Publication series

Name	Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023

Conference

Conference	15th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023
Country/Territory	China
City	Jiangsu
Period	2/11/23 → 4/11/23

Keywords

DouDizhu
Monte Carlo Methods
Reinforcement Learning
Residual Neural Networks

Access to Document

10.1109/CyberC58899.2023.00026

Cite this

Chen, Y., Lyu, Y., & Zhang, D. (2023). DouRN: Improving DouZero by Residual Neural Networks. In Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023 (pp. 96-99). (Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CyberC58899.2023.00026

Chen, Yiquan ; Lyu, Yingchao ; Zhang, Di. / DouRN : Improving DouZero by Residual Neural Networks. Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023. Institute of Electrical and Electronics Engineers Inc., 2023. pp. 96-99 (Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023).

@inproceedings{2cba529bfb394e8d9ae799a9d8166457,

title = "DouRN: Improving DouZero by Residual Neural Networks",

abstract = "Deep reinforcement learning has made significant progress in games with imperfect information, but its performance in the card game Doudizhu (Chinese Poker/Fight the Landlord) remains unsatisfactory. Doudizhu is different from conventional games as it involves three players and combines elements of cooperation and confrontation, resulting in a large state and action space. In 2021, a Doudizhu program called DouZero [8] surpassed previous models without prior knowledge by utilizing traditional Monte Carlo methods and multilayer perceptrons. Building on this work, our study incorporates residual networks into the model, explores different architectural designs, and conducts multi-role testing. Our findings demonstrate that this model significantly improves the winning rate within the same training time. Additionally, we introduce a call scoring system to assist the agent in deciding whether to become a landlord. With these enhancements, our model consistently outperforms the existing version of DouZero and even experienced human players.11The source code is available at https://github.com/Yingchaol/Douzero_Resnet.git.",

keywords = "DouDizhu, Monte Carlo Methods, Reinforcement Learning, Residual Neural Networks",

author = "Yiquan Chen and Yingchao Lyu and Di Zhang",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 15th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023 ; Conference date: 02-11-2023 Through 04-11-2023",

year = "2023",

doi = "10.1109/CyberC58899.2023.00026",

language = "English",

series = "Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "96--99",

booktitle = "Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023",

}

Chen, Y, Lyu, Y & Zhang, D 2023, DouRN: Improving DouZero by Residual Neural Networks. in Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023. Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023, Institute of Electrical and Electronics Engineers Inc., pp. 96-99, 15th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023, Jiangsu, China, 2/11/23. https://doi.org/10.1109/CyberC58899.2023.00026

DouRN: Improving DouZero by Residual Neural Networks. / Chen, Yiquan; Lyu, Yingchao; Zhang, Di.
Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023. Institute of Electrical and Electronics Engineers Inc., 2023. p. 96-99 (Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - DouRN

T2 - 15th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023

AU - Chen, Yiquan

AU - Lyu, Yingchao

AU - Zhang, Di

PY - 2023

Y1 - 2023

N2 - Deep reinforcement learning has made significant progress in games with imperfect information, but its performance in the card game Doudizhu (Chinese Poker/Fight the Landlord) remains unsatisfactory. Doudizhu is different from conventional games as it involves three players and combines elements of cooperation and confrontation, resulting in a large state and action space. In 2021, a Doudizhu program called DouZero [8] surpassed previous models without prior knowledge by utilizing traditional Monte Carlo methods and multilayer perceptrons. Building on this work, our study incorporates residual networks into the model, explores different architectural designs, and conducts multi-role testing. Our findings demonstrate that this model significantly improves the winning rate within the same training time. Additionally, we introduce a call scoring system to assist the agent in deciding whether to become a landlord. With these enhancements, our model consistently outperforms the existing version of DouZero and even experienced human players.11The source code is available at https://github.com/Yingchaol/Douzero_Resnet.git.

AB - Deep reinforcement learning has made significant progress in games with imperfect information, but its performance in the card game Doudizhu (Chinese Poker/Fight the Landlord) remains unsatisfactory. Doudizhu is different from conventional games as it involves three players and combines elements of cooperation and confrontation, resulting in a large state and action space. In 2021, a Doudizhu program called DouZero [8] surpassed previous models without prior knowledge by utilizing traditional Monte Carlo methods and multilayer perceptrons. Building on this work, our study incorporates residual networks into the model, explores different architectural designs, and conducts multi-role testing. Our findings demonstrate that this model significantly improves the winning rate within the same training time. Additionally, we introduce a call scoring system to assist the agent in deciding whether to become a landlord. With these enhancements, our model consistently outperforms the existing version of DouZero and even experienced human players.11The source code is available at https://github.com/Yingchaol/Douzero_Resnet.git.

KW - DouDizhu

KW - Monte Carlo Methods

KW - Reinforcement Learning

KW - Residual Neural Networks

UR - http://www.scopus.com/inward/record.url?scp=85186768696&partnerID=8YFLogxK

U2 - 10.1109/CyberC58899.2023.00026

DO - 10.1109/CyberC58899.2023.00026

M3 - Conference Proceeding

AN - SCOPUS:85186768696

T3 - Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023

SP - 96

EP - 99

BT - Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 2 November 2023 through 4 November 2023

ER -

Chen Y, Lyu Y, Zhang D. DouRN: Improving DouZero by Residual Neural Networks. In Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023. Institute of Electrical and Electronics Engineers Inc. 2023. p. 96-99. (Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023). doi: 10.1109/CyberC58899.2023.00026

DouRN: Improving DouZero by Residual Neural Networks

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this