Review of deep reinforcement learning and discussions on the development of computer Go

Dong Bin Zhao*, Kun Shao, Yuan Heng Zhu, Dong Li, Ya Ran Chen, Hai Tao Wang, De Rong Liu, Tong Zhou, Cheng Hong Wang

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

112 Citations (Scopus)

Abstract

Deep reinforcement learning which incorporates both the advantages of the perception of deep learning and the decision making of reinforcement learning is able to output control signal directly based on input images. This mechanism makes the artificial intelligence much close to human thinking modes. Deep reinforcement learning has achieved remarkable success in terms of theory and application since it is proposed. 'Chuyihao-AlphaGo', a computer Go developed by Google DeepMind, based on deep reinforcement learning, beat the world's top Go player Lee Sedol 4:1 in March 2016. This becomes a new milestone in artificial intelligence history. This paper surveys the development course of deep reinforcement learning, reviews the history of computer Go concurrently, analyzes the algorithms features, and discusses the research directions and application areas, in order to provide a valuable reference to the development of control theory and applications in a new direction.

Original languageEnglish
Pages (from-to)701-717
Number of pages17
JournalKongzhi Lilun Yu Yingyong/Control Theory and Applications
Volume33
Issue number6
DOIs
Publication statusPublished - 1 Jun 2016
Externally publishedYes

Keywords

  • AlphaGo
  • Artificial intelligence
  • Deep learning
  • Deep reinforcement learning
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Review of deep reinforcement learning and discussions on the development of computer Go'. Together they form a unique fingerprint.

Cite this