Sampled-data control through model-free reinforcement learning with effective experience replay

Bo Xiao; H. K. Lam; Xiaojie Su; Ziwei Wang; Frank P.W. Lo; Shihong Chen; Eric Yeatman

doi:10.1016/j.jai.2023.100018

Sampled-data control through model-free reinforcement learning with effective experience replay

Bo Xiao^*, H. K. Lam, Xiaojie Su, Ziwei Wang, Frank P.W. Lo, Shihong Chen, Eric Yeatman

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

20 Citations (Scopus)

Abstract

Reinforcement Learning (RL) based control algorithms can learn the control strategies for nonlinear and uncertain environment during interacting with it. Guided by the rewards generated by environment, a RL agent can learn the control strategy directly in a model-free way instead of investigating the dynamic model of the environment. In the paper, we propose the sampled-data RL control strategy to reduce the computational demand. In the sampled-data control strategy, the whole control system is of a hybrid structure, in which the plant is of continuous structure while the controller (RL agent) adopts a discrete structure. Given that the continuous states of the plant will be the input of the agent, the state–action value function is approximated by the fully connected feed-forward neural networks (FCFFNN). Instead of learning the controller at every step during the interaction with the environment, the learning and acting stages are decoupled to learn the control strategy more effectively through experience replay. In the acting stage, the most effective experience obtained during the interaction with the environment will be stored and during the learning stage, the stored experience will be replayed to customized times, which helps enhance the experience replay process. The effectiveness of proposed approach will be verified by simulation examples.

Original language	English
Pages (from-to)	20-30
Number of pages	11
Journal	Journal of Automation and Intelligence
Volume	2
Issue number	1
DOIs	https://doi.org/10.1016/j.jai.2023.100018
Publication status	Published - Feb 2023
Externally published	Yes

Keywords

Effective experience replay
Model-free
Neural networks
Reinforcement learning
Sampled-data control

Access to Document

10.1016/j.jai.2023.100018

Cite this

@article{db9bdb64ab514f61b1e8da9c3e7f3bdd,

title = "Sampled-data control through model-free reinforcement learning with effective experience replay",

abstract = "Reinforcement Learning (RL) based control algorithms can learn the control strategies for nonlinear and uncertain environment during interacting with it. Guided by the rewards generated by environment, a RL agent can learn the control strategy directly in a model-free way instead of investigating the dynamic model of the environment. In the paper, we propose the sampled-data RL control strategy to reduce the computational demand. In the sampled-data control strategy, the whole control system is of a hybrid structure, in which the plant is of continuous structure while the controller (RL agent) adopts a discrete structure. Given that the continuous states of the plant will be the input of the agent, the state–action value function is approximated by the fully connected feed-forward neural networks (FCFFNN). Instead of learning the controller at every step during the interaction with the environment, the learning and acting stages are decoupled to learn the control strategy more effectively through experience replay. In the acting stage, the most effective experience obtained during the interaction with the environment will be stored and during the learning stage, the stored experience will be replayed to customized times, which helps enhance the experience replay process. The effectiveness of proposed approach will be verified by simulation examples.",

keywords = "Effective experience replay, Model-free, Neural networks, Reinforcement learning, Sampled-data control",

author = "Bo Xiao and Lam, {H. K.} and Xiaojie Su and Ziwei Wang and Lo, {Frank P.W.} and Shihong Chen and Eric Yeatman",

note = "Publisher Copyright: {\textcopyright} 2023 The Authors",

year = "2023",

month = feb,

doi = "10.1016/j.jai.2023.100018",

language = "English",

volume = "2",

pages = "20--30",

journal = "Journal of Automation and Intelligence",

issn = "2949-8554",

number = "1",

}

TY - JOUR

T1 - Sampled-data control through model-free reinforcement learning with effective experience replay

AU - Xiao, Bo

AU - Lam, H. K.

AU - Su, Xiaojie

AU - Wang, Ziwei

AU - Lo, Frank P.W.

AU - Chen, Shihong

AU - Yeatman, Eric

PY - 2023/2

Y1 - 2023/2

N2 - Reinforcement Learning (RL) based control algorithms can learn the control strategies for nonlinear and uncertain environment during interacting with it. Guided by the rewards generated by environment, a RL agent can learn the control strategy directly in a model-free way instead of investigating the dynamic model of the environment. In the paper, we propose the sampled-data RL control strategy to reduce the computational demand. In the sampled-data control strategy, the whole control system is of a hybrid structure, in which the plant is of continuous structure while the controller (RL agent) adopts a discrete structure. Given that the continuous states of the plant will be the input of the agent, the state–action value function is approximated by the fully connected feed-forward neural networks (FCFFNN). Instead of learning the controller at every step during the interaction with the environment, the learning and acting stages are decoupled to learn the control strategy more effectively through experience replay. In the acting stage, the most effective experience obtained during the interaction with the environment will be stored and during the learning stage, the stored experience will be replayed to customized times, which helps enhance the experience replay process. The effectiveness of proposed approach will be verified by simulation examples.

AB - Reinforcement Learning (RL) based control algorithms can learn the control strategies for nonlinear and uncertain environment during interacting with it. Guided by the rewards generated by environment, a RL agent can learn the control strategy directly in a model-free way instead of investigating the dynamic model of the environment. In the paper, we propose the sampled-data RL control strategy to reduce the computational demand. In the sampled-data control strategy, the whole control system is of a hybrid structure, in which the plant is of continuous structure while the controller (RL agent) adopts a discrete structure. Given that the continuous states of the plant will be the input of the agent, the state–action value function is approximated by the fully connected feed-forward neural networks (FCFFNN). Instead of learning the controller at every step during the interaction with the environment, the learning and acting stages are decoupled to learn the control strategy more effectively through experience replay. In the acting stage, the most effective experience obtained during the interaction with the environment will be stored and during the learning stage, the stored experience will be replayed to customized times, which helps enhance the experience replay process. The effectiveness of proposed approach will be verified by simulation examples.

KW - Effective experience replay

KW - Model-free

KW - Neural networks

KW - Reinforcement learning

KW - Sampled-data control

UR - http://www.scopus.com/inward/record.url?scp=85174418292&partnerID=8YFLogxK

U2 - 10.1016/j.jai.2023.100018

DO - 10.1016/j.jai.2023.100018

M3 - Article

AN - SCOPUS:85174418292

SN - 2949-8554

VL - 2

SP - 20

EP - 30

JO - Journal of Automation and Intelligence

JF - Journal of Automation and Intelligence

IS - 1

ER -

Sampled-data control through model-free reinforcement learning with effective experience replay

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this