Convolutional fitted Q iteration for vision-based control problems

Dongbin Zhao; Yuanheng Zhu; Le Lv; Yaran Chen; Qichao Zhang

doi:10.1109/IJCNN.2016.7727794

Department of Intelligent Science

Chinese Academy of Sciences

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

6 Citations (Scopus)

Abstract

This paper proposes a deep reinforcement learning (DRL) method for control problems that take raw image pixels as input states. A convolutional neural network (CNN), termed Q-CNN, is used to approximate Q functions. The parameters of Q-CNN are initialized from a pretrained network that resulted from a classification challenge on a vast set of natural images. This initialization gives Q-CNN ready-made image-representation features, so training can concentrate on the control task. The weights are tuned with fitted Q iteration (FQI), an offline reinforcement learning method with a stable convergence property. To demonstrate performance, a modified Food-Poison problem is simulated, in which the agent determines its movements based on its forward view. The algorithm learns a satisfactory policy that performs better than the results of previous research.
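The fitted Q iteration loop mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the five-state chain environment, the constants, and every function name below are hypothetical, and the CNN regressor is replaced by a simple table that averages the regression targets for each state-action pair. Only the overall scheme follows FQI: collect a fixed batch of transitions, then repeatedly build targets r + gamma * max Q(s', a') and refit the approximator on them.

```python
from collections import defaultdict

GAMMA = 0.9
N_STATES = 5          # hypothetical chain: states 0..4, state 4 is terminal
ACTIONS = (-1, +1)    # move left or right


def step(state, action):
    """Toy deterministic dynamics (an assumption, not the Food-Poison task)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def collect_batch():
    """Offline data: one transition for every non-terminal (state, action)."""
    batch = []
    for s in range(N_STATES - 1):
        for a in ACTIONS:
            s2, r, done = step(s, a)
            batch.append((s, a, r, s2, done))
    return batch


def fit(samples):
    """Stand-in regressor: average the targets seen for each (state, action)."""
    sums, counts = defaultdict(float), defaultdict(int)
    for key, y in samples:
        sums[key] += y
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


def fitted_q_iteration(batch, n_iterations=50):
    """Repeatedly regress Q onto bootstrapped targets built from a fixed batch."""
    q = {}  # Q_0 is zero everywhere
    for _ in range(n_iterations):
        samples = []
        for s, a, r, s2, done in batch:
            # Terminal transitions do not bootstrap from the next state.
            best_next = 0.0 if done else max(q.get((s2, b), 0.0) for b in ACTIONS)
            samples.append(((s, a), r + GAMMA * best_next))
        q = fit(samples)
    return q


def greedy_action(q, state):
    """Pick the action with the highest fitted Q value in this state."""
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))


q = fitted_q_iteration(collect_batch())
policy = [greedy_action(q, s) for s in range(N_STATES - 1)]
print(policy)  # [1, 1, 1, 1]: always move right, toward the rewarding state
```

In a vision-based setting, the `fit` step would be replaced by training a network on image inputs; the surrounding loop would stay the same because FQI only requires a supervised regressor and a fixed batch of transitions.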

Original language	English
Title of host publication	2016 International Joint Conference on Neural Networks, IJCNN 2016
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	4539-4544
Number of pages	6
ISBN (Electronic)	9781509006199
DOIs	https://doi.org/10.1109/IJCNN.2016.7727794
Publication status	Published - 31 Oct 2016
Event	2016 International Joint Conference on Neural Networks, IJCNN 2016 - Vancouver, Canada; Duration: 24 Jul 2016 → 29 Jul 2016

Publication series

Name	Proceedings of the International Joint Conference on Neural Networks
Volume	2016-October

Conference

Conference	2016 International Joint Conference on Neural Networks, IJCNN 2016
Country/Territory	Canada
City	Vancouver
Period	24/07/16 → 29/07/16

Keywords

Convolutional neural network
Deep reinforcement learning
Fitted Q iteration
Vision-based control

Cite this

Zhao, D., Zhu, Y., Lv, L., Chen, Y., & Zhang, Q. (2016). Convolutional fitted Q iteration for vision-based control problems. In 2016 International Joint Conference on Neural Networks, IJCNN 2016 (pp. 4539-4544). Article 7727794 (Proceedings of the International Joint Conference on Neural Networks; Vol. 2016-October). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IJCNN.2016.7727794

@inproceedings{d5a32c40a93145e196ad35f1bc0a9a29,

title = "Convolutional fitted Q iteration for vision-based control problems",

abstract = "This paper proposes a deep reinforcement learning (DRL) method for control problems that take raw image pixels as input states. A convolutional neural network (CNN), termed Q-CNN, is used to approximate Q functions. The parameters of Q-CNN are initialized from a pretrained network that resulted from a classification challenge on a vast set of natural images. This initialization gives Q-CNN ready-made image-representation features, so training can concentrate on the control task. The weights are tuned with fitted Q iteration (FQI), an offline reinforcement learning method with a stable convergence property. To demonstrate performance, a modified Food-Poison problem is simulated, in which the agent determines its movements based on its forward view. The algorithm learns a satisfactory policy that performs better than the results of previous research.",

keywords = "Convolutional neural network, Deep reinforcement learning, Fitted Q iteration, Vision-based control",

author = "Dongbin Zhao and Yuanheng Zhu and Le Lv and Yaran Chen and Qichao Zhang",

note = "Publisher Copyright: {\textcopyright} 2016 IEEE.; 2016 International Joint Conference on Neural Networks, IJCNN 2016 ; Conference date: 24-07-2016 Through 29-07-2016",

year = "2016",

month = oct,

day = "31",

doi = "10.1109/IJCNN.2016.7727794",

language = "English",

series = "Proceedings of the International Joint Conference on Neural Networks",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "4539--4544",

booktitle = "2016 International Joint Conference on Neural Networks, IJCNN 2016",

}

Zhao, D, Zhu, Y, Lv, L, Chen, Y & Zhang, Q 2016, Convolutional fitted Q iteration for vision-based control problems. in 2016 International Joint Conference on Neural Networks, IJCNN 2016, 7727794, Proceedings of the International Joint Conference on Neural Networks, vol. 2016-October, Institute of Electrical and Electronics Engineers Inc., pp. 4539-4544, 2016 International Joint Conference on Neural Networks, IJCNN 2016, Vancouver, Canada, 24/07/16. https://doi.org/10.1109/IJCNN.2016.7727794

Convolutional fitted Q iteration for vision-based control problems. / Zhao, Dongbin; Zhu, Yuanheng; Lv, Le et al.
2016 International Joint Conference on Neural Networks, IJCNN 2016. Institute of Electrical and Electronics Engineers Inc., 2016. p. 4539-4544 7727794 (Proceedings of the International Joint Conference on Neural Networks; Vol. 2016-October).

TY - GEN

T1 - Convolutional fitted Q iteration for vision-based control problems

AU - Zhao, Dongbin

AU - Zhu, Yuanheng

AU - Lv, Le

AU - Chen, Yaran

AU - Zhang, Qichao

PY - 2016/10/31

Y1 - 2016/10/31

N2 - This paper proposes a deep reinforcement learning (DRL) method for control problems that take raw image pixels as input states. A convolutional neural network (CNN), termed Q-CNN, is used to approximate Q functions. The parameters of Q-CNN are initialized from a pretrained network that resulted from a classification challenge on a vast set of natural images. This initialization gives Q-CNN ready-made image-representation features, so training can concentrate on the control task. The weights are tuned with fitted Q iteration (FQI), an offline reinforcement learning method with a stable convergence property. To demonstrate performance, a modified Food-Poison problem is simulated, in which the agent determines its movements based on its forward view. The algorithm learns a satisfactory policy that performs better than the results of previous research.

KW - Convolutional neural network

KW - Deep reinforcement learning

KW - Fitted Q iteration

KW - Vision-based control

UR - http://www.scopus.com/inward/record.url?scp=85007275358&partnerID=8YFLogxK

U2 - 10.1109/IJCNN.2016.7727794

DO - 10.1109/IJCNN.2016.7727794

M3 - Conference Proceeding

AN - SCOPUS:85007275358

T3 - Proceedings of the International Joint Conference on Neural Networks

SP - 4539

EP - 4544

BT - 2016 International Joint Conference on Neural Networks, IJCNN 2016

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2016 International Joint Conference on Neural Networks, IJCNN 2016

Y2 - 24 July 2016 through 29 July 2016

ER -
