TY - GEN
T1 - Primitives generation policy learning without catastrophic forgetting for robotic manipulation
AU - Xiong, Fangzhou
AU - Liu, Zhiyong
AU - Huang, Kaizhu
AU - Yang, Xu
AU - Hussain, Amir
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - Catastrophic forgetting is a tough challenge when agent attempts to address different tasks sequentially without storing previous information, which gradually hinders the development of continual learning. Except for image classification tasks in continual learning, however, there are little reviews related to robotic manipulation. In this paper, we present a novel hierarchical architecture called Primitives Generation Policy Learning to enable continual learning. More specifically, a generative method by Variational Autoencoder is employed to generate state primitives from task space, then separate policy learning component is designed to learn torque control commands for different tasks sequentially. Furthermore, different task policies could be identified automatically by comparing reconstruction loss in the autoencoder. Experiment on robotic manipulation task shows that the proposed method exhibits substantially improved performance over some other continual learning methods.
AB - Catastrophic forgetting is a tough challenge when agent attempts to address different tasks sequentially without storing previous information, which gradually hinders the development of continual learning. Except for image classification tasks in continual learning, however, there are little reviews related to robotic manipulation. In this paper, we present a novel hierarchical architecture called Primitives Generation Policy Learning to enable continual learning. More specifically, a generative method by Variational Autoencoder is employed to generate state primitives from task space, then separate policy learning component is designed to learn torque control commands for different tasks sequentially. Furthermore, different task policies could be identified automatically by comparing reconstruction loss in the autoencoder. Experiment on robotic manipulation task shows that the proposed method exhibits substantially improved performance over some other continual learning methods.
KW - Catastrophic forgetting
KW - Continual learning
KW - Robotic manipulation
KW - Variational autoencoder
UR - http://www.scopus.com/inward/record.url?scp=85078787063&partnerID=8YFLogxK
U2 - 10.1109/ICDMW.2019.00130
DO - 10.1109/ICDMW.2019.00130
M3 - Conference Proceeding
AN - SCOPUS:85078787063
T3 - IEEE International Conference on Data Mining Workshops, ICDMW
SP - 890
EP - 897
BT - Proceedings - 19th IEEE International Conference on Data Mining Workshops, ICDMW 2019
A2 - Papapetrou, Panagiotis
A2 - Cheng, Xueqi
A2 - He, Qing
PB - IEEE Computer Society
T2 - 19th IEEE International Conference on Data Mining Workshops, ICDMW 2019
Y2 - 8 November 2019 through 11 November 2019
ER -