Primitives generation policy learning without catastrophic forgetting for robotic manipulation

Fangzhou Xiong, Zhiyong Liu*, Kaizhu Huang, Xu Yang, Amir Hussain

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

2 Citations (Scopus)

Abstract

Catastrophic forgetting is a tough challenge when agent attempts to address different tasks sequentially without storing previous information, which gradually hinders the development of continual learning. Except for image classification tasks in continual learning, however, there are little reviews related to robotic manipulation. In this paper, we present a novel hierarchical architecture called Primitives Generation Policy Learning to enable continual learning. More specifically, a generative method by Variational Autoencoder is employed to generate state primitives from task space, then separate policy learning component is designed to learn torque control commands for different tasks sequentially. Furthermore, different task policies could be identified automatically by comparing reconstruction loss in the autoencoder. Experiment on robotic manipulation task shows that the proposed method exhibits substantially improved performance over some other continual learning methods.

Original languageEnglish
Title of host publicationProceedings - 19th IEEE International Conference on Data Mining Workshops, ICDMW 2019
EditorsPanagiotis Papapetrou, Xueqi Cheng, Qing He
PublisherIEEE Computer Society
Pages890-897
Number of pages8
ISBN (Electronic)9781728146034
DOIs
Publication statusPublished - Nov 2019
Event19th IEEE International Conference on Data Mining Workshops, ICDMW 2019 - Beijing, China
Duration: 8 Nov 201911 Nov 2019

Publication series

NameIEEE International Conference on Data Mining Workshops, ICDMW
Volume2019-November
ISSN (Print)2375-9232
ISSN (Electronic)2375-9259

Conference

Conference19th IEEE International Conference on Data Mining Workshops, ICDMW 2019
Country/TerritoryChina
CityBeijing
Period8/11/1911/11/19

Keywords

  • Catastrophic forgetting
  • Continual learning
  • Robotic manipulation
  • Variational autoencoder

Fingerprint

Dive into the research topics of 'Primitives generation policy learning without catastrophic forgetting for robotic manipulation'. Together they form a unique fingerprint.

Cite this