TY - GEN
T1 - SKILL-IL
T2 - 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022
AU - Xihan, Bian
AU - Mendez, Oscar
AU - Hadfield, Simon
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - In this work, we introduce a new perspective for learning transferable content in multi-task imitation learning. Humans are capable of transferring skills and knowledge. If we can cycle to work and drive to the store, we can also cycle to the store and drive to work. We take inspiration from this and hypothesize the latent memory of a policy network can be disentangled into two partitions. These contain either the knowledge of the environmental context for the task or the generalisable skill needed to solve the task. This allows an improved training efficiency and better generalization over previously unseen combinations of skills in the same environment, and the same task in unseen environments. We used the proposed approach to train a disentangled agent for two different multi-task IL environments. In both cases, we out-performed the SOTA by 30% in task success rate. We also demonstrated this for navigation on a real robot.
AB - In this work, we introduce a new perspective for learning transferable content in multi-task imitation learning. Humans are capable of transferring skills and knowledge. If we can cycle to work and drive to the store, we can also cycle to the store and drive to work. We take inspiration from this and hypothesize the latent memory of a policy network can be disentangled into two partitions. These contain either the knowledge of the environmental context for the task or the generalisable skill needed to solve the task. This allows an improved training efficiency and better generalization over previously unseen combinations of skills in the same environment, and the same task in unseen environments. We used the proposed approach to train a disentangled agent for two different multi-task IL environments. In both cases, we out-performed the SOTA by 30% in task success rate. We also demonstrated this for navigation on a real robot.
UR - http://www.scopus.com/inward/record.url?scp=85146313800&partnerID=8YFLogxK
U2 - 10.1109/IROS47612.2022.9981375
DO - 10.1109/IROS47612.2022.9981375
M3 - Conference Proceeding
AN - SCOPUS:85146313800
T3 - IEEE International Conference on Intelligent Robots and Systems
SP - 7060
EP - 7065
BT - IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 23 October 2022 through 27 October 2022
ER -