TY - GEN
T1 - Multiple-Pilot Collaboration for Advanced Remote Intervention using Reinforcement Learning
AU - Wang, Ziwei
AU - Bai, Weibang
AU - Chen, Zhang
AU - Xiao, Bo
AU - Liang, Bin
AU - Yeatman, Eric M.
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/10/13
Y1 - 2021/10/13
N2 - The traditional master-slave teleoperation relies on human expertise without correction mechanisms, resulting in excessive physical and mental workloads. To address these issues, a co-pilot-in-the-loop control framework is investigated for cooperative teleoperation. A deep deterministic policy gradient (DDPG) based agent is realised to effectively restore the master operators' intents without prior knowledge on time delay. The proposed framework allows for introducing an operator (i.e., copilot) to generate commands at the slave side, whose weights are optimally assigned online through DDPG-based arbitration, thereby enhancing the command robustness in the case of possible human operational errors. With the help of interval type-2 (IT2) Takagi-Sugeno (T-S) fuzzy identification, force feedback can be reconstructed at the master side without a sense of delay, thus ensuring the telepresence performance in the force-sensor-free scenarios. Two experimental applications validate the effectiveness of the proposed framework.
AB - The traditional master-slave teleoperation relies on human expertise without correction mechanisms, resulting in excessive physical and mental workloads. To address these issues, a co-pilot-in-the-loop control framework is investigated for cooperative teleoperation. A deep deterministic policy gradient (DDPG) based agent is realised to effectively restore the master operators' intents without prior knowledge on time delay. The proposed framework allows for introducing an operator (i.e., copilot) to generate commands at the slave side, whose weights are optimally assigned online through DDPG-based arbitration, thereby enhancing the command robustness in the case of possible human operational errors. With the help of interval type-2 (IT2) Takagi-Sugeno (T-S) fuzzy identification, force feedback can be reconstructed at the master side without a sense of delay, thus ensuring the telepresence performance in the force-sensor-free scenarios. Two experimental applications validate the effectiveness of the proposed framework.
KW - Collaborative teleoperation
KW - IT2 fuzzy system
KW - Kalman filter
KW - Reinforcement learning
KW - Time delay
UR - http://www.scopus.com/inward/record.url?scp=85119520730&partnerID=8YFLogxK
U2 - 10.1109/IECON48115.2021.9589570
DO - 10.1109/IECON48115.2021.9589570
M3 - Conference Proceeding
AN - SCOPUS:85119520730
T3 - IECON Proceedings (Industrial Electronics Conference)
BT - IECON 2021 - 47th Annual Conference of the IEEE Industrial Electronics Society
PB - IEEE Computer Society
T2 - 47th Annual Conference of the IEEE Industrial Electronics Society, IECON 2021
Y2 - 13 July 2007 through 16 October 2021
ER -