TY - JOUR
T1 - MACNS
T2 - A generic graph neural network integrated deep reinforcement learning based multi-agent collaborative navigation system for dynamic trajectory planning
AU - Xiao, Ziren
AU - Li, Peisong
AU - Liu, Chang
AU - Gao, Honghao
AU - Wang, Xinheng
N1 - Publisher Copyright:
© 2024
PY - 2024/5
Y1 - 2024/5
N2 - Multi-agent collaborative navigation is prevalent in modern transportation systems, including delivery logistics, warehouse automation, and personalised tourism, where multiple moving agents (e.g., robots and people) must meet at a common destination from different starting points. However, traditional methods face challenges in efficiently and accurately optimising routes for multiple moving agents (e.g., robots). These include dynamically adjusting the common destination in response to changing traffic conditions, encoding the real-world map, and the training speed. Therefore, we propose a generic Multi-Agent Collaborative Navigation System (MACNS) to address those challenges. First, we formalise the solution of the problem and challenges into a Markov Decision Process (MDP), which is further developed as a training environment where a Deep Reinforcement Learning (DRL) agent can learn patterns efficiently. Second, the proposed framework integrates a Graph Neural Network (GNN) into the policy network of Proximal Policy Optimisation for the homogeneous decision-making of each individual agent, showing excellent generalisation, extensibility and convergence speed. Finally, we demonstrate how MACNS can be applied and implemented in a real-world use case: tour groups gathering and picking up. Extensive simulations and real-world tests validate the effectiveness of the MACNS-based use case, showcasing its superiority over other state-of-the-art PPO-related methods.
AB - Multi-agent collaborative navigation is prevalent in modern transportation systems, including delivery logistics, warehouse automation, and personalised tourism, where multiple moving agents (e.g., robots and people) must meet at a common destination from different starting points. However, traditional methods face challenges in efficiently and accurately optimising routes for multiple moving agents (e.g., robots). These include dynamically adjusting the common destination in response to changing traffic conditions, encoding the real-world map, and the training speed. Therefore, we propose a generic Multi-Agent Collaborative Navigation System (MACNS) to address those challenges. First, we formalise the solution of the problem and challenges into a Markov Decision Process (MDP), which is further developed as a training environment where a Deep Reinforcement Learning (DRL) agent can learn patterns efficiently. Second, the proposed framework integrates a Graph Neural Network (GNN) into the policy network of Proximal Policy Optimisation for the homogeneous decision-making of each individual agent, showing excellent generalisation, extensibility and convergence speed. Finally, we demonstrate how MACNS can be applied and implemented in a real-world use case: tour groups gathering and picking up. Extensive simulations and real-world tests validate the effectiveness of the MACNS-based use case, showcasing its superiority over other state-of-the-art PPO-related methods.
KW - Graph neural network
KW - Graph proximal policy optimisation
KW - Multi-agent navigation
UR - http://www.scopus.com/inward/record.url?scp=85182601809&partnerID=8YFLogxK
U2 - 10.1016/j.inffus.2024.102250
DO - 10.1016/j.inffus.2024.102250
M3 - Article
AN - SCOPUS:85182601809
SN - 1566-2535
VL - 105
JO - Information Fusion
JF - Information Fusion
M1 - 102250
ER -