Abstract
Training an efficient learning procedure with multiagent reinforcement learning (MARL) becomes challenging as the number of agents increases, because the observation space expands exponentially, especially in large-scale multiagent systems. In this article, we proposed a scalable attentive transfer framework (SATF) for efficient MARL, which achieved goals faster and more accurately in homogeneous and heterogeneous combat tasks by transferring learned knowledge from a small number of agents (4) to a large number of agents (up to 64). To reduce and align the dimensionality of the observed state, which varies with the number of agents, the proposed SATF deployed a novel state representation network with a self-attention mechanism, the dynamic observation representation network (DorNet), to extract the dominant observed information cost-effectively. Experiments on the *MAgent* platform showed that the SATF outperformed distributed MARL methods (independent Q-learning (IQL) and A2C) on task sequences scaling from 8 to 64 agents. Experiments on *StarCraft II* showed that the SATF outperformed centralized-training-with-decentralized-execution MARL (QMIX), requiring fewer training steps and achieving a desired win rate of up to approximately 90% as the number of agents increased from 4 to 32. The findings of our study show great potential for enhancing the efficiency of MARL training in large-scale agent combat missions.
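To make the idea of a dimension-invariant observation encoder concrete, the following minimal PyTorch sketch (not the authors' implementation; the class and parameter names such as `DorNetSketch`, `obs_dim`, and `embed_dim` are assumptions) shows how self-attention pooling can map a variable number of per-agent observations to a fixed-size embedding, so a policy trained with a small team can accept inputs from a larger one.

```python
# Illustrative sketch (not the paper's code): self-attention pooling over a variable
# number of per-agent observations, producing a fixed-size state embedding so the
# downstream policy/value network keeps the same input dimension as the team grows.
import torch
import torch.nn as nn


class DorNetSketch(nn.Module):
    """Fixed-size representation of N agent observations via attention pooling."""

    def __init__(self, obs_dim: int, embed_dim: int = 64, num_heads: int = 4):
        super().__init__()
        self.proj = nn.Linear(obs_dim, embed_dim)                 # per-agent embedding
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.query = nn.Parameter(torch.randn(1, 1, embed_dim))   # learned pooling query

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        # obs: (batch, num_agents, obs_dim); num_agents may differ between tasks
        tokens = self.proj(obs)                                   # (batch, N, embed_dim)
        query = self.query.expand(obs.size(0), -1, -1)            # (batch, 1, embed_dim)
        pooled, _ = self.attn(query, tokens, tokens)              # attend over all agents
        return pooled.squeeze(1)                                  # (batch, embed_dim), N-independent


if __name__ == "__main__":
    enc = DorNetSketch(obs_dim=10)
    print(enc(torch.randn(2, 4, 10)).shape)    # torch.Size([2, 64]) with 4 agents
    print(enc(torch.randn(2, 64, 10)).shape)   # torch.Size([2, 64]) with 64 agents
```

Because the encoder output does not depend on the number of agents, weights learned in the small-team setting can, in principle, be reused directly when the team scales up, which is the kind of transfer the abstract describes.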
| Original language | English |
| --- | --- |
| Pages (from-to) | 1-15 |
| Number of pages | 15 |
| Journal | IEEE Transactions on Neural Networks and Learning Systems |
| DOIs | |
| Publication status | Accepted/In press - 2024 |
| Externally published | Yes |
Keywords
- Australia
- Knowledge transfer
- Multiagent reinforcement learning (MARL)
- observation representation
- Scalability
- Standards
- Task analysis
- Training
- training efficiency
- transfer learning