Energy-Efficient UAV-Driven Multi-Access Edge Computing: A Distributed Many-Agent Perspective

Yuanjian Li^*, A. S. Madhukumar, Tan Zheng Hui Ernest, Gan Zheng, Walid Saad, A. Hamid Aghvami

doi:10.1109/TCOMM.2025.3552746

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

This paper investigates energy-efficient unmanned aerial vehicle (UAV)-assisted multi-access task offloading. In the studied system, several UAVs are deployed as edge servers to cooperatively assist task execution for several energy-limited, computation-scarce terrestrial user equipments (UEs). An expected energy efficiency maximization problem is formulated to jointly optimize UAV trajectories, UE local central processing unit (CPU) clock speeds, UAV-UE associations, time slot slicing, and UE offloading powers. This optimization is subject to practical constraints, including UAV mobility, local computing capabilities, mixed-integer UAV-UE pairing indicators, time slot division, UE transmit power, UAV computational capacities, and information causality. To tackle this multi-dimensional optimization problem, the original problem is first mapped into a stochastic (Markov) game, and a duo-staggered perturbed actor-critic with modular networks (DSPAC-MN) solution is then proposed and tailored in a multi-agent deep reinforcement learning (MADRL) setup. Time complexity and communication overhead are analyzed, and convergence performance is discussed. Compared with representative benchmarks, e.g., multi-agent deep deterministic policy gradient (MADDPG) and multi-agent twin-delayed DDPG (MATD3), the proposed DSPAC-MN is shown to achieve the highest average energy efficiency while ensuring 100% safe flights.

Original language	English
Journal	IEEE Transactions on Communications
DOIs	https://doi.org/10.1109/TCOMM.2025.3552746
Publication status	Accepted/In press - Mar 2025
Externally published	Yes

Keywords

Multi-access edge computing (MEC)
Unmanned aerial vehicle (UAV)
Multi-agent deep reinforcement learning (MADRL)
Energy efficiency maximization
Path planning

Cite this

@article{741d68b643be4ee5bde3f4f787711a5c,

title = "Energy-Efficient UAV-Driven Multi-Access Edge Computing: A Distributed Many-Agent Perspective",

abstract = "In this paper, the problem of energy-efficient unmanned aerial vehicle (UAV)-assisted multi-access task offloading is investigated. In the studied system, several UAVs are deployed as edge servers to cooperatively aid task executions for several energy-limited computation-scarce terrestrial user equipments (UEs). An expected energy efficiency maximization problem is then formulated to jointly optimize UAV trajectories, UE local central processing unit (CPU) clock speeds, UAV-UE associations, time slot slicing, and UE offloading powers. This optimization is subject to practical constraints, including UAV mobility, local computing capabilities, mixed-integer UAV-UE pairing indicators, time slot division, UE transmit power, UAV computational capacities, and information causality. To tackle the multi-dimensional optimization problem under consideration, the duo-staggered perturbed actor-critic with modular networks (DSPAC-MN) solution in a multi-agent deep reinforcement learning (MADRL) setup, is proposed and tailored, after mapping the original problem into a stochastic (Markov) game. Time complexity and communication overhead are analyzed, while convergence performance is discussed. Compared to representative benchmarks, e.g., multi-agent deep deterministic policy gradient (MADDPG) and multi-agent twin-delayed DDPG (MATD3), the proposed DSPAC-MN is validated to be able to achieve the optimal performance of average energy efficiency, while ensuring 100% safe flights.",

keywords = "Multi-access edge computing (MEC), Unmanned aerial vehicle (UAV), Multi-agent deep reinforcement learning (MADRL), Energy efficiency maximization, Path planning",

author = "Yuanjian Li and Madhukumar, {A. S.} and Ernest, {Tan Zheng Hui} and Gan Zheng and Walid Saad and Aghvami, {A. Hamid}",

year = "2025",

month = mar,

doi = "10.1109/TCOMM.2025.3552746",

language = "English",

journal = "IEEE Transactions on Communications",

issn = "0090-6778",

}

TY - JOUR

T1 - Energy-Efficient UAV-Driven Multi-Access Edge Computing

T2 - A Distributed Many-Agent Perspective

AU - Li, Yuanjian

AU - Madhukumar, A. S.

AU - Ernest, Tan Zheng Hui

AU - Zheng, Gan

AU - Saad, Walid

AU - Aghvami, A. Hamid

PY - 2025/3

Y1 - 2025/3

N2 - In this paper, the problem of energy-efficient unmanned aerial vehicle (UAV)-assisted multi-access task offloading is investigated. In the studied system, several UAVs are deployed as edge servers to cooperatively aid task executions for several energy-limited computation-scarce terrestrial user equipments (UEs). An expected energy efficiency maximization problem is then formulated to jointly optimize UAV trajectories, UE local central processing unit (CPU) clock speeds, UAV-UE associations, time slot slicing, and UE offloading powers. This optimization is subject to practical constraints, including UAV mobility, local computing capabilities, mixed-integer UAV-UE pairing indicators, time slot division, UE transmit power, UAV computational capacities, and information causality. To tackle the multi-dimensional optimization problem under consideration, the duo-staggered perturbed actor-critic with modular networks (DSPAC-MN) solution in a multi-agent deep reinforcement learning (MADRL) setup, is proposed and tailored, after mapping the original problem into a stochastic (Markov) game. Time complexity and communication overhead are analyzed, while convergence performance is discussed. Compared to representative benchmarks, e.g., multi-agent deep deterministic policy gradient (MADDPG) and multi-agent twin-delayed DDPG (MATD3), the proposed DSPAC-MN is validated to be able to achieve the optimal performance of average energy efficiency, while ensuring 100% safe flights.

KW - Multi-access edge computing (MEC)

KW - Unmanned aerial vehicle (UAV)

KW - Multi-agent deep reinforcement learning (MADRL)

KW - Energy efficiency maximization

KW - Path planning

U2 - 10.1109/TCOMM.2025.3552746

DO - 10.1109/TCOMM.2025.3552746

M3 - Article

SN - 0090-6778

JO - IEEE Transactions on Communications

JF - IEEE Transactions on Communications

ER -
