TY - JOUR
T1 - Energy-Efficient UAV-Driven Multi-Access Edge Computing
T2 - A Distributed Many-Agent Perspective
AU - Li, Yuanjian
AU - Madhukumar, A. S.
AU - Ernest, Tan Zheng Hui
AU - Zheng, Gan
AU - Saad, Walid
AU - Hamid Aghvami, A.
N1 - Publisher Copyright:
© 1972-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - In this paper, the problem of energy-efficient unmanned aerial vehicle (UAV)-assisted multi-access task offloading is investigated. In the studied system, several UAVs are deployed as edge servers to cooperatively aid task execution for several energy-limited, computation-scarce terrestrial user equipments (UEs). An expected energy efficiency maximization problem is then formulated to jointly optimize UAV trajectories, UE local central processing unit (CPU) clock speeds, UAV-UE associations, time slot slicing, and UE offloading powers. This optimization is subject to practical constraints, including UAV mobility, local computing capabilities, mixed-integer UAV-UE pairing indicators, time slot division, UE transmit power, UAV computational capacities, and information causality. To tackle the multi-dimensional optimization problem under consideration, a duo-staggered perturbed actor-critic with modular networks (DSPAC-MN) solution is proposed and tailored within a multi-agent deep reinforcement learning (MADRL) framework, after mapping the original problem into a stochastic (Markov) game. Time complexity and communication overhead are analyzed, and convergence performance is discussed. Compared with representative benchmarks, e.g., multi-agent deep deterministic policy gradient (MADDPG) and multi-agent twin-delayed DDPG (MATD3), the proposed DSPAC-MN is validated to achieve optimal average energy-efficiency performance while ensuring 100% safe flights.
AB - In this paper, the problem of energy-efficient unmanned aerial vehicle (UAV)-assisted multi-access task offloading is investigated. In the studied system, several UAVs are deployed as edge servers to cooperatively aid task execution for several energy-limited, computation-scarce terrestrial user equipments (UEs). An expected energy efficiency maximization problem is then formulated to jointly optimize UAV trajectories, UE local central processing unit (CPU) clock speeds, UAV-UE associations, time slot slicing, and UE offloading powers. This optimization is subject to practical constraints, including UAV mobility, local computing capabilities, mixed-integer UAV-UE pairing indicators, time slot division, UE transmit power, UAV computational capacities, and information causality. To tackle the multi-dimensional optimization problem under consideration, a duo-staggered perturbed actor-critic with modular networks (DSPAC-MN) solution is proposed and tailored within a multi-agent deep reinforcement learning (MADRL) framework, after mapping the original problem into a stochastic (Markov) game. Time complexity and communication overhead are analyzed, and convergence performance is discussed. Compared with representative benchmarks, e.g., multi-agent deep deterministic policy gradient (MADDPG) and multi-agent twin-delayed DDPG (MATD3), the proposed DSPAC-MN is validated to achieve optimal average energy-efficiency performance while ensuring 100% safe flights.
KW - energy efficiency maximization
KW - Multi-access edge computing (MEC)
KW - multi-agent deep reinforcement learning (MADRL)
KW - path planning
KW - unmanned aerial vehicle (UAV)
UR - http://www.scopus.com/inward/record.url?scp=105000871413&partnerID=8YFLogxK
U2 - 10.1109/TCOMM.2025.3552746
DO - 10.1109/TCOMM.2025.3552746
M3 - Article
AN - SCOPUS:105000871413
SN - 0090-6778
JO - IEEE Transactions on Communications
JF - IEEE Transactions on Communications
ER -