Abstract
In this paper, the problem of energy-efficient unmanned aerial vehicle (UAV)-assisted computation offloading over the Terahertz (THz) spectrum is investigated. In the studied system, several UAVs are deployed as edge servers to aid task execution for multiple energy-limited, computation-scarce terrestrial user equipments (UEs). An expected energy-efficiency maximization problem is then formulated to jointly optimize the UAVs’ trajectories, the UEs’ local central processing unit (CPU) clock speeds, UAV-UE associations, time slot slicing, and the UEs’ offloading powers. To tackle this multi-dimensional optimization problem, the original problem is first mapped into a stochastic (Markov) game, and a duo-staggered perturbed actor-critic with modular networks (DSPAC-MN) solution is proposed and tailored within a multi-agent deep reinforcement learning (MADRL) framework. In simulations, compared with representative benchmarks such as multi-agent deep deterministic policy gradient (MADDPG) and multi-agent twin-delayed DDPG (MATD3), the proposed DSPAC-MN achieves the best average energy-efficiency performance while ensuring 100% safe flights.
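The paper's exact formulation is not reproduced here, but the expected energy-efficiency objective described above can be sketched in a generic bits-per-Joule form; the symbols below (number of UEs $K$, time slots $T$, processed bits $D_k[t]$, and local/offloading energy terms) are illustrative assumptions rather than the authors' notation:

```latex
% Illustrative sketch only: symbols are assumed, not the paper's notation.
% q: UAV trajectories, f: local CPU clock speeds, a: UAV-UE associations,
% tau: time slot slicing, p: UE offloading powers.
\max_{\{\mathbf{q},\, f,\, a,\, \tau,\, p\}} \;
  \mathbb{E}\!\left[
    \frac{\sum_{t=1}^{T} \sum_{k=1}^{K} D_k[t]}
         {\sum_{t=1}^{T} \sum_{k=1}^{K}
           \bigl(E_k^{\mathrm{loc}}[t] + E_k^{\mathrm{off}}[t]\bigr)}
  \right]
\quad \text{s.t. trajectory, association, power, and latency constraints.}
```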
| Original language | English |
|---|---|
| Title of host publication | IEEE Conference on Global Communications (GLOBECOM) |
| Publisher | IEEE |
| Publication status | Published - Dec 2024 |
| Externally published | Yes |
Keywords
- Terahertz communications
- multi-access edge computing (MEC)
- unmanned aerial vehicle (UAV)
- deep reinforcement learning (DRL)
- energy efficiency
- trajectory optimization
- multi-agent deep reinforcement learning (MADRL)
- computation offloading