TY - GEN
T1 - Energy-Efficient UAV-Aided Computation Offloading on THz Band
T2 - 2024 IEEE Global Communications Conference, GLOBECOM 2024
AU - Li, Yuanjian
AU - Madhukumar, A. S.
AU - Ernest, Tan Zheng Hui
AU - Zheng, Gan
AU - Saad, Walid
AU - Aghvami, A. Hamid
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - In this paper, the problem of energy-efficient unmanned aerial vehicle (UAV)-assisted computation offloading over the Terahertz (THz) spectrum is investigated. In the studied system, several UAVs are deployed as edge servers to aid task executions for multiple energy-limited computation-scarce terrestrial user equipments (UEs). Then, an expected energy efficiency maximization problem is formulated, aiming to jointly optimize UAVs' trajectories, UEs' local central processing unit (CPU) clock speeds, UAV-UE associations, time slot slicing, and UEs' offloading powers. To tackle the considered multi-dimensional optimization problem, the duo-staggered perturbed actor-critic with modular networks (DSPAC-MN) solution in a multi-agent deep reinforcement learning (MADRL) setup, is proposed and tailored, after mapping the original problem into a stochastic (Markov) game. Compared to representative benchmarks in simulations, e.g., multi-agent deep deterministic policy gradient (MADDPG) and multi-agent twin-delayed DDPG (MATD3), the proposed DSPAC-MN can achieve the optimal performance of average energy efficiency, while ensuring 100% safe flights.
AB - In this paper, the problem of energy-efficient unmanned aerial vehicle (UAV)-assisted computation offloading over the Terahertz (THz) spectrum is investigated. In the studied system, several UAVs are deployed as edge servers to aid task executions for multiple energy-limited computation-scarce terrestrial user equipments (UEs). Then, an expected energy efficiency maximization problem is formulated, aiming to jointly optimize UAVs' trajectories, UEs' local central processing unit (CPU) clock speeds, UAV-UE associations, time slot slicing, and UEs' offloading powers. To tackle the considered multi-dimensional optimization problem, the duo-staggered perturbed actor-critic with modular networks (DSPAC-MN) solution in a multi-agent deep reinforcement learning (MADRL) setup, is proposed and tailored, after mapping the original problem into a stochastic (Markov) game. Compared to representative benchmarks in simulations, e.g., multi-agent deep deterministic policy gradient (MADDPG) and multi-agent twin-delayed DDPG (MATD3), the proposed DSPAC-MN can achieve the optimal performance of average energy efficiency, while ensuring 100% safe flights.
UR - http://www.scopus.com/inward/record.url?scp=105000826081&partnerID=8YFLogxK
U2 - 10.1109/GLOBECOM52923.2024.10901463
DO - 10.1109/GLOBECOM52923.2024.10901463
M3 - Conference Proceeding
AN - SCOPUS:105000826081
T3 - Proceedings - IEEE Global Communications Conference, GLOBECOM
SP - 1515
EP - 1520
BT - GLOBECOM 2024 - 2024 IEEE Global Communications Conference
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 8 December 2024 through 12 December 2024
ER -