TY - JOUR
T1 - Augmented Deep Reinforcement Learning for Online Energy Minimization of Wireless Powered Mobile Edge Computing
AU - Chen, Xiaojing
AU - Dai, Weiheng
AU - Ni, Wei
AU - Wang, Xin
AU - Zhang, Shunqing
AU - Xu, Shugong
AU - Sun, Yanzan
N1 - Publisher Copyright:
© 1972-2012 IEEE.
PY - 2023/5/1
Y1 - 2023/5/1
N2 - Mobile edge computing (MEC) offers an opportunity for devices relying on wireless power transfer (WPT) to accomplish computationally demanding tasks. Such WPT-powered MEC systems have yet to be optimized for long-term efficiency, due to the random and changing task demands and wireless channel states of the devices. This paper presents an augmented two-stage deep Q-network (DQN), referred to as 'TS-DQN,' for online optimization of WPT-powered MEC systems, where the WPT, offloading schedule, channel allocation, and CPU configurations of the edge server and devices are jointly optimized to minimize the long-term average energy requirement of the systems. The key idea is to design a DQN for learning the channel allocation and task admission, while the WPT, offloading time, and CPU configurations are efficiently optimized to precisely evaluate the reward of the DQN and substantially reduce its action space. Another important aspect is that a new action generation method is developed to expand and diversify the actions of the DQN, further accelerating its convergence. As validated by simulations, the proposed TS-DQN is much more energy-efficient and converges much faster than its potential alternative that directly uses the state-of-the-art Deep Deterministic Policy Gradient (DDPG) algorithm to learn all decision variables.
AB - Mobile edge computing (MEC) offers an opportunity for devices relying on wireless power transfer (WPT) to accomplish computationally demanding tasks. Such WPT-powered MEC systems have yet to be optimized for long-term efficiency, due to the random and changing task demands and wireless channel states of the devices. This paper presents an augmented two-stage deep Q-network (DQN), referred to as 'TS-DQN,' for online optimization of WPT-powered MEC systems, where the WPT, offloading schedule, channel allocation, and CPU configurations of the edge server and devices are jointly optimized to minimize the long-term average energy requirement of the systems. The key idea is to design a DQN for learning the channel allocation and task admission, while the WPT, offloading time, and CPU configurations are efficiently optimized to precisely evaluate the reward of the DQN and substantially reduce its action space. Another important aspect is that a new action generation method is developed to expand and diversify the actions of the DQN, further accelerating its convergence. As validated by simulations, the proposed TS-DQN is much more energy-efficient and converges much faster than its potential alternative that directly uses the state-of-the-art Deep Deterministic Policy Gradient (DDPG) algorithm to learn all decision variables.
KW - convex optimization
KW - deep Q-network
KW - energy-efficient
KW - mobile edge computing
KW - resource allocation
KW - wireless power transfer
UR - http://www.scopus.com/inward/record.url?scp=85149376617&partnerID=8YFLogxK
U2 - 10.1109/TCOMM.2023.3251353
DO - 10.1109/TCOMM.2023.3251353
M3 - Article
AN - SCOPUS:85149376617
SN - 0090-6778
VL - 71
SP - 2698
EP - 2710
JO - IEEE Transactions on Communications
JF - IEEE Transactions on Communications
IS - 5
ER -