TY - JOUR
T1 - Energy Optimization of Multi-Task DNN Inference in MEC-Assisted XR Devices
T2 - A Lyapunov-Guided Reinforcement Learning Approach
AU - Sun, Yanzan
AU - Qiu, Jiacheng
AU - Pan, Guangjin
AU - Xu, Shugong
AU - Zhang, Shunqing
AU - Wang, Xiaoyun
AU - Han, Shuangfeng
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Extended Reality (XR), blending virtual and real worlds, is a key application of future networks. While AI advancements enhance XR capabilities, they also impose significant computational and energy challenges on lightweight XR devices. In this paper, we developed a distributed queue model for multi-task DNN inference, addressing issues of resource competition and queue coupling. In response to the challenges posed by the high energy consumption and limited resources of XR devices, we designed a dual time-scale joint optimization strategy for model partitioning and resource allocation, formulated as a bi-level optimization problem. This strategy aims to minimize the total energy consumption of XR devices while ensuring queue stability and adhering to computational and communication resource constraints. To tackle this problem, we devised a Lyapunov-guided Proximal Policy Optimization algorithm, named LyaPPO. Through numerical results, we show that our LyaPPO algorithm outperforms the baseline algorithms. Specifically, under different maximum local computational capacities, the proposed algorithm decreases 24.29%-56.62% energy compared to the suboptimal baselines.
AB - Extended Reality (XR), blending virtual and real worlds, is a key application of future networks. While AI advancements enhance XR capabilities, they also impose significant computational and energy challenges on lightweight XR devices. In this paper, we developed a distributed queue model for multi-task DNN inference, addressing issues of resource competition and queue coupling. In response to the challenges posed by the high energy consumption and limited resources of XR devices, we designed a dual time-scale joint optimization strategy for model partitioning and resource allocation, formulated as a bi-level optimization problem. This strategy aims to minimize the total energy consumption of XR devices while ensuring queue stability and adhering to computational and communication resource constraints. To tackle this problem, we devised a Lyapunov-guided Proximal Policy Optimization algorithm, named LyaPPO. Through numerical results, we show that our LyaPPO algorithm outperforms the baseline algorithms. Specifically, under different maximum local computational capacities, the proposed algorithm decreases 24.29%-56.62% energy compared to the suboptimal baselines.
KW - collaborative inference
KW - deep reinforcement learning
KW - DNN partitioning
KW - Edge intelligence
KW - energy efficiency
KW - resource allocation
UR - http://www.scopus.com/inward/record.url?scp=85216858519&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2025.3536879
DO - 10.1109/JIOT.2025.3536879
M3 - Article
AN - SCOPUS:85216858519
SN - 2327-4662
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
ER -