Joint Optimization of DNN Inference Delay and Energy under Accuracy Constraints for AR Applications

Guangjin Pan, Heng Zhang, Shugong Xu*, Shunqing Zhang, Xiaojing Chen

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

3 Citations (Scopus)

Abstract

The high computational complexity and high energy consumption of artificial intelligence (AI) algorithms hinder their application in augmented reality (AR) systems. This paper considers the scene of completing video-based AI inference tasks in the mobile edge computing (MEC) system. We use multiply-and-accumulate operations (MACs) for problem analysis and optimize delay and energy consumption under accuracy constraints. To solve this problem, we first assume that offloading policy is known and decouple the problem into two subproblems. After solving these two subproblems, we propose an iterative-based scheduling algorithm to obtain the optimal offloading policy. We also experimentally discuss the relationship between delay, energy consumption, and inference accuracy.

Original languageEnglish
Pages (from-to)2230-2235
Number of pages6
JournalProceedings - IEEE Global Communications Conference, GLOBECOM
DOIs
Publication statusPublished - 2022
Externally publishedYes
Event2022 IEEE Global Communications Conference, GLOBECOM 2022 - Virtual, Online, Brazil
Duration: 4 Dec 20228 Dec 2022

Keywords

  • edge intelligence
  • Mobile augmented reality
  • mobile edge computing
  • resource allocation

Fingerprint

Dive into the research topics of 'Joint Optimization of DNN Inference Delay and Energy under Accuracy Constraints for AR Applications'. Together they form a unique fingerprint.

Cite this