Quantization and Deployment Study of Classification Models for Embedded Platforms

Zihan Huang, Jin Jin, Chaolong Zhang, Zhijie Xu, Yuanping Xu, Chao Kong, Qin Wen, Dan Tang

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

Deep learning models find extensive applications across various domains. However, their large number of parameters, high storage requirements, and computational overhead pose challenges for deploying these models on resource-constrained embedded devices. This study focuses on addressing this issue by exploring techniques to optimize and deploy lightweight models on embedded devices. The approach involves optimization and adjustment of the model, followed by model conversion, quantization, and quantization calibration, aimed at reducing model size and improving inference speed. Notably, improvements are made to the quantization calibration algorithm to mitigate accuracy loss caused by model quantization. The experimental results demonstrate that light quantization significantly reduces model size, facilitating storage on embedded devices. Although there is a slight reduction in accuracy, the inference speed is substantially improved, enabling real-time human face recognition in video scenarios.

Original languageEnglish
Title of host publicationICAC 2023 - 28th International Conference on Automation and Computing
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350335859
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event28th International Conference on Automation and Computing, ICAC 2023 - Birmingham, United Kingdom
Duration: 30 Aug 20231 Sept 2023

Publication series

NameICAC 2023 - 28th International Conference on Automation and Computing

Conference

Conference28th International Conference on Automation and Computing, ICAC 2023
Country/TerritoryUnited Kingdom
CityBirmingham
Period30/08/231/09/23

Keywords

  • Deep learning
  • Embedded devices
  • Lightweight models
  • Quantization

Fingerprint

Dive into the research topics of 'Quantization and Deployment Study of Classification Models for Embedded Platforms'. Together they form a unique fingerprint.

Cite this