TY - GEN
T1 - Quantization and Deployment Study of Classification Models for Embedded Platforms
AU - Huang, Zihan
AU - Jin, Jin
AU - Zhang, Chaolong
AU - Xu, Zhijie
AU - Xu, Yuanping
AU - Kong, Chao
AU - Wen, Qin
AU - Tang, Dan
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Deep learning models are widely applied across many domains. However, their large parameter counts, high storage requirements, and computational overhead make them difficult to deploy on resource-constrained embedded devices. This study addresses the issue by exploring techniques to optimize and deploy lightweight models on embedded devices. The approach first optimizes and adjusts the model, then applies model conversion, quantization, and quantization calibration to reduce model size and improve inference speed. Notably, the quantization calibration algorithm is improved to mitigate the accuracy loss caused by model quantization. Experimental results demonstrate that lightweight quantization significantly reduces model size, facilitating storage on embedded devices. Although accuracy drops slightly, inference speed improves substantially, enabling real-time human face recognition in video scenarios.
AB - Deep learning models are widely applied across many domains. However, their large parameter counts, high storage requirements, and computational overhead make them difficult to deploy on resource-constrained embedded devices. This study addresses the issue by exploring techniques to optimize and deploy lightweight models on embedded devices. The approach first optimizes and adjusts the model, then applies model conversion, quantization, and quantization calibration to reduce model size and improve inference speed. Notably, the quantization calibration algorithm is improved to mitigate the accuracy loss caused by model quantization. Experimental results demonstrate that lightweight quantization significantly reduces model size, facilitating storage on embedded devices. Although accuracy drops slightly, inference speed improves substantially, enabling real-time human face recognition in video scenarios.
KW - Deep learning
KW - Embedded devices
KW - Lightweight models
KW - Quantization
UR - http://www.scopus.com/inward/record.url?scp=85175575340&partnerID=8YFLogxK
U2 - 10.1109/ICAC57885.2023.10275155
DO - 10.1109/ICAC57885.2023.10275155
M3 - Conference Proceeding
AN - SCOPUS:85175575340
T3 - ICAC 2023 - 28th International Conference on Automation and Computing
BT - ICAC 2023 - 28th International Conference on Automation and Computing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 28th International Conference on Automation and Computing, ICAC 2023
Y2 - 30 August 2023 through 1 September 2023
ER -