Configurable CNN Accelerator in Speech Processing based on Vector Convolution

Lanqing Hui; Shan Cao; Zhiyong Chen; Shan Li; Shugong Xu

doi:10.1109/AICAS54282.2022.9869904

Configurable CNN Accelerator in Speech Processing based on Vector Convolution

Lanqing Hui, Shan Cao^*, Zhiyong Chen, Shan Li, Shugong Xu

^*Corresponding author for this work

Shanghai University

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

6 Citations (Scopus)

Abstract

In speech applications, both input feature maps (IFMs) and kernels of neural networks are greatly diverse in shapes and sizes, which poses significant challenges to hardware acceleration. In this paper, a configurable CNN accelerator is introduced to make a good balance between the flexibility and efficiency for various neural network models in speech processing. The vector convolution scheme is first proposed by re-arrangement of IFM rows and weight values in vectors, by which the element convolution is converted into vector operations to break the limit of kernel-centric processing. The structure of vector processing element (VPE) is introduced to fit the continuous scaling down of IFMs with little control overheads, and the architecture of the CNN accelerator is proposed accordingly. FPGA implementation results demonstrate that the throughput is increased by 86% by the proposed architecture compared to state-of-the-art FPGA accelerators for the VGG16 network, while high DSP utilization is guaranteed for both 1D and 2D CNNs with various input sizes.

Original language	English
Title of host publication	Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	146-149
Number of pages	4
ISBN (Electronic)	9781665409964
DOIs	https://doi.org/10.1109/AICAS54282.2022.9869904
Publication status	Published - 2022
Externally published	Yes
Event	4th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022 - Incheon, Korea, Republic of Duration: 13 Jun 2022 → 15 Jun 2022

Publication series

Name	Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022

Conference

Conference	4th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022
Country/Territory	Korea, Republic of
City	Incheon
Period	13/06/22 → 15/06/22

Keywords

Accelerator
CNN
FPGA implementation
speech processing

Access to Document

10.1109/AICAS54282.2022.9869904

Cite this

Hui, L., Cao, S., Chen, Z., Li, S., & Xu, S. (2022). Configurable CNN Accelerator in Speech Processing based on Vector Convolution. In Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022 (pp. 146-149). (Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/AICAS54282.2022.9869904

Hui, Lanqing ; Cao, Shan ; Chen, Zhiyong et al. / Configurable CNN Accelerator in Speech Processing based on Vector Convolution. Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022. Institute of Electrical and Electronics Engineers Inc., 2022. pp. 146-149 (Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022).

@inproceedings{7558ce709180414ca9801ead8d730882,

title = "Configurable CNN Accelerator in Speech Processing based on Vector Convolution",

abstract = "In speech applications, both input feature maps (IFMs) and kernels of neural networks are greatly diverse in shapes and sizes, which poses significant challenges to hardware acceleration. In this paper, a configurable CNN accelerator is introduced to make a good balance between the flexibility and efficiency for various neural network models in speech processing. The vector convolution scheme is first proposed by re-arrangement of IFM rows and weight values in vectors, by which the element convolution is converted into vector operations to break the limit of kernel-centric processing. The structure of vector processing element (VPE) is introduced to fit the continuous scaling down of IFMs with little control overheads, and the architecture of the CNN accelerator is proposed accordingly. FPGA implementation results demonstrate that the throughput is increased by 86% by the proposed architecture compared to state-of-the-art FPGA accelerators for the VGG16 network, while high DSP utilization is guaranteed for both 1D and 2D CNNs with various input sizes.",

keywords = "Accelerator, CNN, FPGA implementation, speech processing",

author = "Lanqing Hui and Shan Cao and Zhiyong Chen and Shan Li and Shugong Xu",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 4th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022 ; Conference date: 13-06-2022 Through 15-06-2022",

year = "2022",

doi = "10.1109/AICAS54282.2022.9869904",

language = "English",

series = "Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "146--149",

booktitle = "Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022",

}

Hui, L, Cao, S, Chen, Z, Li, S & Xu, S 2022, Configurable CNN Accelerator in Speech Processing based on Vector Convolution. in Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022. Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022, Institute of Electrical and Electronics Engineers Inc., pp. 146-149, 4th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022, Incheon, Korea, Republic of, 13/06/22. https://doi.org/10.1109/AICAS54282.2022.9869904

Configurable CNN Accelerator in Speech Processing based on Vector Convolution. / Hui, Lanqing; Cao, Shan; Chen, Zhiyong et al.
Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022. Institute of Electrical and Electronics Engineers Inc., 2022. p. 146-149 (Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Configurable CNN Accelerator in Speech Processing based on Vector Convolution

AU - Hui, Lanqing

AU - Cao, Shan

AU - Chen, Zhiyong

AU - Li, Shan

AU - Xu, Shugong

PY - 2022

Y1 - 2022

N2 - In speech applications, both input feature maps (IFMs) and kernels of neural networks are greatly diverse in shapes and sizes, which poses significant challenges to hardware acceleration. In this paper, a configurable CNN accelerator is introduced to make a good balance between the flexibility and efficiency for various neural network models in speech processing. The vector convolution scheme is first proposed by re-arrangement of IFM rows and weight values in vectors, by which the element convolution is converted into vector operations to break the limit of kernel-centric processing. The structure of vector processing element (VPE) is introduced to fit the continuous scaling down of IFMs with little control overheads, and the architecture of the CNN accelerator is proposed accordingly. FPGA implementation results demonstrate that the throughput is increased by 86% by the proposed architecture compared to state-of-the-art FPGA accelerators for the VGG16 network, while high DSP utilization is guaranteed for both 1D and 2D CNNs with various input sizes.

AB - In speech applications, both input feature maps (IFMs) and kernels of neural networks are greatly diverse in shapes and sizes, which poses significant challenges to hardware acceleration. In this paper, a configurable CNN accelerator is introduced to make a good balance between the flexibility and efficiency for various neural network models in speech processing. The vector convolution scheme is first proposed by re-arrangement of IFM rows and weight values in vectors, by which the element convolution is converted into vector operations to break the limit of kernel-centric processing. The structure of vector processing element (VPE) is introduced to fit the continuous scaling down of IFMs with little control overheads, and the architecture of the CNN accelerator is proposed accordingly. FPGA implementation results demonstrate that the throughput is increased by 86% by the proposed architecture compared to state-of-the-art FPGA accelerators for the VGG16 network, while high DSP utilization is guaranteed for both 1D and 2D CNNs with various input sizes.

KW - Accelerator

KW - CNN

KW - FPGA implementation

KW - speech processing

UR - http://www.scopus.com/inward/record.url?scp=85139059515&partnerID=8YFLogxK

U2 - 10.1109/AICAS54282.2022.9869904

DO - 10.1109/AICAS54282.2022.9869904

M3 - Conference Proceeding

AN - SCOPUS:85139059515

T3 - Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022

SP - 146

EP - 149

BT - Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 4th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022

Y2 - 13 June 2022 through 15 June 2022

ER -

Hui L, Cao S, Chen Z, Li S, Xu S. Configurable CNN Accelerator in Speech Processing based on Vector Convolution. In Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022. Institute of Electrical and Electronics Engineers Inc. 2022. p. 146-149. (Proceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022). doi: 10.1109/AICAS54282.2022.9869904

Configurable CNN Accelerator in Speech Processing based on Vector Convolution

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this