A Computational-Efficient Deformable Convolution Network Accelerator via Hardware and Algorithm Co-Optimization

Shan Li; Shan Cao; Lanqing Hui; Zhiyuan Jiang; Yanzan Sun; Shugong Xu

doi:10.1109/SiPS55645.2022.9919242

A Computational-Efficient Deformable Convolution Network Accelerator via Hardware and Algorithm Co-Optimization

Shan Li, Shan Cao^*, Lanqing Hui, Zhiyuan Jiang, Yanzan Sun, Shugong Xu

^*Corresponding author for this work

Shanghai University

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

5 Citations (Scopus)

Abstract

Deformable convolution networks (DCNs) shows performance boosts on object recognition tasks by enabling variable geometric modeling. However, the irregular addressing of memory accesses makes it inefficient for hardware acceleration. In this paper, we propose a computational-efficient hardware accelerator for DCNs. First, a hardware-friendly DCNs inference scheme is introduced based on the original DCNs algorithm with little accuracy loss. Secondly, a hardware accelerator architecture is presented correspondingly, and an speed matching method is introduced to maximizing the number of deformable layers without latency increase. The proposed accelerator is implemented on the Arria 10 FPGA, results of which show that the proposed design achieves the highest throughput and DSP efficiency compared with state-of-the-art DCNs accelerators.

Original language	English
Title of host publication	2022 IEEE Workshop on Signal Processing Systems, SiPS 2022
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781665485241
DOIs	https://doi.org/10.1109/SiPS55645.2022.9919242
Publication status	Published - 2022
Externally published	Yes
Event	36th IEEE Workshop on Signal Processing Systems, SiPS 2022 - Rennes, France Duration: 2 Nov 2022 → 4 Nov 2022

Publication series

Name	IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation
Volume	2022-November
ISSN (Print)	1520-6130

Conference

Conference	36th IEEE Workshop on Signal Processing Systems, SiPS 2022
Country/Territory	France
City	Rennes
Period	2/11/22 → 4/11/22

Keywords

deformable convolution
FPGA
Hardware accelerator
hardware-friendly algorithm

Access to Document

10.1109/SiPS55645.2022.9919242

Cite this

Li, S., Cao, S., Hui, L., Jiang, Z., Sun, Y., & Xu, S. (2022). A Computational-Efficient Deformable Convolution Network Accelerator via Hardware and Algorithm Co-Optimization. In 2022 IEEE Workshop on Signal Processing Systems, SiPS 2022 (IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation; Vol. 2022-November). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/SiPS55645.2022.9919242

@inproceedings{a970965214574ffc9240d383bfdc783d,

title = "A Computational-Efficient Deformable Convolution Network Accelerator via Hardware and Algorithm Co-Optimization",

abstract = "Deformable convolution networks (DCNs) shows performance boosts on object recognition tasks by enabling variable geometric modeling. However, the irregular addressing of memory accesses makes it inefficient for hardware acceleration. In this paper, we propose a computational-efficient hardware accelerator for DCNs. First, a hardware-friendly DCNs inference scheme is introduced based on the original DCNs algorithm with little accuracy loss. Secondly, a hardware accelerator architecture is presented correspondingly, and an speed matching method is introduced to maximizing the number of deformable layers without latency increase. The proposed accelerator is implemented on the Arria 10 FPGA, results of which show that the proposed design achieves the highest throughput and DSP efficiency compared with state-of-the-art DCNs accelerators.",

keywords = "deformable convolution, FPGA, Hardware accelerator, hardware-friendly algorithm",

author = "Shan Li and Shan Cao and Lanqing Hui and Zhiyuan Jiang and Yanzan Sun and Shugong Xu",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 36th IEEE Workshop on Signal Processing Systems, SiPS 2022 ; Conference date: 02-11-2022 Through 04-11-2022",

year = "2022",

doi = "10.1109/SiPS55645.2022.9919242",

language = "English",

series = "IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2022 IEEE Workshop on Signal Processing Systems, SiPS 2022",

}

Li, S, Cao, S, Hui, L, Jiang, Z, Sun, Y & Xu, S 2022, A Computational-Efficient Deformable Convolution Network Accelerator via Hardware and Algorithm Co-Optimization. in 2022 IEEE Workshop on Signal Processing Systems, SiPS 2022. IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation, vol. 2022-November, Institute of Electrical and Electronics Engineers Inc., 36th IEEE Workshop on Signal Processing Systems, SiPS 2022, Rennes, France, 2/11/22. https://doi.org/10.1109/SiPS55645.2022.9919242

A Computational-Efficient Deformable Convolution Network Accelerator via Hardware and Algorithm Co-Optimization. / Li, Shan; Cao, Shan; Hui, Lanqing et al.
2022 IEEE Workshop on Signal Processing Systems, SiPS 2022. Institute of Electrical and Electronics Engineers Inc., 2022. (IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation; Vol. 2022-November).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - A Computational-Efficient Deformable Convolution Network Accelerator via Hardware and Algorithm Co-Optimization

AU - Li, Shan

AU - Cao, Shan

AU - Hui, Lanqing

AU - Jiang, Zhiyuan

AU - Sun, Yanzan

AU - Xu, Shugong

PY - 2022

Y1 - 2022

N2 - Deformable convolution networks (DCNs) shows performance boosts on object recognition tasks by enabling variable geometric modeling. However, the irregular addressing of memory accesses makes it inefficient for hardware acceleration. In this paper, we propose a computational-efficient hardware accelerator for DCNs. First, a hardware-friendly DCNs inference scheme is introduced based on the original DCNs algorithm with little accuracy loss. Secondly, a hardware accelerator architecture is presented correspondingly, and an speed matching method is introduced to maximizing the number of deformable layers without latency increase. The proposed accelerator is implemented on the Arria 10 FPGA, results of which show that the proposed design achieves the highest throughput and DSP efficiency compared with state-of-the-art DCNs accelerators.

AB - Deformable convolution networks (DCNs) shows performance boosts on object recognition tasks by enabling variable geometric modeling. However, the irregular addressing of memory accesses makes it inefficient for hardware acceleration. In this paper, we propose a computational-efficient hardware accelerator for DCNs. First, a hardware-friendly DCNs inference scheme is introduced based on the original DCNs algorithm with little accuracy loss. Secondly, a hardware accelerator architecture is presented correspondingly, and an speed matching method is introduced to maximizing the number of deformable layers without latency increase. The proposed accelerator is implemented on the Arria 10 FPGA, results of which show that the proposed design achieves the highest throughput and DSP efficiency compared with state-of-the-art DCNs accelerators.

KW - deformable convolution

KW - FPGA

KW - Hardware accelerator

KW - hardware-friendly algorithm

UR - http://www.scopus.com/inward/record.url?scp=85141763307&partnerID=8YFLogxK

U2 - 10.1109/SiPS55645.2022.9919242

DO - 10.1109/SiPS55645.2022.9919242

M3 - Conference Proceeding

AN - SCOPUS:85141763307

T3 - IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation

BT - 2022 IEEE Workshop on Signal Processing Systems, SiPS 2022

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 36th IEEE Workshop on Signal Processing Systems, SiPS 2022

Y2 - 2 November 2022 through 4 November 2022

ER -

Li S, Cao S, Hui L, Jiang Z, Sun Y, Xu S. A Computational-Efficient Deformable Convolution Network Accelerator via Hardware and Algorithm Co-Optimization. In 2022 IEEE Workshop on Signal Processing Systems, SiPS 2022. Institute of Electrical and Electronics Engineers Inc. 2022. (IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation). doi: 10.1109/SiPS55645.2022.9919242

A Computational-Efficient Deformable Convolution Network Accelerator via Hardware and Algorithm Co-Optimization

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this