TY - GEN
T1 - Cost-Efficient Parallel RSA Decryption with Integrated GPGPU and OpenCL
AU - Gao, Sanshan
AU - Zhang, Shunqing
AU - Fu, Michael
AU - Xu, Shugong
AU - Li, Weiping
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2017/1/12
Y1 - 2017/1/12
N2 - Secure communications become more, more important with the exponential growth of Internet data services. However, to massively deploy such services in current networks, we need to deal with different architectures, time-consuming encryption/decryption algorithms. Recently, numerous research efforts have been focused on parallel computing using high-end discrete GPGPU to achieve peak throughput, while the study on more cost-efficient integrated GPGPU is still open, which is included in almost all of the Intel CPUs currently, e.g. Celeron, Pentium, Core, even Xeon-E3 families. In this paper, we propose a generic heterogeneous computing approach for the decryption of RSA algorithm. Based on OpenCL, we are able to implement our idea on the off-the-shelf mini PC with low-cost integrated GPGPU. According to the experimental results, we show that the proposed scheme can fully utilize the capacity of integrated GPGPU, a 2 times throughput improvement over the CPUonly solution can be expected.
AB - Secure communications become more, more important with the exponential growth of Internet data services. However, to massively deploy such services in current networks, we need to deal with different architectures, time-consuming encryption/decryption algorithms. Recently, numerous research efforts have been focused on parallel computing using high-end discrete GPGPU to achieve peak throughput, while the study on more cost-efficient integrated GPGPU is still open, which is included in almost all of the Intel CPUs currently, e.g. Celeron, Pentium, Core, even Xeon-E3 families. In this paper, we propose a generic heterogeneous computing approach for the decryption of RSA algorithm. Based on OpenCL, we are able to implement our idea on the off-the-shelf mini PC with low-cost integrated GPGPU. According to the experimental results, we show that the proposed scheme can fully utilize the capacity of integrated GPGPU, a 2 times throughput improvement over the CPUonly solution can be expected.
KW - GPGPU
KW - Heterogenous computing
KW - OpenCL
KW - Parallel computing
KW - RSA
UR - http://www.scopus.com/inward/record.url?scp=85013159405&partnerID=8YFLogxK
U2 - 10.1109/UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld.2016.0101
DO - 10.1109/UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld.2016.0101
M3 - Conference Proceeding
AN - SCOPUS:85013159405
T3 - Proceedings - 13th IEEE International Conference on Ubiquitous Intelligence and Computing, 13th IEEE International Conference on Advanced and Trusted Computing, 16th IEEE International Conference on Scalable Computing and Communications, IEEE International Conference on Cloud and Big Data Computing, IEEE International Conference on Internet of People and IEEE Smart World Congress and Workshops, UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld 2016
SP - 597
EP - 602
BT - Proceedings - 13th IEEE International Conference on Ubiquitous Intelligence and Computing, 13th IEEE International Conference on Advanced and Trusted Computing, 16th IEEE International Conference on Scalable Computing and Communications, IEEE International Conference on Cloud and Big Data Computing, IEEE International Conference on Internet of People and IEEE Smart World Congress and Workshops, UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld 2016
A2 - El Baz, Didier
A2 - Bourgeois, Julien
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 13th IEEE International Conference on Ubiquitous Intelligence and Computing, 13th IEEE International Conference on Advanced and Trusted Computing, 16th IEEE International Conference on Scalable Computing and Communications, IEEE International Conference on Cloud and Big Data Computing, IEEE International Conference on Internet of People and IEEE Smart World Congress and Workshops, UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld 2016
Y2 - 18 July 2016 through 21 July 2016
ER -