TY - GEN
T1 - Privacy-Preserving Logistic Regression as a Cloud Service Based on Residue Number System
AU - Cortés-Mendoza, Jorge M.
AU - Tchernykh, Andrei
AU - Babenko, Mikhail
AU - Pulido-Gaytán, Luis Bernardo
AU - Radchenko, Gleb
AU - Leprevost, Franck
AU - Wang, Xinheng
AU - Avetisyan, Arutyun
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - The security of data storage, transmission, and processing is emerging as an important consideration in many data analytics techniques and technologies. For instance, in machine learning, the datasets could contain sensitive information that cannot be protected by traditional encryption approaches. Homomorphic encryption schemes and secure multi-party computation are considered as a solution for privacy protection. In this paper, we propose a homomorphic Logistic Regression based on Residue Number System (LR-RNS) that provides security, parallel processing, scalability, error detection, and correction. We verify it using six known datasets from medicine (diabetes, cancer, drugs, etc.) and genomics. We provide experimental analysis with 30 configurations for each dataset to compare the performance and quality of our solution with the state of the art algorithms. For a fair comparison, we use the same 5-fold cross-validation technique. The results show that LR-RNS demonstrates similar accuracy and performance of the classification algorithm at various thresholds settings but with the reduction of training time from 85.9% to 96.1%.
AB - The security of data storage, transmission, and processing is emerging as an important consideration in many data analytics techniques and technologies. For instance, in machine learning, the datasets could contain sensitive information that cannot be protected by traditional encryption approaches. Homomorphic encryption schemes and secure multi-party computation are considered as a solution for privacy protection. In this paper, we propose a homomorphic Logistic Regression based on Residue Number System (LR-RNS) that provides security, parallel processing, scalability, error detection, and correction. We verify it using six known datasets from medicine (diabetes, cancer, drugs, etc.) and genomics. We provide experimental analysis with 30 configurations for each dataset to compare the performance and quality of our solution with the state of the art algorithms. For a fair comparison, we use the same 5-fold cross-validation technique. The results show that LR-RNS demonstrates similar accuracy and performance of the classification algorithm at various thresholds settings but with the reduction of training time from 85.9% to 96.1%.
KW - Cloud security
KW - Homomorphic encryption
KW - Logistic regression
KW - Residue number system
UR - http://www.scopus.com/inward/record.url?scp=85097853082&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-64616-5_51
DO - 10.1007/978-3-030-64616-5_51
M3 - Conference Proceeding
AN - SCOPUS:85097853082
SN - 9783030646158
T3 - Communications in Computer and Information Science
SP - 598
EP - 610
BT - Supercomputing - 6th Russian Supercomputing Days, RuSCDays 2020, Revised Selected Papers
A2 - Voevodin, Vladimir
A2 - Sobolev, Sergey
PB - Springer Science and Business Media Deutschland GmbH
T2 - 6th Russian Supercomputing Days, RuSCDays 2020
Y2 - 21 September 2020 through 22 September 2020
ER -