TY - GEN
T1 - Randomness and Interpolation Improve Gradient Descent
T2 - 16th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2024
AU - Li, Jiawen
AU - Lefevre, Pascal
AU - Abdul Majeed, Anwar P. P.
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Based on Stochastic Gradient Descent (SGD), this paper introduces two optimizers: Interpolational Accelerating Gradient Descent (IAGD) and Noise-Regularized Stochastic Gradient Descent (NRSGD). IAGD leverages second-order Newton interpolation to expedite convergence during training, assuming that gradients are correlated between iterations. To avoid overfitting, NRSGD incorporates a noise regularization technique that introduces controlled noise to the gradients during the optimization process. Comparative experiments are conducted on the CIFAR-10 and CIFAR-100 datasets, benchmarking different CNNs (Convolutional Neural Networks) trained with IAGD and NRSGD against classical optimizers from the Keras package. Results demonstrate the potential of these two methods as viable improvements to SGD, indicating the effectiveness of the proposed advancements.
AB - Based on Stochastic Gradient Descent (SGD), this paper introduces two optimizers: Interpolational Accelerating Gradient Descent (IAGD) and Noise-Regularized Stochastic Gradient Descent (NRSGD). IAGD leverages second-order Newton interpolation to expedite convergence during training, assuming that gradients are correlated between iterations. To avoid overfitting, NRSGD incorporates a noise regularization technique that introduces controlled noise to the gradients during the optimization process. Comparative experiments are conducted on the CIFAR-10 and CIFAR-100 datasets, benchmarking different CNNs (Convolutional Neural Networks) trained with IAGD and NRSGD against classical optimizers from the Keras package. Results demonstrate the potential of these two methods as viable improvements to SGD, indicating the effectiveness of the proposed advancements.
KW - Deep Learning
KW - Interpolation
KW - Optimization
KW - Stochastic Gradient Descent (SGD)
UR - http://www.scopus.com/inward/record.url?scp=85215130364&partnerID=8YFLogxK
U2 - 10.1109/CyberC62439.2024.00020
DO - 10.1109/CyberC62439.2024.00020
M3 - Conference Proceeding
AN - SCOPUS:85215130364
T3 - Proceedings - 2024 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2024
SP - 56
EP - 59
BT - Proceedings - 2024 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 October 2024 through 26 October 2024
ER -