Randomness and Interpolation Improve Gradient Descent: A Simple Exploration in CIFAR datasets

Jiawen Li, Pascal Lefevre, Anwar P. P. Majeed

Research output: Chapter in Book/Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

Building on Stochastic Gradient Descent (SGD), this paper introduces two optimizers: Interpolational Accelerating Gradient Descent (IAGD) and Noise-Regularized Stochastic Gradient Descent (NRSGD). IAGD leverages second-order Newton interpolation to expedite convergence during training, under the assumption that gradients across successive iterations are correlated. To mitigate over-fitting, NRSGD incorporates a noise-regularization technique that injects controlled noise into the gradients during optimization. Comparative experiments are conducted on the CIFAR-10 and CIFAR-100 datasets, benchmarking several CNNs (Convolutional Neural Networks) trained with IAGD and NRSGD against the classical optimizers in the Keras package. The results demonstrate that both methods are viable improvements to SGD, indicating the effectiveness of the proposed advancements.
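
As an illustration of the two ideas described in the abstract, below is a minimal NumPy sketch of both update rules. The abstract does not specify the noise schedule, the interpolation nodes, or any hyperparameters, so the names (nrsgd_update, IAGD), the Gaussian noise with fixed standard deviation, and the three-point Newton forward-difference extrapolation of the gradient are illustrative assumptions rather than the authors' exact formulation.

import numpy as np

def nrsgd_update(params, grad, lr=0.01, noise_std=0.01, rng=None):
    # NRSGD sketch: add zero-mean Gaussian noise to the gradient before the
    # usual SGD step (the paper's noise magnitude and schedule may differ).
    rng = np.random.default_rng() if rng is None else rng
    noisy_grad = grad + rng.normal(0.0, noise_std, size=grad.shape)
    return params - lr * noisy_grad

class IAGD:
    # IAGD sketch: keep the last three gradients and extrapolate the next one
    # with a second-order Newton (forward-difference) polynomial, assuming
    # gradients at successive iterations are correlated.
    def __init__(self, lr=0.01):
        self.lr = lr
        self.history = []  # oldest gradient first, newest last

    def step(self, params, grad):
        self.history.append(grad)
        if len(self.history) > 3:
            self.history.pop(0)
        if len(self.history) == 3:
            g2, g1, g0 = self.history  # g0 is the most recent gradient
            # Newton forward interpolation through steps t-2, t-1, t, evaluated at t+1:
            # p(t+1) = g0 + (g0 - g1) + (g0 - 2*g1 + g2) = 3*g0 - 3*g1 + g2
            grad = 3 * g0 - 3 * g1 + g2
        return params - self.lr * grad

# Toy usage on f(x) = ||x||^2, whose gradient is 2x.
x = np.array([1.0, -2.0])
y = np.array([1.0, -2.0])
opt = IAGD(lr=0.1)
rng = np.random.default_rng(0)
for _ in range(50):
    x = opt.step(x, 2 * x)                                        # interpolation-accelerated updates
    y = nrsgd_update(y, 2 * y, lr=0.1, noise_std=0.01, rng=rng)   # noise-regularized updates
print(x, y)  # both should approach the origin

In the paper the optimizers are applied to CNN training on CIFAR-10/100 via Keras; the quadratic toy problem above only demonstrates the update rules themselves.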
Original language: English
Title of host publication: Randomness and Interpolation Improve Gradient Descent: A Simple Exploration in CIFAR datasets
Publisher: IEEE TCCC CyberC
Publication status: Published - Sept 2024
