Study on the Correlation of Trainable Parameters and Hyperparameters with the Performance of Deep Learning Models

Song Quan Ong, Pradeep Isawasan*, Gomesh Nair, Khairulliza Ahmad Salleh, Umi Kalsom Yusof

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

1 Citation (Scopus)

Abstract

Trainable parameters and hyperparameters are critical to the development of a deep learning model. However, the components have typically been studied individually, and most studies have found it difficult to investigate the effects of their combination on model performance. We are interested in examining the correlation between the number of trainable parameters in a deep learning model and its performance metrics under different hyperparameters. Specifically, we want to study the effects of using either the Adam or SGD optimizers at three varying learning rates. We use six pre-trained models whose trainable parameters have been quantitatively defined using two strategies: (1) freezing the convolutional basis with partially trainable weights and (2) training the whole model with most trainable weights to obtain a set of trainable parameters. Our experimental result shows a positive correlation between the trainable parameters and the test accuracy regardless of the level of the learning rate. However, for the generalization of the model, it was not guaranteed that a higher number of trainable parameters would lead to higher accuracy and F1 score. We have shown that the correlation between trainable parameters and model generalization becomes positive by using Adam with the smallest learning rate.

Original languageEnglish
Title of host publication2023 4th International Conference on Artificial Intelligence and Data Sciences
Subtitle of host publicationDiscovering Technological Advancement in Artificial Intelligence and Data Science, AiDAS 2023 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages235-238
Number of pages4
ISBN (Electronic)9798350318432
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event4th International Conference on Artificial Intelligence and Data Sciences, AiDAS 2023 - Virtual, Online, Malaysia
Duration: 6 Sept 20237 Sept 2023

Publication series

Name2023 4th International Conference on Artificial Intelligence and Data Sciences: Discovering Technological Advancement in Artificial Intelligence and Data Science, AiDAS 2023 - Proceedings

Conference

Conference4th International Conference on Artificial Intelligence and Data Sciences, AiDAS 2023
Country/TerritoryMalaysia
CityVirtual, Online
Period6/09/237/09/23

Keywords

  • Deep Convolutional Neural Network
  • Fine-tuning
  • Parameters
  • Regularization

Fingerprint

Dive into the research topics of 'Study on the Correlation of Trainable Parameters and Hyperparameters with the Performance of Deep Learning Models'. Together they form a unique fingerprint.

Cite this