MoVE-CNNs: Model aVeraging Ensemble of Convolutional Neural Networks for Facial Expression Recognition

Jing Xuan Yu; Kian Ming Lim; Chin Poo Lee

MoVE-CNNs: Model aVeraging Ensemble of Convolutional Neural Networks for Facial Expression Recognition

Jing Xuan Yu^*, Kian Ming Lim, Chin Poo Lee

^*Corresponding author for this work

Multimedia University

Research output: Contribution to journal › Article › peer-review

5 Citations (Scopus)

Abstract

Facial expression is a powerful non-verbal communication that can express emotions and messages without saying a single word. In view of the prominence of facial expression, we propose a model averaging ensemble of Convolutional Neural Networks (CNN) that consolidates multiple pre-trained CNN models. Each pre-trained CNN model first undergoes transfer learning with the classification layer substituted with a multilayer perceptron. The newly formed model is then fine-tuned on the facial expression datasets and adapted to facial expression recognition. The predictions returned by all models are combined by model averaging to determine the final class probability distributions. The proposed model averaging ensemble of CNNs is evaluated on three facial expression datasets: FER-2013, modified CK+ and RAF-DB. Since the modified CK+ dataset is a small dataset, data augmentation is leveraged to increase the size and diversity of data. Apart from that, oversampling is adopted to address the class imbalance challenge in RAF-DB. The empirical results demonstrate that the proposed model averaging ensemble of CNNs outperforms the individual ensemble model at the test accuracy of 77.70%, 94.10% and 87.50% in FER 2013, modified CK+ and RAF-DB datasets, respectively.

Original language	English
Pages (from-to)	1-5
Number of pages	5
Journal	IAENG International Journal of Computer Science
Volume	48
Issue number	3
Publication status	Published - 2021
Externally published	Yes

Keywords

convolutional neural network
data augmentation
ensemble
facial expression
facial expression recognition
model averaging
oversampling
transfer learning

Cite this

@article{098f1d087735440cbd31b9f95fbaa388,

title = "MoVE-CNNs: Model aVeraging Ensemble of Convolutional Neural Networks for Facial Expression Recognition",

abstract = "Facial expression is a powerful non-verbal communication that can express emotions and messages without saying a single word. In view of the prominence of facial expression, we propose a model averaging ensemble of Convolutional Neural Networks (CNN) that consolidates multiple pre-trained CNN models. Each pre-trained CNN model first undergoes transfer learning with the classification layer substituted with a multilayer perceptron. The newly formed model is then fine-tuned on the facial expression datasets and adapted to facial expression recognition. The predictions returned by all models are combined by model averaging to determine the final class probability distributions. The proposed model averaging ensemble of CNNs is evaluated on three facial expression datasets: FER-2013, modified CK+ and RAF-DB. Since the modified CK+ dataset is a small dataset, data augmentation is leveraged to increase the size and diversity of data. Apart from that, oversampling is adopted to address the class imbalance challenge in RAF-DB. The empirical results demonstrate that the proposed model averaging ensemble of CNNs outperforms the individual ensemble model at the test accuracy of 77.70%, 94.10% and 87.50% in FER 2013, modified CK+ and RAF-DB datasets, respectively.",

keywords = "convolutional neural network, data augmentation, ensemble, facial expression, facial expression recognition, model averaging, oversampling, transfer learning",

author = "Yu, {Jing Xuan} and Lim, {Kian Ming} and Lee, {Chin Poo}",

year = "2021",

language = "English",

volume = "48",

pages = "1--5",

journal = "IAENG International Journal of Computer Science",

issn = "1819-656X",

number = "3",

}

TY - JOUR

T1 - MoVE-CNNs: Model aVeraging Ensemble of Convolutional Neural Networks for Facial Expression Recognition

AU - Yu, Jing Xuan

AU - Lim, Kian Ming

AU - Lee, Chin Poo

PY - 2021

Y1 - 2021

N2 - Facial expression is a powerful non-verbal communication that can express emotions and messages without saying a single word. In view of the prominence of facial expression, we propose a model averaging ensemble of Convolutional Neural Networks (CNN) that consolidates multiple pre-trained CNN models. Each pre-trained CNN model first undergoes transfer learning with the classification layer substituted with a multilayer perceptron. The newly formed model is then fine-tuned on the facial expression datasets and adapted to facial expression recognition. The predictions returned by all models are combined by model averaging to determine the final class probability distributions. The proposed model averaging ensemble of CNNs is evaluated on three facial expression datasets: FER-2013, modified CK+ and RAF-DB. Since the modified CK+ dataset is a small dataset, data augmentation is leveraged to increase the size and diversity of data. Apart from that, oversampling is adopted to address the class imbalance challenge in RAF-DB. The empirical results demonstrate that the proposed model averaging ensemble of CNNs outperforms the individual ensemble model at the test accuracy of 77.70%, 94.10% and 87.50% in FER 2013, modified CK+ and RAF-DB datasets, respectively.

AB - Facial expression is a powerful non-verbal communication that can express emotions and messages without saying a single word. In view of the prominence of facial expression, we propose a model averaging ensemble of Convolutional Neural Networks (CNN) that consolidates multiple pre-trained CNN models. Each pre-trained CNN model first undergoes transfer learning with the classification layer substituted with a multilayer perceptron. The newly formed model is then fine-tuned on the facial expression datasets and adapted to facial expression recognition. The predictions returned by all models are combined by model averaging to determine the final class probability distributions. The proposed model averaging ensemble of CNNs is evaluated on three facial expression datasets: FER-2013, modified CK+ and RAF-DB. Since the modified CK+ dataset is a small dataset, data augmentation is leveraged to increase the size and diversity of data. Apart from that, oversampling is adopted to address the class imbalance challenge in RAF-DB. The empirical results demonstrate that the proposed model averaging ensemble of CNNs outperforms the individual ensemble model at the test accuracy of 77.70%, 94.10% and 87.50% in FER 2013, modified CK+ and RAF-DB datasets, respectively.

KW - convolutional neural network

KW - data augmentation

KW - ensemble

KW - facial expression

KW - facial expression recognition

KW - model averaging

KW - oversampling

KW - transfer learning

UR - http://www.scopus.com/inward/record.url?scp=85116045544&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:85116045544

SN - 1819-656X

VL - 48

SP - 1

EP - 5

JO - IAENG International Journal of Computer Science

JF - IAENG International Journal of Computer Science

IS - 3

ER -

MoVE-CNNs: Model aVeraging Ensemble of Convolutional Neural Networks for Facial Expression Recognition

Abstract

Keywords

Other files and links

Fingerprint

Cite this