Continuous Valence-Arousal Space Prediction and Recognition Based on Feature Fusion

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

In emotion recognition, the multimodal feature fusion approach for facial expression recognition is useful due to its versatility and adaptability. It leads to improved model performance by capturing information from different modalities. In this study, we employ feature-level fusion, integrating CNN and HOG features. To predict continuous valence and arousal values, we utilize a Feedforward neural network and Gradient Boosting. Performance evaluation is conducted using Mean Squared Error (MSE) and Root Mean Squared Error (RMSE). The paper presents experiments using the ADFES dataset, considering low, medium, and high intensities, as well as an augmented video dataset. The results shows that instead of relying on complex models, accuracy can be achieved by combining various types of features with appropriate hyperparameter settings and tuning. This approach is not only cost-effective in terms of computation but also robust and computationally efficient.

Original languageEnglish
Title of host publicationICIT 2024 - 2024 25th International Conference on Industrial Technology
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350340266
DOIs
Publication statusPublished - 2024
Event25th IEEE International Conference on Industrial Technology, ICIT 2024 - Bristol, United Kingdom
Duration: 25 Mar 202427 Mar 2024

Publication series

NameProceedings of the IEEE International Conference on Industrial Technology
ISSN (Print)2641-0184
ISSN (Electronic)2643-2978

Conference

Conference25th IEEE International Conference on Industrial Technology, ICIT 2024
Country/TerritoryUnited Kingdom
CityBristol
Period25/03/2427/03/24

Keywords

  • Emotion recognition
  • Feature Fusion
  • CNN-HOG
  • Valence-Arousal Space
  • CNN
  • HOG
  • Feature-Level-Fusion
  • Multimodal Fusion

Fingerprint

Dive into the research topics of 'Continuous Valence-Arousal Space Prediction and Recognition Based on Feature Fusion'. Together they form a unique fingerprint.

Cite this