TY - GEN
T1 - Continuous Valence-Arousal Space Prediction and Recognition Based on Feature Fusion
AU - Ayoub, Misbah
AU - Zhang, Haiyang
AU - Abel, Andrew
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - In emotion recognition, the multimodal feature fusion approach for facial expression recognition is useful due to its versatility and adaptability. It leads to improved model performance by capturing information from different modalities. In this study, we employ feature-level fusion, integrating CNN and HOG features. To predict continuous valence and arousal values, we utilize a Feedforward neural network and Gradient Boosting. Performance evaluation is conducted using Mean Squared Error (MSE) and Root Mean Squared Error (RMSE). The paper presents experiments using the ADFES dataset, considering low, medium, and high intensities, as well as an augmented video dataset. The results shows that instead of relying on complex models, accuracy can be achieved by combining various types of features with appropriate hyperparameter settings and tuning. This approach is not only cost-effective in terms of computation but also robust and computationally efficient.
AB - In emotion recognition, the multimodal feature fusion approach for facial expression recognition is useful due to its versatility and adaptability. It leads to improved model performance by capturing information from different modalities. In this study, we employ feature-level fusion, integrating CNN and HOG features. To predict continuous valence and arousal values, we utilize a Feedforward neural network and Gradient Boosting. Performance evaluation is conducted using Mean Squared Error (MSE) and Root Mean Squared Error (RMSE). The paper presents experiments using the ADFES dataset, considering low, medium, and high intensities, as well as an augmented video dataset. The results shows that instead of relying on complex models, accuracy can be achieved by combining various types of features with appropriate hyperparameter settings and tuning. This approach is not only cost-effective in terms of computation but also robust and computationally efficient.
KW - Emotion recognition
KW - Feature Fusion
KW - CNN-HOG
KW - Valence-Arousal Space
KW - CNN
KW - HOG
KW - Feature-Level-Fusion
KW - Multimodal Fusion
UR - http://www.scopus.com/inward/record.url?scp=85195783577&partnerID=8YFLogxK
U2 - 10.1109/ICIT58233.2024.10540915
DO - 10.1109/ICIT58233.2024.10540915
M3 - Conference Proceeding
AN - SCOPUS:85195783577
T3 - Proceedings of the IEEE International Conference on Industrial Technology
BT - ICIT 2024 - 2024 25th International Conference on Industrial Technology
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th IEEE International Conference on Industrial Technology, ICIT 2024
Y2 - 25 March 2024 through 27 March 2024
ER -