TY - GEN
T1 - Facial Expression Recognition Based on TripletLoss and Attention Mechanism
AU - Gou, Rongqiang
AU - Gai, Qiuyan
AU - Xu, Zhijie
AU - Xu, Yuanping
AU - Zhang, Chaolong
AU - Jin, Jin
AU - He, Jia
AU - Shi, Yajing
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - This paper proposes a facial expression recognition method called Triplet-Loss Attention Network (TAN), which aims to address the problem of large intra-class and inter-class distances in facial expression recognition. The method uses a self-attention mechanism to calculate the weights of triplet expression samples, which helps determine the influence of key regions on expression recognition. Images with attention weights above a given threshold are used as training samples to form new hard triplets. The distance between the high-weight samples in the three sets of samples is calculated with the Mahalanobis distance formula, and the difference in distance between the high-weight groups of the triplets is computed with the Mahalanobis loss function. Triplet Loss serves as the main loss function in TAN, and the model is jointly optimized with the Triplet Loss, Mahalanobis Loss, and Cross Entropy Loss functions to improve its facial expression recognition performance. Experimental results show that TAN effectively alleviates the intra-class and inter-class distance problem and exhibits good robustness and generalization. On the RAF-DB and FERPlus datasets, TAN achieves recognition accuracies of 88.40% and 88.73%, respectively, which are 1.37% and 0.18% higher than the previous state-of-the-art methods.
AB - This paper proposes a facial expression recognition method called Triplet-Loss Attention Network (TAN), which aims to address the problem of large intra-class and inter-class distances in facial expression recognition. The method uses a self-attention mechanism to calculate the weights of triplet expression samples, which helps determine the influence of key regions on expression recognition. Images with attention weights above a given threshold are used as training samples to form new hard triplets. The distance between the high-weight samples in the three sets of samples is calculated with the Mahalanobis distance formula, and the difference in distance between the high-weight groups of the triplets is computed with the Mahalanobis loss function. Triplet Loss serves as the main loss function in TAN, and the model is jointly optimized with the Triplet Loss, Mahalanobis Loss, and Cross Entropy Loss functions to improve its facial expression recognition performance. Experimental results show that TAN effectively alleviates the intra-class and inter-class distance problem and exhibits good robustness and generalization. On the RAF-DB and FERPlus datasets, TAN achieves recognition accuracies of 88.40% and 88.73%, respectively, which are 1.37% and 0.18% higher than the previous state-of-the-art methods.
KW - Attention Mechanism
KW - FER
KW - Triplet Loss
UR - http://www.scopus.com/inward/record.url?scp=85175576341&partnerID=8YFLogxK
U2 - 10.1109/ICAC57885.2023.10275257
DO - 10.1109/ICAC57885.2023.10275257
M3 - Conference Proceeding
AN - SCOPUS:85175576341
T3 - ICAC 2023 - 28th International Conference on Automation and Computing
BT - ICAC 2023 - 28th International Conference on Automation and Computing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 28th International Conference on Automation and Computing, ICAC 2023
Y2 - 30 August 2023 through 1 September 2023
ER -