DLSANet: Facial expression recognition with double-code LBP-layer spatial-attention network

Xing Guo; Siyuan Lu; Shuihua Wang; Zhihai Lu; Yudong Zhang

doi:10.1049/ipr2.12817

DLSANet: Facial expression recognition with double-code LBP-layer spatial-attention network

Xing Guo, Siyuan Lu, Shuihua Wang, Zhihai Lu^*, Yudong Zhang^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Facial expression recognition (FER) is widely used in many fields. To further improve the accuracy of FER, this paper proposes a method based on double-code LBP-layer spatial-attention network (DLSANet). The backbone model for the DLSANet is an emotion network (ENet), which is modified with a double-code LBP (DLBP) layer and a spatial attention module. The DLBP layer is at the front of the first convolutional layer. More valuable features can be extracted by inputting the image processed by DLBP into convolutional layers. The JAFFE and CK+ datasets are used, which contain seven expressions: happiness, anger, disgust, neutral, fear, sadness, and surprise. The average of fivefold cross-validation shows that DLSANet achieves a recognition accuracy of 93.81% and 98.68% on the JAFFE and CK+ datasets. The experiment reveals that the DLSANet can produce better classification results than state-of-the-art methods.

Original language	English
Pages (from-to)	2659-2672
Number of pages	14
Journal	IET Image Processing
Volume	17
Issue number	9
DOIs	https://doi.org/10.1049/ipr2.12817
Publication status	Published - 20 Jul 2023
Externally published	Yes

Keywords

artificial intelligence
belief networks
convolutional neural network
local binary pattern
pattern recognition
spatial attention module

Access to Document

10.1049/ipr2.12817

Cite this

@article{85f5df3770aa46b781a91fee92e64bd6,

title = "DLSANet: Facial expression recognition with double-code LBP-layer spatial-attention network",

abstract = "Facial expression recognition (FER) is widely used in many fields. To further improve the accuracy of FER, this paper proposes a method based on double-code LBP-layer spatial-attention network (DLSANet). The backbone model for the DLSANet is an emotion network (ENet), which is modified with a double-code LBP (DLBP) layer and a spatial attention module. The DLBP layer is at the front of the first convolutional layer. More valuable features can be extracted by inputting the image processed by DLBP into convolutional layers. The JAFFE and CK+ datasets are used, which contain seven expressions: happiness, anger, disgust, neutral, fear, sadness, and surprise. The average of fivefold cross-validation shows that DLSANet achieves a recognition accuracy of 93.81% and 98.68% on the JAFFE and CK+ datasets. The experiment reveals that the DLSANet can produce better classification results than state-of-the-art methods.",

keywords = "artificial intelligence, belief networks, convolutional neural network, local binary pattern, pattern recognition, spatial attention module",

author = "Xing Guo and Siyuan Lu and Shuihua Wang and Zhihai Lu and Yudong Zhang",

note = "Publisher Copyright: {\textcopyright} 2023 The Authors. IET Image Processing published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology.",

year = "2023",

month = jul,

day = "20",

doi = "10.1049/ipr2.12817",

language = "English",

volume = "17",

pages = "2659--2672",

journal = "IET Image Processing",

issn = "1751-9659",

number = "9",

}

TY - JOUR

T1 - DLSANet

T2 - Facial expression recognition with double-code LBP-layer spatial-attention network

AU - Guo, Xing

AU - Lu, Siyuan

AU - Wang, Shuihua

AU - Lu, Zhihai

AU - Zhang, Yudong

PY - 2023/7/20

Y1 - 2023/7/20

N2 - Facial expression recognition (FER) is widely used in many fields. To further improve the accuracy of FER, this paper proposes a method based on double-code LBP-layer spatial-attention network (DLSANet). The backbone model for the DLSANet is an emotion network (ENet), which is modified with a double-code LBP (DLBP) layer and a spatial attention module. The DLBP layer is at the front of the first convolutional layer. More valuable features can be extracted by inputting the image processed by DLBP into convolutional layers. The JAFFE and CK+ datasets are used, which contain seven expressions: happiness, anger, disgust, neutral, fear, sadness, and surprise. The average of fivefold cross-validation shows that DLSANet achieves a recognition accuracy of 93.81% and 98.68% on the JAFFE and CK+ datasets. The experiment reveals that the DLSANet can produce better classification results than state-of-the-art methods.

AB - Facial expression recognition (FER) is widely used in many fields. To further improve the accuracy of FER, this paper proposes a method based on double-code LBP-layer spatial-attention network (DLSANet). The backbone model for the DLSANet is an emotion network (ENet), which is modified with a double-code LBP (DLBP) layer and a spatial attention module. The DLBP layer is at the front of the first convolutional layer. More valuable features can be extracted by inputting the image processed by DLBP into convolutional layers. The JAFFE and CK+ datasets are used, which contain seven expressions: happiness, anger, disgust, neutral, fear, sadness, and surprise. The average of fivefold cross-validation shows that DLSANet achieves a recognition accuracy of 93.81% and 98.68% on the JAFFE and CK+ datasets. The experiment reveals that the DLSANet can produce better classification results than state-of-the-art methods.

KW - artificial intelligence

KW - belief networks

KW - convolutional neural network

KW - local binary pattern

KW - pattern recognition

KW - spatial attention module

UR - http://www.scopus.com/inward/record.url?scp=85153600659&partnerID=8YFLogxK

U2 - 10.1049/ipr2.12817

DO - 10.1049/ipr2.12817

M3 - Article

AN - SCOPUS:85153600659

SN - 1751-9659

VL - 17

SP - 2659

EP - 2672

JO - IET Image Processing

JF - IET Image Processing

IS - 9

ER -

DLSANet: Facial expression recognition with double-code LBP-layer spatial-attention network

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this