PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition

Jia Le Ngwe; Kian Ming Lim; Chin Poo Lee; Thian Song Ong; Ali Alqahtani

doi:10.1109/ACCESS.2024.3407108

PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition

Jia Le Ngwe, Kian Ming Lim^*, Chin Poo Lee, Thian Song Ong, Ali Alqahtani

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)

Abstract

Facial Expression Recognition (FER) is a machine learning problem that deals with recognizing human facial expressions. While existing work has achieved performance improvements in recent years, FER in the wild and under challenging conditions remains a challenge. In this paper, a lightweight patch and attention network based on MobileNetV1, referred to as PAtt-Lite, is proposed to improve FER performance under challenging conditions. A truncated ImageNet-pre-trained MobileNetV1 is utilized as the backbone feature extractor of the proposed method. In place of the truncated layers is a patch extraction block that is proposed for extracting significant local facial features to enhance the representation from MobileNetV1, especially under challenging conditions. An attention classifier is also proposed to improve the learning of these patched feature maps from the extremely lightweight feature extractor. The experimental results on public benchmark databases proved the effectiveness of the proposed method. PAtt-Lite achieved state-of-the-art results on CK+, RAF-DB, FER2013, FERPlus, and the challenging conditions subsets for RAF-DB and FERPlus.

Original language	English
Pages (from-to)	79327-79341
Number of pages	15
Journal	IEEE Access
Volume	12
DOIs	https://doi.org/10.1109/ACCESS.2024.3407108
Publication status	Published - 2024
Externally published	Yes

Keywords

Facial expression recognition
MobileNetV1
patch extraction
self-attention

Access to Document

10.1109/ACCESS.2024.3407108

Cite this

@article{508fc409775b4421993c4dad09b6bffc,

title = "PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition",

abstract = "Facial Expression Recognition (FER) is a machine learning problem that deals with recognizing human facial expressions. While existing work has achieved performance improvements in recent years, FER in the wild and under challenging conditions remains a challenge. In this paper, a lightweight patch and attention network based on MobileNetV1, referred to as PAtt-Lite, is proposed to improve FER performance under challenging conditions. A truncated ImageNet-pre-trained MobileNetV1 is utilized as the backbone feature extractor of the proposed method. In place of the truncated layers is a patch extraction block that is proposed for extracting significant local facial features to enhance the representation from MobileNetV1, especially under challenging conditions. An attention classifier is also proposed to improve the learning of these patched feature maps from the extremely lightweight feature extractor. The experimental results on public benchmark databases proved the effectiveness of the proposed method. PAtt-Lite achieved state-of-the-art results on CK+, RAF-DB, FER2013, FERPlus, and the challenging conditions subsets for RAF-DB and FERPlus.",

keywords = "Facial expression recognition, MobileNetV1, patch extraction, self-attention",

author = "Ngwe, {Jia Le} and Lim, {Kian Ming} and Lee, {Chin Poo} and Ong, {Thian Song} and Ali Alqahtani",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2024",

doi = "10.1109/ACCESS.2024.3407108",

language = "English",

volume = "12",

pages = "79327--79341",

journal = "IEEE Access",

issn = "2169-3536",

}

TY - JOUR

T1 - PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition

AU - Ngwe, Jia Le

AU - Lim, Kian Ming

AU - Lee, Chin Poo

AU - Ong, Thian Song

AU - Alqahtani, Ali

PY - 2024

Y1 - 2024

N2 - Facial Expression Recognition (FER) is a machine learning problem that deals with recognizing human facial expressions. While existing work has achieved performance improvements in recent years, FER in the wild and under challenging conditions remains a challenge. In this paper, a lightweight patch and attention network based on MobileNetV1, referred to as PAtt-Lite, is proposed to improve FER performance under challenging conditions. A truncated ImageNet-pre-trained MobileNetV1 is utilized as the backbone feature extractor of the proposed method. In place of the truncated layers is a patch extraction block that is proposed for extracting significant local facial features to enhance the representation from MobileNetV1, especially under challenging conditions. An attention classifier is also proposed to improve the learning of these patched feature maps from the extremely lightweight feature extractor. The experimental results on public benchmark databases proved the effectiveness of the proposed method. PAtt-Lite achieved state-of-the-art results on CK+, RAF-DB, FER2013, FERPlus, and the challenging conditions subsets for RAF-DB and FERPlus.

AB - Facial Expression Recognition (FER) is a machine learning problem that deals with recognizing human facial expressions. While existing work has achieved performance improvements in recent years, FER in the wild and under challenging conditions remains a challenge. In this paper, a lightweight patch and attention network based on MobileNetV1, referred to as PAtt-Lite, is proposed to improve FER performance under challenging conditions. A truncated ImageNet-pre-trained MobileNetV1 is utilized as the backbone feature extractor of the proposed method. In place of the truncated layers is a patch extraction block that is proposed for extracting significant local facial features to enhance the representation from MobileNetV1, especially under challenging conditions. An attention classifier is also proposed to improve the learning of these patched feature maps from the extremely lightweight feature extractor. The experimental results on public benchmark databases proved the effectiveness of the proposed method. PAtt-Lite achieved state-of-the-art results on CK+, RAF-DB, FER2013, FERPlus, and the challenging conditions subsets for RAF-DB and FERPlus.

KW - Facial expression recognition

KW - MobileNetV1

KW - patch extraction

KW - self-attention

UR - http://www.scopus.com/inward/record.url?scp=85194898453&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2024.3407108

DO - 10.1109/ACCESS.2024.3407108

M3 - Article

AN - SCOPUS:85194898453

SN - 2169-3536

VL - 12

SP - 79327

EP - 79341

JO - IEEE Access

JF - IEEE Access

ER -

PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this