Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer

doi:10.3390/s23083809

Jashila Nair Mogan, Chin Poo Lee^*, Kian Ming Lim, Mohammed Ali, Ali Alqahtani

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

26 Citations (Scopus)

Abstract

Gait recognition, the task of identifying an individual based on their unique walking style, can be difficult because walking styles can be influenced by external factors such as clothing, viewing angle, and carrying conditions. To address these challenges, this paper proposes a multi-model gait recognition system that integrates Convolutional Neural Networks (CNNs) and Vision Transformer. The first step in the process is to obtain a gait energy image, which is achieved by applying an averaging technique to a gait cycle. The gait energy image is then fed into three different models, DenseNet-201, VGG-16, and a Vision Transformer. These models are pre-trained and fine-tuned to encode the salient gait features that are specific to an individual’s walking style. Each model provides prediction scores for the classes based on the encoded features, and these scores are then summed and averaged to produce the final class label. The performance of this multi-model gait recognition system was evaluated on three datasets, CASIA-B, OU-ISIR dataset D, and OU-ISIR Large Population dataset. The experimental results showed substantial improvement compared to existing methods on all three datasets. The integration of CNNs and ViT allows the system to learn both the pre-defined and distinct features, providing a robust solution for gait recognition even under the influence of covariates.
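The two mechanical steps named in the abstract, averaging a gait cycle into a gait energy image and averaging per-model class scores into a final label, can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the authors' implementation: the function names `gait_energy_image` and `fuse_scores`, the toy 2x2 silhouettes, and the three hand-written score vectors (standing in for the outputs of DenseNet-201, VGG-16, and the Vision Transformer) are all hypothetical, and the silhouettes are assumed to be already segmented, aligned, and cropped to a single cycle.

```python
import numpy as np


def gait_energy_image(silhouettes: np.ndarray) -> np.ndarray:
    """Average the aligned binary silhouettes of one gait cycle.

    `silhouettes` has shape (T, H, W): T frames of an H x W silhouette.
    The result is an H x W image whose pixel values lie in [0, 1].
    """
    if silhouettes.ndim != 3 or silhouettes.shape[0] == 0:
        raise ValueError("expected a non-empty array of shape (T, H, W)")
    return silhouettes.astype(np.float64).mean(axis=0)


def fuse_scores(per_model_scores: list[np.ndarray]) -> tuple[int, np.ndarray]:
    """Sum the class scores of all models, divide by the model count,
    and return the index of the highest fused score with the fused scores."""
    stacked = np.stack(per_model_scores)            # shape (M, C)
    fused = stacked.sum(axis=0) / stacked.shape[0]  # shape (C,)
    return int(np.argmax(fused)), fused


# Hypothetical toy cycle: four frames of a 2 x 2 binary silhouette.
cycle = np.array(
    [
        [[0, 1], [1, 1]],
        [[0, 0], [1, 1]],
        [[0, 1], [0, 1]],
        [[0, 0], [1, 1]],
    ],
    dtype=np.uint8,
)
gei = gait_energy_image(cycle)

# Hypothetical class scores from three models over three subjects.
scores = [
    np.array([0.6, 0.3, 0.1]),
    np.array([0.2, 0.5, 0.3]),
    np.array([0.1, 0.7, 0.2]),
]
label, fused = fuse_scores(scores)
```

In this toy run the first model alone would pick subject 0, but the averaged scores favour subject 1, which is the kind of disagreement score-level fusion is meant to resolve.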

Original language	English
Article number	3809
Journal	Sensors
Volume	23
Issue number	8
DOIs	https://doi.org/10.3390/s23083809
Publication status	Published - Apr 2023
Externally published	Yes

Keywords

deep learning
ensemble
gait
gait recognition

Access to Document

10.3390/s23083809

Cite this

@article{9de371ae2c2743509f89136ead77e632,

title = "Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer",

abstract = "Gait recognition, the task of identifying an individual based on their unique walking style, can be difficult because walking styles can be influenced by external factors such as clothing, viewing angle, and carrying conditions. To address these challenges, this paper proposes a multi-model gait recognition system that integrates Convolutional Neural Networks (CNNs) and Vision Transformer. The first step in the process is to obtain a gait energy image, which is achieved by applying an averaging technique to a gait cycle. The gait energy image is then fed into three different models, DenseNet-201, VGG-16, and a Vision Transformer. These models are pre-trained and fine-tuned to encode the salient gait features that are specific to an individual{\textquoteright}s walking style. Each model provides prediction scores for the classes based on the encoded features, and these scores are then summed and averaged to produce the final class label. The performance of this multi-model gait recognition system was evaluated on three datasets, CASIA-B, OU-ISIR dataset D, and OU-ISIR Large Population dataset. The experimental results showed substantial improvement compared to existing methods on all three datasets. The integration of CNNs and ViT allows the system to learn both the pre-defined and distinct features, providing a robust solution for gait recognition even under the influence of covariates.",

keywords = "deep learning, ensemble, gait, gait recognition",

author = "Mogan, {Jashila Nair} and Lee, {Chin Poo} and Lim, {Kian Ming} and Ali, Mohammed and Alqahtani, Ali",

note = "Publisher Copyright: {\textcopyright} 2023 by the authors.",

year = "2023",

month = apr,

doi = "10.3390/s23083809",

language = "English",

volume = "23",

journal = "Sensors",

issn = "1424-8220",

publisher = "MDPI (Basel, Switzerland)",

number = "8",

}

TY - JOUR

T1 - Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer

AU - Mogan, Jashila Nair

AU - Lee, Chin Poo

AU - Lim, Kian Ming

AU - Ali, Mohammed

AU - Alqahtani, Ali

PY - 2023/4

Y1 - 2023/4

N2 - Gait recognition, the task of identifying an individual based on their unique walking style, can be difficult because walking styles can be influenced by external factors such as clothing, viewing angle, and carrying conditions. To address these challenges, this paper proposes a multi-model gait recognition system that integrates Convolutional Neural Networks (CNNs) and Vision Transformer. The first step in the process is to obtain a gait energy image, which is achieved by applying an averaging technique to a gait cycle. The gait energy image is then fed into three different models, DenseNet-201, VGG-16, and a Vision Transformer. These models are pre-trained and fine-tuned to encode the salient gait features that are specific to an individual’s walking style. Each model provides prediction scores for the classes based on the encoded features, and these scores are then summed and averaged to produce the final class label. The performance of this multi-model gait recognition system was evaluated on three datasets, CASIA-B, OU-ISIR dataset D, and OU-ISIR Large Population dataset. The experimental results showed substantial improvement compared to existing methods on all three datasets. The integration of CNNs and ViT allows the system to learn both the pre-defined and distinct features, providing a robust solution for gait recognition even under the influence of covariates.

KW - deep learning

KW - ensemble

KW - gait

KW - gait recognition

UR - http://www.scopus.com/inward/record.url?scp=85153935325&partnerID=8YFLogxK

U2 - 10.3390/s23083809

DO - 10.3390/s23083809

M3 - Article

C2 - 37112147

AN - SCOPUS:85153935325

SN - 1424-8220

VL - 23

JO - Sensors

JF - Sensors

IS - 8

M1 - 3809

ER -
