Isolated sign language recognition using Convolutional Neural Network hand modelling and Hand Energy Image

Kian Ming Lim; Alan Wee Chiat Tan; Chin Poo Lee; Shing Chiang Tan

doi:10.1007/s11042-019-7263-7

Isolated sign language recognition using Convolutional Neural Network hand modelling and Hand Energy Image

Kian Ming Lim^*, Alan Wee Chiat Tan, Chin Poo Lee, Shing Chiang Tan

^*Corresponding author for this work

Multimedia University

Research output: Contribution to journal › Article › peer-review

50 Citations (Scopus)

Abstract

This paper presents an isolated sign language recognition system that comprises of two main phases: hand tracking and hand representation. In the hand tracking phase, an annotated hand dataset is used to extract the hand patches to pre-train Convolutional Neural Network (CNN) hand models. The hand tracking is performed by the particle filter that combines hand motion and CNN pre-trained hand models into a joint likelihood observation model. The predicted hand position corresponds to the location of the particle with the highest joint likelihood. Based on the predicted hand position, a square hand region centered around the predicted position is segmented and serves as the input to the hand representation phase. In the hand representation phase, a compact hand representation is computed by averaging the segmented hand regions. The obtained hand representation is referred to as “Hand Energy Image (HEI)”. Quantitative and qualitative analysis show that the proposed hand tracking method is able to predict the hand positions that are closer to the ground truth. Similarly, the proposed HEI hand representation outperforms other methods in the isolated sign language recognition.

Original language	English
Pages (from-to)	19917-19944
Number of pages	28
Journal	Multimedia Tools and Applications
Volume	78
Issue number	14
DOIs	https://doi.org/10.1007/s11042-019-7263-7
Publication status	Published - 30 Jul 2019
Externally published	Yes

Keywords

Convolutional Neural Network
Hand Energy Image
Hand gesture recognition
Sign language recognition

Access to Document

10.1007/s11042-019-7263-7

Cite this

@article{13a251a93b6a40a79cb5e99fe5e9164e,

title = "Isolated sign language recognition using Convolutional Neural Network hand modelling and Hand Energy Image",

abstract = "This paper presents an isolated sign language recognition system that comprises of two main phases: hand tracking and hand representation. In the hand tracking phase, an annotated hand dataset is used to extract the hand patches to pre-train Convolutional Neural Network (CNN) hand models. The hand tracking is performed by the particle filter that combines hand motion and CNN pre-trained hand models into a joint likelihood observation model. The predicted hand position corresponds to the location of the particle with the highest joint likelihood. Based on the predicted hand position, a square hand region centered around the predicted position is segmented and serves as the input to the hand representation phase. In the hand representation phase, a compact hand representation is computed by averaging the segmented hand regions. The obtained hand representation is referred to as “Hand Energy Image (HEI)”. Quantitative and qualitative analysis show that the proposed hand tracking method is able to predict the hand positions that are closer to the ground truth. Similarly, the proposed HEI hand representation outperforms other methods in the isolated sign language recognition.",

keywords = "Convolutional Neural Network, Hand Energy Image, Hand gesture recognition, Sign language recognition",

author = "Lim, {Kian Ming} and Tan, {Alan Wee Chiat} and Lee, {Chin Poo} and Tan, {Shing Chiang}",

note = "Publisher Copyright: {\textcopyright} 2019, Springer Science+Business Media, LLC, part of Springer Nature.",

year = "2019",

month = jul,

day = "30",

doi = "10.1007/s11042-019-7263-7",

language = "English",

volume = "78",

pages = "19917--19944",

journal = "Multimedia Tools and Applications",

issn = "1380-7501",

publisher = "Springer",

number = "14",

}

TY - JOUR

T1 - Isolated sign language recognition using Convolutional Neural Network hand modelling and Hand Energy Image

AU - Lim, Kian Ming

AU - Tan, Alan Wee Chiat

AU - Lee, Chin Poo

AU - Tan, Shing Chiang

PY - 2019/7/30

Y1 - 2019/7/30

N2 - This paper presents an isolated sign language recognition system that comprises of two main phases: hand tracking and hand representation. In the hand tracking phase, an annotated hand dataset is used to extract the hand patches to pre-train Convolutional Neural Network (CNN) hand models. The hand tracking is performed by the particle filter that combines hand motion and CNN pre-trained hand models into a joint likelihood observation model. The predicted hand position corresponds to the location of the particle with the highest joint likelihood. Based on the predicted hand position, a square hand region centered around the predicted position is segmented and serves as the input to the hand representation phase. In the hand representation phase, a compact hand representation is computed by averaging the segmented hand regions. The obtained hand representation is referred to as “Hand Energy Image (HEI)”. Quantitative and qualitative analysis show that the proposed hand tracking method is able to predict the hand positions that are closer to the ground truth. Similarly, the proposed HEI hand representation outperforms other methods in the isolated sign language recognition.

AB - This paper presents an isolated sign language recognition system that comprises of two main phases: hand tracking and hand representation. In the hand tracking phase, an annotated hand dataset is used to extract the hand patches to pre-train Convolutional Neural Network (CNN) hand models. The hand tracking is performed by the particle filter that combines hand motion and CNN pre-trained hand models into a joint likelihood observation model. The predicted hand position corresponds to the location of the particle with the highest joint likelihood. Based on the predicted hand position, a square hand region centered around the predicted position is segmented and serves as the input to the hand representation phase. In the hand representation phase, a compact hand representation is computed by averaging the segmented hand regions. The obtained hand representation is referred to as “Hand Energy Image (HEI)”. Quantitative and qualitative analysis show that the proposed hand tracking method is able to predict the hand positions that are closer to the ground truth. Similarly, the proposed HEI hand representation outperforms other methods in the isolated sign language recognition.

KW - Convolutional Neural Network

KW - Hand Energy Image

KW - Hand gesture recognition

KW - Sign language recognition

UR - http://www.scopus.com/inward/record.url?scp=85062043058&partnerID=8YFLogxK

U2 - 10.1007/s11042-019-7263-7

DO - 10.1007/s11042-019-7263-7

M3 - Article

AN - SCOPUS:85062043058

SN - 1380-7501

VL - 78

SP - 19917

EP - 19944

JO - Multimedia Tools and Applications

JF - Multimedia Tools and Applications

IS - 14

ER -

Isolated sign language recognition using Convolutional Neural Network hand modelling and Hand Energy Image

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this