A Survey on Artificial Intelligence in Chinese Sign Language Recognition

Xianwei Jiang; Suresh Chandra Satapathy; Longxiang Yang; Shui Hua Wang; Yu Dong Zhang

doi:10.1007/s13369-020-04758-2

A Survey on Artificial Intelligence in Chinese Sign Language Recognition

Xianwei Jiang, Suresh Chandra Satapathy, Longxiang Yang, Shui Hua Wang^*, Yu Dong Zhang^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

52 Citations (Scopus)

Abstract

Chinese Sign Language (CSL) offers the main means of communication for the hearing impaired in China. Sign Language Recognition (SLR) can shorten the distance between the hearing-impaired and healthy people and help them integrate into the society. Therefore, SLR has become the focus of sign language application research. Over the years, the continuous development of new technologies provides a source and motivation for SLR. This paper aims to cover the most recent approaches in Chinese Sign Language Recognition (CSLR). With a thorough review of superior methods from 2000 to 2019 in CSLR researches, various techniques and algorithms such as scale-invariant feature transform, histogram of oriented gradients, wavelet entropy, Hu moment invariant, Fourier descriptor, gray-level co-occurrence matrix, dynamic time warping, principal component analysis, autoencoder, hidden Markov model (HMM), support vector machine (SVM), random forest, skin color modeling method, k-NN, artificial neural network, convolutional neural network (CNN), and transfer learning are discussed in detail, which are based on several major stages, that is, data acquisition, preprocessing, feature extraction, and classification. CSLR was summarized from some aspect as follows: methods of classification and feature extraction, accuracy/performance evaluation, and sample size/datasets. The advantages and limitations of different CSLR approaches were compared. It was found that data acquisition is mainly through Kinect and camera, and the feature extraction focuses on hand’s shape and spatiotemporal factors, but ignoring facial expressions. HMM and SVM are used most in the classification. CNN is becoming more and more popular, and a deep neural network-based recognition approach will be the future trend. However, due to the complexity of the contemporary Chinese language, CSLR generally has a lower accuracy than other SLR. It is necessary to establish an appropriate dataset to conduct comparable experiments. The issue of decreasing accuracy as the dataset increases needs to resolve. Overall, our study is hoped to give a comprehensive presentation for those people who are interested in CSLR and SLR and to further contribute to the future research.

Original language	English
Pages (from-to)	9859-9894
Number of pages	36
Journal	Arabian Journal for Science and Engineering
Volume	45
Issue number	12
DOIs	https://doi.org/10.1007/s13369-020-04758-2
Publication status	Published - Dec 2020
Externally published	Yes

Keywords

Artificial intelligence
Chinese Sign Language
Classification
Deep neural network
Feature extraction
Fingerspelling recognition
Gesture recognition
Machine learning

Access to Document

10.1007/s13369-020-04758-2

Cite this

@article{4a9cd6b9546f4e738be9ad09a1c28d38,

title = "A Survey on Artificial Intelligence in Chinese Sign Language Recognition",

abstract = "Chinese Sign Language (CSL) offers the main means of communication for the hearing impaired in China. Sign Language Recognition (SLR) can shorten the distance between the hearing-impaired and healthy people and help them integrate into the society. Therefore, SLR has become the focus of sign language application research. Over the years, the continuous development of new technologies provides a source and motivation for SLR. This paper aims to cover the most recent approaches in Chinese Sign Language Recognition (CSLR). With a thorough review of superior methods from 2000 to 2019 in CSLR researches, various techniques and algorithms such as scale-invariant feature transform, histogram of oriented gradients, wavelet entropy, Hu moment invariant, Fourier descriptor, gray-level co-occurrence matrix, dynamic time warping, principal component analysis, autoencoder, hidden Markov model (HMM), support vector machine (SVM), random forest, skin color modeling method, k-NN, artificial neural network, convolutional neural network (CNN), and transfer learning are discussed in detail, which are based on several major stages, that is, data acquisition, preprocessing, feature extraction, and classification. CSLR was summarized from some aspect as follows: methods of classification and feature extraction, accuracy/performance evaluation, and sample size/datasets. The advantages and limitations of different CSLR approaches were compared. It was found that data acquisition is mainly through Kinect and camera, and the feature extraction focuses on hand{\textquoteright}s shape and spatiotemporal factors, but ignoring facial expressions. HMM and SVM are used most in the classification. CNN is becoming more and more popular, and a deep neural network-based recognition approach will be the future trend. However, due to the complexity of the contemporary Chinese language, CSLR generally has a lower accuracy than other SLR. It is necessary to establish an appropriate dataset to conduct comparable experiments. The issue of decreasing accuracy as the dataset increases needs to resolve. Overall, our study is hoped to give a comprehensive presentation for those people who are interested in CSLR and SLR and to further contribute to the future research.",

keywords = "Artificial intelligence, Chinese Sign Language, Classification, Deep neural network, Feature extraction, Fingerspelling recognition, Gesture recognition, Machine learning",

author = "Xianwei Jiang and Satapathy, {Suresh Chandra} and Longxiang Yang and Wang, {Shui Hua} and Zhang, {Yu Dong}",

note = "Publisher Copyright: {\textcopyright} 2020, King Fahd University of Petroleum & Minerals.",

year = "2020",

month = dec,

doi = "10.1007/s13369-020-04758-2",

language = "English",

volume = "45",

pages = "9859--9894",

journal = "Arabian Journal for Science and Engineering",

issn = "2193-567X",

number = "12",

}

TY - JOUR

T1 - A Survey on Artificial Intelligence in Chinese Sign Language Recognition

AU - Jiang, Xianwei

AU - Satapathy, Suresh Chandra

AU - Yang, Longxiang

AU - Wang, Shui Hua

AU - Zhang, Yu Dong

PY - 2020/12

Y1 - 2020/12

N2 - Chinese Sign Language (CSL) offers the main means of communication for the hearing impaired in China. Sign Language Recognition (SLR) can shorten the distance between the hearing-impaired and healthy people and help them integrate into the society. Therefore, SLR has become the focus of sign language application research. Over the years, the continuous development of new technologies provides a source and motivation for SLR. This paper aims to cover the most recent approaches in Chinese Sign Language Recognition (CSLR). With a thorough review of superior methods from 2000 to 2019 in CSLR researches, various techniques and algorithms such as scale-invariant feature transform, histogram of oriented gradients, wavelet entropy, Hu moment invariant, Fourier descriptor, gray-level co-occurrence matrix, dynamic time warping, principal component analysis, autoencoder, hidden Markov model (HMM), support vector machine (SVM), random forest, skin color modeling method, k-NN, artificial neural network, convolutional neural network (CNN), and transfer learning are discussed in detail, which are based on several major stages, that is, data acquisition, preprocessing, feature extraction, and classification. CSLR was summarized from some aspect as follows: methods of classification and feature extraction, accuracy/performance evaluation, and sample size/datasets. The advantages and limitations of different CSLR approaches were compared. It was found that data acquisition is mainly through Kinect and camera, and the feature extraction focuses on hand’s shape and spatiotemporal factors, but ignoring facial expressions. HMM and SVM are used most in the classification. CNN is becoming more and more popular, and a deep neural network-based recognition approach will be the future trend. However, due to the complexity of the contemporary Chinese language, CSLR generally has a lower accuracy than other SLR. It is necessary to establish an appropriate dataset to conduct comparable experiments. The issue of decreasing accuracy as the dataset increases needs to resolve. Overall, our study is hoped to give a comprehensive presentation for those people who are interested in CSLR and SLR and to further contribute to the future research.

AB - Chinese Sign Language (CSL) offers the main means of communication for the hearing impaired in China. Sign Language Recognition (SLR) can shorten the distance between the hearing-impaired and healthy people and help them integrate into the society. Therefore, SLR has become the focus of sign language application research. Over the years, the continuous development of new technologies provides a source and motivation for SLR. This paper aims to cover the most recent approaches in Chinese Sign Language Recognition (CSLR). With a thorough review of superior methods from 2000 to 2019 in CSLR researches, various techniques and algorithms such as scale-invariant feature transform, histogram of oriented gradients, wavelet entropy, Hu moment invariant, Fourier descriptor, gray-level co-occurrence matrix, dynamic time warping, principal component analysis, autoencoder, hidden Markov model (HMM), support vector machine (SVM), random forest, skin color modeling method, k-NN, artificial neural network, convolutional neural network (CNN), and transfer learning are discussed in detail, which are based on several major stages, that is, data acquisition, preprocessing, feature extraction, and classification. CSLR was summarized from some aspect as follows: methods of classification and feature extraction, accuracy/performance evaluation, and sample size/datasets. The advantages and limitations of different CSLR approaches were compared. It was found that data acquisition is mainly through Kinect and camera, and the feature extraction focuses on hand’s shape and spatiotemporal factors, but ignoring facial expressions. HMM and SVM are used most in the classification. CNN is becoming more and more popular, and a deep neural network-based recognition approach will be the future trend. However, due to the complexity of the contemporary Chinese language, CSLR generally has a lower accuracy than other SLR. It is necessary to establish an appropriate dataset to conduct comparable experiments. The issue of decreasing accuracy as the dataset increases needs to resolve. Overall, our study is hoped to give a comprehensive presentation for those people who are interested in CSLR and SLR and to further contribute to the future research.

KW - Artificial intelligence

KW - Chinese Sign Language

KW - Classification

KW - Deep neural network

KW - Feature extraction

KW - Fingerspelling recognition

KW - Gesture recognition

KW - Machine learning

UR - http://www.scopus.com/inward/record.url?scp=85088945160&partnerID=8YFLogxK

U2 - 10.1007/s13369-020-04758-2

DO - 10.1007/s13369-020-04758-2

M3 - Article

AN - SCOPUS:85088945160

SN - 2193-567X

VL - 45

SP - 9859

EP - 9894

JO - Arabian Journal for Science and Engineering

JF - Arabian Journal for Science and Engineering

IS - 12

ER -

A Survey on Artificial Intelligence in Chinese Sign Language Recognition

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this