Driving posture recognition by convolutional neural networks

Chao Yan; Frans Coenen; Bailing Zhang

doi:10.1049/iet-cvi.2015.0175

Driving posture recognition by convolutional neural networks

Chao Yan^*, Frans Coenen, Bailing Zhang

^*Corresponding author for this work

School of Advanced Technology

Research output: Contribution to journal › Article › peer-review

128 Citations (Scopus)

Abstract

Driver fatigue and inattention have long been recognised as the main contributing factors in traffic accidents. This study presents a novel system which applies convolutional neural network (CNN) to automatically learn and predict pre-defined driving postures. The main idea is to monitor driver hand position with discriminative information extracted to predict safe/unsafe driving posture. In comparison to previous approaches, CNNs can automatically learn discriminative features directly from raw images. In the authors' works, a CNN model was first pre-trained by an unsupervised feature learning method called sparse filtering, and subsequently fine-tuned with classification. The approach was verified using the Southeast University driving posture dataset, which comprised of video clips covering four driving postures, including normal driving, responding to a cell phone call, eating, and smoking. Compared with other popular approaches with different image descriptors and classification methods, the authors' scheme achieves the best performance with an overall accuracy of 99.78%. To evaluate the effectiveness and generalisation performance in more realistic conditions, the method was further tested using other two specially designed datasets which takes into account of the poor illuminations and different road conditions, achieving an overall accuracy of 99.3 and 95.77%, respectively.

Original language	English
Pages (from-to)	103-114
Number of pages	12
Journal	IET Computer Vision
Volume	10
Issue number	2
DOIs	https://doi.org/10.1049/iet-cvi.2015.0175
Publication status	Published - 1 Mar 2016

Access to Document

10.1049/iet-cvi.2015.0175

Cite this

@article{4d94e00261d9438f9d78a8efd3b6f555,

title = "Driving posture recognition by convolutional neural networks",

abstract = "Driver fatigue and inattention have long been recognised as the main contributing factors in traffic accidents. This study presents a novel system which applies convolutional neural network (CNN) to automatically learn and predict pre-defined driving postures. The main idea is to monitor driver hand position with discriminative information extracted to predict safe/unsafe driving posture. In comparison to previous approaches, CNNs can automatically learn discriminative features directly from raw images. In the authors' works, a CNN model was first pre-trained by an unsupervised feature learning method called sparse filtering, and subsequently fine-tuned with classification. The approach was verified using the Southeast University driving posture dataset, which comprised of video clips covering four driving postures, including normal driving, responding to a cell phone call, eating, and smoking. Compared with other popular approaches with different image descriptors and classification methods, the authors' scheme achieves the best performance with an overall accuracy of 99.78%. To evaluate the effectiveness and generalisation performance in more realistic conditions, the method was further tested using other two specially designed datasets which takes into account of the poor illuminations and different road conditions, achieving an overall accuracy of 99.3 and 95.77%, respectively.",

author = "Chao Yan and Frans Coenen and Bailing Zhang",

note = "Publisher Copyright: {\textcopyright} The Institution of Engineering and Technology 2016.",

year = "2016",

month = mar,

day = "1",

doi = "10.1049/iet-cvi.2015.0175",

language = "English",

volume = "10",

pages = "103--114",

journal = "IET Computer Vision",

issn = "1751-9632",

number = "2",

}

TY - JOUR

T1 - Driving posture recognition by convolutional neural networks

AU - Yan, Chao

AU - Coenen, Frans

AU - Zhang, Bailing

N1 - Publisher Copyright: © The Institution of Engineering and Technology 2016.

PY - 2016/3/1

Y1 - 2016/3/1

N2 - Driver fatigue and inattention have long been recognised as the main contributing factors in traffic accidents. This study presents a novel system which applies convolutional neural network (CNN) to automatically learn and predict pre-defined driving postures. The main idea is to monitor driver hand position with discriminative information extracted to predict safe/unsafe driving posture. In comparison to previous approaches, CNNs can automatically learn discriminative features directly from raw images. In the authors' works, a CNN model was first pre-trained by an unsupervised feature learning method called sparse filtering, and subsequently fine-tuned with classification. The approach was verified using the Southeast University driving posture dataset, which comprised of video clips covering four driving postures, including normal driving, responding to a cell phone call, eating, and smoking. Compared with other popular approaches with different image descriptors and classification methods, the authors' scheme achieves the best performance with an overall accuracy of 99.78%. To evaluate the effectiveness and generalisation performance in more realistic conditions, the method was further tested using other two specially designed datasets which takes into account of the poor illuminations and different road conditions, achieving an overall accuracy of 99.3 and 95.77%, respectively.

AB - Driver fatigue and inattention have long been recognised as the main contributing factors in traffic accidents. This study presents a novel system which applies convolutional neural network (CNN) to automatically learn and predict pre-defined driving postures. The main idea is to monitor driver hand position with discriminative information extracted to predict safe/unsafe driving posture. In comparison to previous approaches, CNNs can automatically learn discriminative features directly from raw images. In the authors' works, a CNN model was first pre-trained by an unsupervised feature learning method called sparse filtering, and subsequently fine-tuned with classification. The approach was verified using the Southeast University driving posture dataset, which comprised of video clips covering four driving postures, including normal driving, responding to a cell phone call, eating, and smoking. Compared with other popular approaches with different image descriptors and classification methods, the authors' scheme achieves the best performance with an overall accuracy of 99.78%. To evaluate the effectiveness and generalisation performance in more realistic conditions, the method was further tested using other two specially designed datasets which takes into account of the poor illuminations and different road conditions, achieving an overall accuracy of 99.3 and 95.77%, respectively.

UR - http://www.scopus.com/inward/record.url?scp=84959043138&partnerID=8YFLogxK

U2 - 10.1049/iet-cvi.2015.0175

DO - 10.1049/iet-cvi.2015.0175

M3 - Article

AN - SCOPUS:84959043138

SN - 1751-9632

VL - 10

SP - 103

EP - 114

JO - IET Computer Vision

JF - IET Computer Vision

IS - 2

ER -

Driving posture recognition by convolutional neural networks

Abstract

Access to Document

Other files and links

Fingerprint

Cite this