Deep reinforcement learning with visual attention for vehicle classification

Dongbin Zhao, Yaran Chen, Le Lv

Research output: Contribution to journalArticlepeer-review

176 Citations (Scopus)

Abstract

Automatic vehicle classification is crucial to intelligent transportation system, especially for vehicle-tracking by police. Due to the complex lighting and image capture conditions, image-based vehicle classification in real-world environments is still a challenging task and the performance is far from being satisfactory. However, owing to the mechanism of visual attention, the human vision system shows remarkable capability compared with the computer vision system, especially in distinguishing nuances processing. Inspired by this mechanism, we propose a convolutional neural network (CNN) model of visual attention for image classification. A visual attention-based image processing module is used to highlight one part of an image and weaken the others, generating a focused image. Then the focused image is input into the CNN to be classified. According to the classification probability distribution, we compute the information entropy to guide a reinforcement learning agent to achieve a better policy for image classification to select the key parts of an image. Systematic experiments on a surveillance-nature dataset which contains images captured by surveillance cameras in the front view, demonstrate that the proposed model is more competitive than the large-scale CNN in vehicle classification tasks.

Original languageEnglish
Article number7580631
Pages (from-to)356-367
Number of pages12
JournalIEEE Transactions on Cognitive and Developmental Systems
Volume9
Issue number4
DOIs
Publication statusPublished - Dec 2017

Keywords

  • Convolutional neural network (CNN)
  • Reinforcement learning
  • Vehicle classification
  • Visual attention

Fingerprint

Dive into the research topics of 'Deep reinforcement learning with visual attention for vehicle classification'. Together they form a unique fingerprint.

Cite this