A visual attention based convolutional neural network for image classification

Yaran Chen, Dongbin Zhao, Le Lv, Chengdong Li

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

20 Citations (Scopus)

Abstract

This paper presents a visual attention based convolutional neural network (CNN) to solve the image classification problem in the real complex world scene. The presented method can simulate the process of recognizing objects and find the area of interest which is related with the task. Compared with the CNN method in image classification, the model is proficient in fine-grained classification problem and has a better robustness due to its mechanism of multi-glance and visual attention. We evaluate the model on vehicle dataset, where its performance exceeds CNN baseline on image classification.

Original languageEnglish
Title of host publicationProceedings of the 2016 12th World Congress on Intelligent Control and Automation, WCICA 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages764-769
Number of pages6
ISBN (Electronic)9781467384148
DOIs
Publication statusPublished - 27 Sept 2016
Externally publishedYes
Event12th World Congress on Intelligent Control and Automation, WCICA 2016 - Guilin, China
Duration: 12 Jun 201615 Jun 2016

Publication series

NameProceedings of the World Congress on Intelligent Control and Automation (WCICA)
Volume2016-September

Conference

Conference12th World Congress on Intelligent Control and Automation, WCICA 2016
Country/TerritoryChina
CityGuilin
Period12/06/1615/06/16

Fingerprint

Dive into the research topics of 'A visual attention based convolutional neural network for image classification'. Together they form a unique fingerprint.

Cite this