How many labeled license plates are needed?

Changhao Wu, Shugong Xu*, Guocong Song, Shunqing Zhang

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

26 Citations (Scopus)

Abstract

Training a good deep learning model often requires a lot of annotated data. As a large amount of labeled data is typically difficult to collect and even more difficult to annotate, data augmentation and data generation are widely used in the process of training deep neural networks. However, there is no clear common understanding on how much labeled data is needed to get satisfactory performance. In this paper, we try to address such a question using vehicle license plate character recognition as an example application. We apply computer graphic scripts and Generative Adversarial Networks to generate and augment a large number of annotated, synthesized license plate images with realistic colors, fonts, and character composition from a small number of real, manually labeled license plate images. Generated and augmented data are mixed and used as training data for the license plate recognition network modified from DenseNet. The experimental results show that the model trained from the generated mixed training data has good generalization ability, and the proposed approach achieves a new state-of-the-art accuracy on Dataset-1 and AOLP, even with a very limited number of original real license plates. In addition, the accuracy improvement caused by data generation becomes more significant when the number of labeled images is reduced. Data augmentation also plays a more significant role when the number of labeled images is increased.

Original languageEnglish
Title of host publicationPattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Proceedings
EditorsXilin Chen, Jian-Huang Lai, Nanning Zheng, Cheng-Lin Liu, Tieniu Tan, Jie Zhou, Hongbin Zha
PublisherSpringer Verlag
Pages334-346
Number of pages13
ISBN (Print)9783030033408
DOIs
Publication statusPublished - 2018
Externally publishedYes
Event1st Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2018 - Guangzhou, China
Duration: 23 Nov 201826 Nov 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11259 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2018
Country/TerritoryChina
CityGuangzhou
Period23/11/1826/11/18

Keywords

  • Data augmentation
  • GANs
  • License plate recognition

Fingerprint

Dive into the research topics of 'How many labeled license plates are needed?'. Together they form a unique fingerprint.

Cite this