How many labeled license plates are needed?

Changhao Wu; Shugong Xu; Guocong Song; Shunqing Zhang

doi:10.1007/978-3-030-03341-5_28

How many labeled license plates are needed?

Changhao Wu, Shugong Xu^*, Guocong Song, Shunqing Zhang

^*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

26 Citations (Scopus)

Abstract

Training a good deep learning model often requires a lot of annotated data. As a large amount of labeled data is typically difficult to collect and even more difficult to annotate, data augmentation and data generation are widely used in the process of training deep neural networks. However, there is no clear common understanding on how much labeled data is needed to get satisfactory performance. In this paper, we try to address such a question using vehicle license plate character recognition as an example application. We apply computer graphic scripts and Generative Adversarial Networks to generate and augment a large number of annotated, synthesized license plate images with realistic colors, fonts, and character composition from a small number of real, manually labeled license plate images. Generated and augmented data are mixed and used as training data for the license plate recognition network modified from DenseNet. The experimental results show that the model trained from the generated mixed training data has good generalization ability, and the proposed approach achieves a new state-of-the-art accuracy on Dataset-1 and AOLP, even with a very limited number of original real license plates. In addition, the accuracy improvement caused by data generation becomes more significant when the number of labeled images is reduced. Data augmentation also plays a more significant role when the number of labeled images is increased.

Original language	English
Title of host publication	Pattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Proceedings
Editors	Xilin Chen, Jian-Huang Lai, Nanning Zheng, Cheng-Lin Liu, Tieniu Tan, Jie Zhou, Hongbin Zha
Publisher	Springer Verlag
Pages	334-346
Number of pages	13
ISBN (Print)	9783030033408
DOIs	https://doi.org/10.1007/978-3-030-03341-5_28
Publication status	Published - 2018
Externally published	Yes
Event	1st Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2018 - Guangzhou, China Duration: 23 Nov 2018 → 26 Nov 2018

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	11259 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	1st Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2018
Country/Territory	China
City	Guangzhou
Period	23/11/18 → 26/11/18

Keywords

Data augmentation
GANs
License plate recognition

Access to Document

10.1007/978-3-030-03341-5_28

Cite this

Wu, C., Xu, S., Song, G., & Zhang, S. (2018). How many labeled license plates are needed? In X. Chen, J.-H. Lai, N. Zheng, C.-L. Liu, T. Tan, J. Zhou, & H. Zha (Eds.), Pattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Proceedings (pp. 334-346). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11259 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-030-03341-5_28

Wu, Changhao ; Xu, Shugong ; Song, Guocong et al. / How many labeled license plates are needed?. Pattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Proceedings. editor / Xilin Chen ; Jian-Huang Lai ; Nanning Zheng ; Cheng-Lin Liu ; Tieniu Tan ; Jie Zhou ; Hongbin Zha. Springer Verlag, 2018. pp. 334-346 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{078f559c680340a2852f91c5e920d57d,

title = "How many labeled license plates are needed?",

abstract = "Training a good deep learning model often requires a lot of annotated data. As a large amount of labeled data is typically difficult to collect and even more difficult to annotate, data augmentation and data generation are widely used in the process of training deep neural networks. However, there is no clear common understanding on how much labeled data is needed to get satisfactory performance. In this paper, we try to address such a question using vehicle license plate character recognition as an example application. We apply computer graphic scripts and Generative Adversarial Networks to generate and augment a large number of annotated, synthesized license plate images with realistic colors, fonts, and character composition from a small number of real, manually labeled license plate images. Generated and augmented data are mixed and used as training data for the license plate recognition network modified from DenseNet. The experimental results show that the model trained from the generated mixed training data has good generalization ability, and the proposed approach achieves a new state-of-the-art accuracy on Dataset-1 and AOLP, even with a very limited number of original real license plates. In addition, the accuracy improvement caused by data generation becomes more significant when the number of labeled images is reduced. Data augmentation also plays a more significant role when the number of labeled images is increased.",

keywords = "Data augmentation, GANs, License plate recognition",

author = "Changhao Wu and Shugong Xu and Guocong Song and Shunqing Zhang",

note = "Publisher Copyright: {\textcopyright} Springer Nature Switzerland AG 2018.; 1st Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2018 ; Conference date: 23-11-2018 Through 26-11-2018",

year = "2018",

doi = "10.1007/978-3-030-03341-5_28",

language = "English",

isbn = "9783030033408",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "334--346",

editor = "Xilin Chen and Jian-Huang Lai and Nanning Zheng and Cheng-Lin Liu and Tieniu Tan and Jie Zhou and Hongbin Zha",

booktitle = "Pattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Proceedings",

}

Wu, C, Xu, S, Song, G & Zhang, S 2018, How many labeled license plates are needed? in X Chen, J-H Lai, N Zheng, C-L Liu, T Tan, J Zhou & H Zha (eds), Pattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11259 LNCS, Springer Verlag, pp. 334-346, 1st Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2018, Guangzhou, China, 23/11/18. https://doi.org/10.1007/978-3-030-03341-5_28

How many labeled license plates are needed? / Wu, Changhao; Xu, Shugong; Song, Guocong et al.
Pattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Proceedings. ed. / Xilin Chen; Jian-Huang Lai; Nanning Zheng; Cheng-Lin Liu; Tieniu Tan; Jie Zhou; Hongbin Zha. Springer Verlag, 2018. p. 334-346 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11259 LNCS).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review