Adversarial Rectification Network for Scene Text Regularization

Jing Li; Qiu Feng Wang; Rui Zhang; Kaizhu Huang

doi:10.1007/978-3-030-63833-7_13

Jing Li, Qiu Feng Wang^*, Rui Zhang, Kaizhu Huang

^*Corresponding author for this work

Xi'an Jiaotong-Liverpool University

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

3 Citations (Scopus)

Abstract

Scene text recognition with irregular layouts is a challenging yet important problem in computer vision. One widely used method is to employ a rectification network before the recognition stage. However, most previous rectification methods either did not consider recognition information or were integrated into end-to-end recognition models without considering rectification explicitly. To overcome this issue, we propose an adversarial learning-based rectification network that integrates transformation (from irregular texts to regular texts) with recognition information into a unified framework. In this framework, we optimize the rectification network with an extended Generative Adversarial Network that competes between rectifier and discriminator, together with the results of a recognizer. To evaluate the rectification performance, we generated a regular-irregular pair set from the benchmark datasets, and experimental results show that the proposed method can achieve significant improvement on the rectification performance with comparable recognition performance. Specifically, the PSNR and SSIM are improved by 0.81 and 0.051, respectively, which demonstrates its effectiveness.
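For readers unfamiliar with the first of the two reported metrics, the following is a minimal sketch of how PSNR is conventionally computed between a reference (regular) image and a rectified image. It is not the authors' evaluation code: the function name `psnr`, the 8-bit peak value of 255, and the toy pixel lists are assumptions made for illustration, and this record does not describe the paper's exact evaluation protocol.

```python
import math


def psnr(reference, candidate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized pixel sequences.

    Assumes flattened grayscale pixels and an 8-bit peak value by default
    (illustrative assumptions, not taken from the paper).
    """
    if len(reference) != len(candidate) or not reference:
        raise ValueError("inputs must be non-empty and the same length")
    # Mean squared error over all pixels.
    mse = sum((r - c) ** 2 for r, c in zip(reference, candidate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images have unbounded PSNR
    return 10.0 * math.log10(max_value ** 2 / mse)


# Hypothetical toy data: a "regular" reference and a slightly different rectified output.
regular = [0, 128, 255, 64]
rectified = [0, 130, 250, 64]
print(f"PSNR: {psnr(regular, rectified):.2f} dB")
```

A higher PSNR means the rectified image is closer, pixel by pixel, to the regular reference, which is why it can be used to score rectification on a regular-irregular pair set.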

Original language	English
Title of host publication	Neural Information Processing - 27th International Conference, ICONIP 2020, Proceedings
Editors	Haiqin Yang, Kitsuchart Pasupa, Andrew Chi-Sing Leung, James T. Kwok, Jonathan H. Chan, Irwin King
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	152-163
Number of pages	12
ISBN (Print)	9783030638320
DOIs	https://doi.org/10.1007/978-3-030-63833-7_13
Publication status	Published - 2020
Event	27th International Conference on Neural Information Processing, ICONIP 2020, Bangkok, Thailand, 18 Nov 2020 → 22 Nov 2020

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	12533 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	27th International Conference on Neural Information Processing, ICONIP 2020
Country/Territory	Thailand
City	Bangkok
Period	18/11/20 → 22/11/20

Keywords

Generative adversarial networks
Irregular text
Rectification network
Scene text recognition

Cite this

Li, J., Wang, Q. F., Zhang, R., & Huang, K. (2020). Adversarial Rectification Network for Scene Text Regularization. In H. Yang, K. Pasupa, A. C.-S. Leung, J. T. Kwok, J. H. Chan, & I. King (Eds.), Neural Information Processing - 27th International Conference, ICONIP 2020, Proceedings (pp. 152-163). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12533 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-63833-7_13

Li, Jing ; Wang, Qiu Feng ; Zhang, Rui et al. / Adversarial Rectification Network for Scene Text Regularization. Neural Information Processing - 27th International Conference, ICONIP 2020, Proceedings. editor / Haiqin Yang ; Kitsuchart Pasupa ; Andrew Chi-Sing Leung ; James T. Kwok ; Jonathan H. Chan ; Irwin King. Springer Science and Business Media Deutschland GmbH, 2020. pp. 152-163 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{629531c5decc462996ad7b43efcbaea0,

title = "Adversarial Rectification Network for Scene Text Regularization",

abstract = "Scene text recognition with irregular layouts is a challenging yet important problem in computer vision. One widely used method is to employ a rectification network before the recognition stage. However, most previous rectification methods either did not consider recognition information or were integrated into end-to-end recognition models without considering rectification explicitly. To overcome this issue, we propose an adversarial learning-based rectification network that integrates transformation (from irregular texts to regular texts) with recognition information into a unified framework. In this framework, we optimize the rectification network with an extended Generative Adversarial Network that competes between rectifier and discriminator, together with the results of a recognizer. To evaluate the rectification performance, we generated a regular-irregular pair set from the benchmark datasets, and experimental results show that the proposed method can achieve significant improvement on the rectification performance with comparable recognition performance. Specifically, the PSNR and SSIM are improved by 0.81 and 0.051, respectively, which demonstrates its effectiveness.",

keywords = "Generative adversarial networks, Irregular text, Rectification network, Scene text recognition",

author = "Jing Li and Wang, {Qiu Feng} and Rui Zhang and Kaizhu Huang",

note = "Publisher Copyright: {\textcopyright} 2020, Springer Nature Switzerland AG.; 27th International Conference on Neural Information Processing, ICONIP 2020 ; Conference date: 18-11-2020 Through 22-11-2020",

year = "2020",

doi = "10.1007/978-3-030-63833-7_13",

language = "English",

isbn = "9783030638320",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

volume = "12533",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "152--163",

editor = "Haiqin Yang and Kitsuchart Pasupa and Leung, {Andrew Chi-Sing} and Kwok, {James T.} and Chan, {Jonathan H.} and Irwin King",

booktitle = "Neural Information Processing - 27th International Conference, ICONIP 2020, Proceedings",

}

Li, J, Wang, QF, Zhang, R & Huang, K 2020, Adversarial Rectification Network for Scene Text Regularization. in H Yang, K Pasupa, AC-S Leung, JT Kwok, JH Chan & I King (eds), Neural Information Processing - 27th International Conference, ICONIP 2020, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12533 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 152-163, 27th International Conference on Neural Information Processing, ICONIP 2020, Bangkok, Thailand, 18/11/20. https://doi.org/10.1007/978-3-030-63833-7_13

Adversarial Rectification Network for Scene Text Regularization. / Li, Jing; Wang, Qiu Feng; Zhang, Rui et al.
Neural Information Processing - 27th International Conference, ICONIP 2020, Proceedings. ed. / Haiqin Yang; Kitsuchart Pasupa; Andrew Chi-Sing Leung; James T. Kwok; Jonathan H. Chan; Irwin King. Springer Science and Business Media Deutschland GmbH, 2020. p. 152-163 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12533 LNCS).

TY - GEN

T1 - Adversarial Rectification Network for Scene Text Regularization

AU - Li, Jing

AU - Wang, Qiu Feng

AU - Zhang, Rui

AU - Huang, Kaizhu

PY - 2020

Y1 - 2020

N2 - Scene text recognition with irregular layouts is a challenging yet important problem in computer vision. One widely used method is to employ a rectification network before the recognition stage. However, most previous rectification methods either did not consider recognition information or were integrated into end-to-end recognition models without considering rectification explicitly. To overcome this issue, we propose an adversarial learning-based rectification network that integrates transformation (from irregular texts to regular texts) with recognition information into a unified framework. In this framework, we optimize the rectification network with an extended Generative Adversarial Network that competes between rectifier and discriminator, together with the results of a recognizer. To evaluate the rectification performance, we generated a regular-irregular pair set from the benchmark datasets, and experimental results show that the proposed method can achieve significant improvement on the rectification performance with comparable recognition performance. Specifically, the PSNR and SSIM are improved by 0.81 and 0.051, respectively, which demonstrates its effectiveness.

AB - Scene text recognition with irregular layouts is a challenging yet important problem in computer vision. One widely used method is to employ a rectification network before the recognition stage. However, most previous rectification methods either did not consider recognition information or were integrated into end-to-end recognition models without considering rectification explicitly. To overcome this issue, we propose an adversarial learning-based rectification network that integrates transformation (from irregular texts to regular texts) with recognition information into a unified framework. In this framework, we optimize the rectification network with an extended Generative Adversarial Network that competes between rectifier and discriminator, together with the results of a recognizer. To evaluate the rectification performance, we generated a regular-irregular pair set from the benchmark datasets, and experimental results show that the proposed method can achieve significant improvement on the rectification performance with comparable recognition performance. Specifically, the PSNR and SSIM are improved by 0.81 and 0.051, respectively, which demonstrates its effectiveness.

KW - Generative adversarial networks

KW - Irregular text

KW - Rectification network

KW - Scene text recognition

UR - http://www.scopus.com/inward/record.url?scp=85097386316&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-63833-7_13

DO - 10.1007/978-3-030-63833-7_13

M3 - Conference Proceeding

AN - SCOPUS:85097386316

SN - 9783030638320

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 152

EP - 163

BT - Neural Information Processing - 27th International Conference, ICONIP 2020, Proceedings

A2 - Yang, Haiqin

A2 - Pasupa, Kitsuchart

A2 - Leung, Andrew Chi-Sing

A2 - Kwok, James T.

A2 - Chan, Jonathan H.

A2 - King, Irwin

PB - Springer Science and Business Media Deutschland GmbH

T2 - 27th International Conference on Neural Information Processing, ICONIP 2020

Y2 - 18 November 2020 through 22 November 2020

ER -

Li J, Wang QF, Zhang R, Huang K. Adversarial Rectification Network for Scene Text Regularization. In Yang H, Pasupa K, Leung ACS, Kwok JT, Chan JH, King I, editors, Neural Information Processing - 27th International Conference, ICONIP 2020, Proceedings. Springer Science and Business Media Deutschland GmbH. 2020. p. 152-163. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-63833-7_13