Robust Text Image Recognition via Adversarial Sequence-to-Sequence Domain Adaptation

Yaping Zhang, Shuai Nie, Shan Liang, Wenju Liu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

25 Citations (Scopus)

Abstract

Robust text reading is a very challenging problem, due to the distribution of text images changing significantly in real-world scenarios. One effective solution is to align the distribution between different domains by domain adaptation methods. However, we found that these methods might struggle when dealing sequence-like text images. An important reason is that conventional domain adaptation methods strive to align images as a whole, while text images consist of variable-length fine-grained character information. To address this issue, we propose a novel Adversarial Sequence-to-Sequence Domain Adaptation (ASSDA) method to learn 'where to adapt' and 'how to align' the sequential image. Our key idea is to mine the local regions that contain characters, and focus on aligning them across domains in an adversarial manner. Extensive text recognition experiments show the ASSDA could efficiently transfer sequence knowledge and validate the promising power towards the various domain shift in the real world applications.

Original languageEnglish
Article number9384298
Pages (from-to)3922-3933
Number of pages12
JournalIEEE Transactions on Image Processing
Volume30
DOIs
Publication statusPublished - 2021
Externally publishedYes

Keywords

  • domain adaptation
  • Sequence-to-sequence
  • text image recognition

Fingerprint

Dive into the research topics of 'Robust Text Image Recognition via Adversarial Sequence-to-Sequence Domain Adaptation'. Together they form a unique fingerprint.

Cite this