TY - GEN
T1 - Improving irregular text recognition by integrating gabor convolutional network
AU - Guo, Zhaohong
AU - Xu, Hui
AU - Lu, Feng
AU - Wang, Qiufeng
AU - Zhou, Xiangdong
AU - Shi, Yu
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - Scene text, especially irregular text, is difficult to recognize due to the arbitrary-oriented characters and irregular arrangement. Most existing methods address the irregular text by rectifying it into a regular one, which achieve good performance. However, these methods are possible to remove character information in some curved texts. To overcome this issue, we focus on extracting features that are robust to orientation changes instead of rectifying. In this work, we propose an end-to-end trainable model that combines a Gabor Convolutional Network (GCN) and a Sequence Recognition Network (SRN). The GCN is capable of extracting more robust features against the orientation, which is produced by incorporating Gabor filters of different orientations into Convolutional Neural Network (CNN). The SRN is an attention-based sequence-to-sequence model that sequentially outputs characters from the robust features. We evaluate the recognition accuracy of the proposed method on various benchmark datasets of scene text, including both regular and irregular texts. The extensive experimental results show that our proposed method achieves the state-of-the-art recognition performance on most of the irregular benchmarks as well as a regular benchmark.
AB - Scene text, especially irregular text, is difficult to recognize due to the arbitrary-oriented characters and irregular arrangement. Most existing methods address the irregular text by rectifying it into a regular one, which achieve good performance. However, these methods are possible to remove character information in some curved texts. To overcome this issue, we focus on extracting features that are robust to orientation changes instead of rectifying. In this work, we propose an end-to-end trainable model that combines a Gabor Convolutional Network (GCN) and a Sequence Recognition Network (SRN). The GCN is capable of extracting more robust features against the orientation, which is produced by incorporating Gabor filters of different orientations into Convolutional Neural Network (CNN). The SRN is an attention-based sequence-to-sequence model that sequentially outputs characters from the robust features. We evaluate the recognition accuracy of the proposed method on various benchmark datasets of scene text, including both regular and irregular texts. The extensive experimental results show that our proposed method achieves the state-of-the-art recognition performance on most of the irregular benchmarks as well as a regular benchmark.
KW - Arbitrary-Oriented Character
KW - Gabor Filter
KW - Irregular Text Recognition
KW - Sequence to Sequence Learning
UR - http://www.scopus.com/inward/record.url?scp=85081079568&partnerID=8YFLogxK
U2 - 10.1109/ICTAI.2019.00048
DO - 10.1109/ICTAI.2019.00048
M3 - Conference Proceeding
AN - SCOPUS:85081079568
T3 - Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI
SP - 286
EP - 293
BT - Proceedings - IEEE 31st International Conference on Tools with Artificial Intelligence, ICTAI 2019
PB - IEEE Computer Society
T2 - 31st IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2019
Y2 - 4 November 2019 through 6 November 2019
ER -