TY - GEN
T1 - IFR
T2 - 4th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2021
AU - Jia, Zhiwei
AU - Xu, Shugong
AU - Mu, Shiyi
AU - Tao, Yue
AU - Cao, Shan
AU - Chen, Zhiyong
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Although recent works based on deep learning have made progress in improving recognition accuracy on scene text recognition, how to handle low-quality text images in end-to-end deep networks remains a research challenge. In this paper, we propose an Iterative Fusion based Recognizer (IFR) for low quality scene text recognition, taking advantage of refined text images input and robust feature representation. IFR contains two branches which focus on scene text recognition and low quality scene text image recovery respectively. We utilize an iterative collaboration between two branches, which can effectively alleviate the impact of low quality input. A feature fusion module is proposed to strengthen the feature representation of the two branches, where the features from the Recognizer are Fused with image Restoration branch, referred to as RRF. Without changing the recognition network structure, extensive quantitative and qualitative experimental results show that the proposed method significantly outperforms the baseline methods in boosting the recognition accuracy of benchmark datasets and low resolution images in TextZoom dataset.
AB - Although recent works based on deep learning have made progress in improving recognition accuracy on scene text recognition, how to handle low-quality text images in end-to-end deep networks remains a research challenge. In this paper, we propose an Iterative Fusion based Recognizer (IFR) for low quality scene text recognition, taking advantage of refined text images input and robust feature representation. IFR contains two branches which focus on scene text recognition and low quality scene text image recovery respectively. We utilize an iterative collaboration between two branches, which can effectively alleviate the impact of low quality input. A feature fusion module is proposed to strengthen the feature representation of the two branches, where the features from the Recognizer are Fused with image Restoration branch, referred to as RRF. Without changing the recognition network structure, extensive quantitative and qualitative experimental results show that the proposed method significantly outperforms the baseline methods in boosting the recognition accuracy of benchmark datasets and low resolution images in TextZoom dataset.
KW - Feature fusion
KW - Iterative collaboration
KW - Scene text recognition
UR - http://www.scopus.com/inward/record.url?scp=85118222543&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-88007-1_15
DO - 10.1007/978-3-030-88007-1_15
M3 - Conference Proceeding
AN - SCOPUS:85118222543
SN - 9783030880064
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 180
EP - 191
BT - Pattern Recognition and Computer Vision - 4th Chinese Conference, PRCV 2021, Proceedings
A2 - Ma, Huimin
A2 - Wang, Liang
A2 - Zhang, Changshui
A2 - Wu, Fei
A2 - Tan, Tieniu
A2 - Wang, Yaonan
A2 - Lai, Jianhuang
A2 - Zhao, Yao
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 29 October 2021 through 1 November 2021
ER -