Unsupervised language model adaptation for handwritten Chinese text recognition

Qiu-Feng Wang, Fei Yin, Cheng-Lin Liu*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

12 Citations (Scopus)

Abstract

This paper presents an effective approach to unsupervised language model adaptation (LMA) with multiple models in offline recognition of unconstrained handwritten Chinese texts. The domain of the document to recognize is variable and usually unknown a priori, so we use a two-pass recognition strategy with a pre-defined multi-domain language model set. We propose three methods to dynamically generate an adaptive language model matching the text output by first-pass recognition: model selection, model combination and model reconstruction. In model selection, we use the language model with minimum perplexity on the first-pass recognized text. In model combination, we learn the combination weights by minimizing the sum of squared errors with both L2-norm and L1-norm regularization. In model reconstruction, we use a group of orthogonal bases to reconstruct a language model, with the coefficients learned to match the document to recognize. Moreover, we reduce the storage size of the multiple language models using two compression methods: split vector quantization (SVQ) and principal component analysis (PCA). Comprehensive experiments on two public Chinese handwriting databases, CASIA-HWDB and HIT-MW, show that the proposed unsupervised LMA approach substantially improves recognition performance, particularly for ancient-domain documents, where recognition accuracy improves by 7 percent. Meanwhile, the combination of the two compression methods greatly reduces the storage size of the language models with little loss of recognition accuracy.
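The sketch below illustrates two of the adaptation steps described above, perplexity-based model selection and regularized least-squares model combination, for a toy unigram case. It is a minimal illustration only: the function names (perplexity, select_model, combine_models), the unigram simplification, and the ridge-style L2 solution are assumptions made here for readability, not the paper's implementation, which works with full n-gram models and also covers L1-regularized combination and basis reconstruction.

```python
# Illustrative sketch (not the paper's code): unigram domain LMs are
# represented as probability vectors over a shared vocabulary.
import numpy as np


def perplexity(model_probs, counts):
    """Perplexity of a unigram LM on text summarized by word counts."""
    n = counts.sum()
    log_prob = np.sum(counts * np.log(model_probs))
    return np.exp(-log_prob / n)


def select_model(domain_models, counts):
    """Model selection: pick the domain LM with minimum perplexity
    on the first-pass recognized text."""
    ppls = [perplexity(m, counts) for m in domain_models]
    return int(np.argmin(ppls))


def combine_models(domain_models, counts, l2=1e-3):
    """Model combination: learn mixture weights by least squares with an
    L2 penalty so the weighted mixture of domain LMs approximates the
    empirical distribution of the first-pass recognized text."""
    target = counts / counts.sum()        # empirical unigram distribution
    A = np.stack(domain_models, axis=1)   # columns = domain LMs
    # Ridge-regularized normal equations: (A^T A + l2*I) w = A^T target
    w = np.linalg.solve(A.T @ A + l2 * np.eye(A.shape[1]), A.T @ target)
    w = np.clip(w, 0.0, None)             # keep weights non-negative
    w /= w.sum()                          # renormalize to sum to one
    return A @ w, w


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    vocab = 1000
    # Three hypothetical domain LMs (random unigram distributions).
    models = [rng.dirichlet(np.ones(vocab)) for _ in range(3)]
    # Fake first-pass output: counts drawn mostly from domain 1.
    counts = rng.multinomial(5000, 0.8 * models[1] + 0.2 * models[0])
    print("selected domain:", select_model(models, counts))
    adapted_lm, weights = combine_models(models, counts)
    print("combination weights:", np.round(weights, 3))
```

In this toy setting the selected domain and the largest combination weight both point to the dominant source of the recognized text; in the paper the adapted model is then used in a second recognition pass.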

Original language: English
Pages (from-to): 1202-1216
Number of pages: 15
Journal: Pattern Recognition
Volume: 47
Issue number: 3
DOIs
Publication status: Published - Mar 2014
Externally published: Yes

Keywords

  • Character string recognition
  • Chinese handwriting recognition
  • Language model compression
  • Unsupervised language model adaptation
