A touching character database from Chinese handwriting for assessing segmentation algorithms

Liang Xu*, Fei Yin, Qiu Feng Wang, Cheng Lin Liu

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

4 Citations (Scopus)

Abstract

For assessing touching character segmentation algorithms, we present a database of touching characters collected from the Chinese handwriting database CASIA-HWDB, called CASIA-HWDB-T. It includes 56,469 two-character or multiple-character touching strings, among which 1,818 strings have multiple touching characters. We also partition the touching strings into 50,157 all-Chinese strings, 2,788 all-digit ones, 328 all-letter ones, and 3,196 mixed-character ones. All the strings are annotated with the character classes, the locations of touching points, and auxiliary values such as string height and average stroke width. Finally, we measure the segmentation performance of three existing algorithms on this database for reference.
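The counts quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable and function names are assumptions made for this example and are not part of the CASIA-HWDB-T distribution. It checks that the four script categories account for every string and computes each category's share of the database.

```python
# Counts quoted in the abstract of CASIA-HWDB-T.
# All names below are hypothetical, chosen only for this illustration.
TOTAL_STRINGS = 56_469
MULTI_TOUCHING = 1_818  # strings with multiple touching characters

PARTITION = {
    "all-Chinese": 50_157,
    "all-digit": 2_788,
    "all-letter": 328,
    "mixed-character": 3_196,
}


def partition_shares(partition, total):
    """Return each category's share of the total, as a fraction."""
    return {name: count / total for name, count in partition.items()}


partition_total = sum(PARTITION.values())
shares = partition_shares(PARTITION, TOTAL_STRINGS)
multi_share = MULTI_TOUCHING / TOTAL_STRINGS

if __name__ == "__main__":
    print(f"partition total: {partition_total}")
    for name, share in shares.items():
        print(f"{name}: {share:.1%}")
    print(f"multiple-touching share: {multi_share:.1%}")
```

The four categories sum to the stated 56,469 strings, so the partition is exhaustive and non-overlapping as far as the reported counts go.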

Original language: English
Title of host publication: Proceedings - 13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012
Pages: 89-94
Number of pages: 6
Publication status: Published - 2012
Externally published: Yes
Event: 13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012 - Bari, Italy
Duration: 18 Sept 2012 to 20 Sept 2012

Publication series

Name: Proceedings - International Workshop on Frontiers in Handwriting Recognition, IWFHR
ISSN (Print): 1550-5235

Conference

Conference: 13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012
Country/Territory: Italy
City: Bari
Period: 18/09/12 to 20/09/12
