TY - JOUR
T1 - A comprehensive survey of oracle character recognition
T2 - Challenges, datasets, methodology, and beyond
AU - Li, Jing
AU - Chi, Xueke
AU - Wang, Qiufeng
AU - Huang, Kaizhu
AU - Wang, Da Han
AU - Liu, Yongge
AU - Liu, Cheng Lin
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2026/1
Y1 - 2026/1
N2 - Oracle character recognition — an analysis of ancient Chinese inscriptions found on oracle bones — has become a pivotal field intersecting archaeology, paleography, and historical cultural studies. Traditional methods of oracle character recognition have relied heavily on manual interpretation by experts, which is not only labor-intensive but also limits broader accessibility to the general public. With recent breakthroughs in pattern recognition and deep learning, there is a growing movement toward the automation of oracle character recognition (OrCR), showing considerable promise in tackling the challenges inherent to these ancient scripts. However, a comprehensive understanding of OrCR still remains elusive. Therefore, this paper presents a systematic and structured survey of the current landscape of OrCR research. We commence by identifying and analyzing the key challenges of OrCR. Then, we provide an overview of the primary benchmark datasets and digital resources available for OrCR. A review of contemporary research methodologies follows, in which their respective efficacies, limitations, and applicability to the complex nature of oracle characters are critically highlighted and examined. Additionally, our review extends to ancillary tasks associated with OrCR across diverse disciplines, providing a broad-spectrum analysis of its applications. We conclude with a forward-looking perspective, proposing potential avenues for future investigations that could yield significant advancements in the field.
AB - Oracle character recognition — an analysis of ancient Chinese inscriptions found on oracle bones — has become a pivotal field intersecting archaeology, paleography, and historical cultural studies. Traditional methods of oracle character recognition have relied heavily on manual interpretation by experts, which is not only labor-intensive but also limits broader accessibility to the general public. With recent breakthroughs in pattern recognition and deep learning, there is a growing movement toward the automation of oracle character recognition (OrCR), showing considerable promise in tackling the challenges inherent to these ancient scripts. However, a comprehensive understanding of OrCR still remains elusive. Therefore, this paper presents a systematic and structured survey of the current landscape of OrCR research. We commence by identifying and analyzing the key challenges of OrCR. Then, we provide an overview of the primary benchmark datasets and digital resources available for OrCR. A review of contemporary research methodologies follows, in which their respective efficacies, limitations, and applicability to the complex nature of oracle characters are critically highlighted and examined. Additionally, our review extends to ancillary tasks associated with OrCR across diverse disciplines, providing a broad-spectrum analysis of its applications. We conclude with a forward-looking perspective, proposing potential avenues for future investigations that could yield significant advancements in the field.
KW - Dataset
KW - Handwriting recognition
KW - Oracle bone script
KW - Oracle character recognition
KW - Survey
UR - http://www.scopus.com/inward/record.url?scp=105007538642&partnerID=8YFLogxK
U2 - 10.1016/j.patcog.2025.111824
DO - 10.1016/j.patcog.2025.111824
M3 - Review article
AN - SCOPUS:105007538642
SN - 0031-3203
VL - 169
JO - Pattern Recognition
JF - Pattern Recognition
M1 - 111824
ER -