TY - GEN
T1 - Decoupled Learning for Long-Tailed Oracle Character Recognition
AU - Li, Jing
AU - Dong, Bin
AU - Wang, Qiu Feng
AU - Ding, Lei
AU - Zhang, Rui
AU - Huang, Kaizhu
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Oracle character recognition has recently made significant progress with the success of deep neural networks (DNNs), but it is far from being solved. Most works do not consider the long-tailed distribution issue in oracle character recognition, resulting in a DNN biased towards head classes. To overcome this issue, we propose a two-stage decoupled learning method to train an unbiased DNN model for long-tailed oracle character recognition. In the first stage, we optimize the DNN under instance-balanced sampling, obtaining a robust backbone but a biased classifier. In the second stage, we propose two strategies to refine the classifier under class-balanced sampling. Specifically, we add a learnable weight scaling module that adjusts the classifier to respect tail classes; meanwhile, we integrate the KL-divergence loss to maintain attention to head classes through knowledge distillation from the first stage. Coupling these two designs enables us to train an unbiased DNN model for oracle character recognition. Our proposed method achieves new state-of-the-art performance on three benchmark datasets: OBC306, Oracle-AYNU and Oracle-20K.
AB - Oracle character recognition has recently made significant progress with the success of deep neural networks (DNNs), but it is far from being solved. Most works do not consider the long-tailed distribution issue in oracle character recognition, resulting in a DNN biased towards head classes. To overcome this issue, we propose a two-stage decoupled learning method to train an unbiased DNN model for long-tailed oracle character recognition. In the first stage, we optimize the DNN under instance-balanced sampling, obtaining a robust backbone but a biased classifier. In the second stage, we propose two strategies to refine the classifier under class-balanced sampling. Specifically, we add a learnable weight scaling module that adjusts the classifier to respect tail classes; meanwhile, we integrate the KL-divergence loss to maintain attention to head classes through knowledge distillation from the first stage. Coupling these two designs enables us to train an unbiased DNN model for oracle character recognition. Our proposed method achieves new state-of-the-art performance on three benchmark datasets: OBC306, Oracle-AYNU and Oracle-20K.
KW - Decoupled learning
KW - Knowledge distillation
KW - Long tail
KW - Oracle character recognition
UR - http://www.scopus.com/inward/record.url?scp=85173584986&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-41685-9_11
DO - 10.1007/978-3-031-41685-9_11
M3 - Conference Proceeding
AN - SCOPUS:85173584986
SN - 9783031416842
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 165
EP - 181
BT - Document Analysis and Recognition – ICDAR 2023 - 17th International Conference, Proceedings
A2 - Fink, Gernot A.
A2 - Jain, Rajiv
A2 - Kise, Koichi
A2 - Zanibbi, Richard
PB - Springer Science and Business Media Deutschland GmbH
T2 - 17th International Conference on Document Analysis and Recognition, ICDAR 2023
Y2 - 21 August 2023 through 26 August 2023
ER -