TY - JOUR
T1 - SCREAM: Knowledge sharing and compact representation for class incremental learning
AU - Feng, Zhikun
AU - Zhou, Mian
AU - Gao, Zan
AU - Stefanidis, Angelos
AU - Sui, Zezhou
N1 - Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2024/5
Y1 - 2024/5
N2 - Methods based on dynamic structures are effective in addressing catastrophic forgetting in class-incremental learning (CIL). However, they often isolate sub-networks and overlook the integration of overall information, resulting in a performance decline. To overcome this limitation, we recognize the importance of knowledge sharing among sub-networks. Building on dynamic networks, we propose a novel two-stage CIL method called SCREAM that comprises an Expandable Network (EN) learning stage and a Compact Representation (CR) stage: (1) a clustering loss function for EN that aggregates related instances and promotes information sharing; (2) dynamic weight alignment that alleviates the classifier's bias towards new-class knowledge; and (3) balanced decoupled distillation for CR that mitigates the long-tail effect during multiple compressions. To validate the performance of SCREAM, we evaluate it on three widely used datasets with different replay-buffer sizes against current state-of-the-art models. The results show that SCREAM surpasses these models in average accuracy by more than 2.46% on CIFAR-100, 1.22% on ImageNet-100/1000, and 1.52% on Tiny-ImageNet. With a smaller buffer size, SCREAM still exceeds them by more than 4.60% in average accuracy. Furthermore, SCREAM performs well in terms of the resources required.
AB - Methods based on dynamic structures are effective in addressing catastrophic forgetting in class-incremental learning (CIL). However, they often isolate sub-networks and overlook the integration of overall information, resulting in a performance decline. To overcome this limitation, we recognize the importance of knowledge sharing among sub-networks. Building on dynamic networks, we propose a novel two-stage CIL method called SCREAM that comprises an Expandable Network (EN) learning stage and a Compact Representation (CR) stage: (1) a clustering loss function for EN that aggregates related instances and promotes information sharing; (2) dynamic weight alignment that alleviates the classifier's bias towards new-class knowledge; and (3) balanced decoupled distillation for CR that mitigates the long-tail effect during multiple compressions. To validate the performance of SCREAM, we evaluate it on three widely used datasets with different replay-buffer sizes against current state-of-the-art models. The results show that SCREAM surpasses these models in average accuracy by more than 2.46% on CIFAR-100, 1.22% on ImageNet-100/1000, and 1.52% on Tiny-ImageNet. With a smaller buffer size, SCREAM still exceeds them by more than 4.60% in average accuracy. Furthermore, SCREAM performs well in terms of the resources required.
KW - Incremental learning
KW - Knowledge distillation
KW - Knowledge sharing
KW - Rehearsal
UR - http://www.scopus.com/inward/record.url?scp=85182031261&partnerID=8YFLogxK
U2 - 10.1016/j.ipm.2023.103629
DO - 10.1016/j.ipm.2023.103629
M3 - Article
AN - SCOPUS:85182031261
SN - 0306-4573
VL - 61
JO - Information Processing and Management
JF - Information Processing and Management
IS - 3
M1 - 103629
ER -