Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification

Zhaorui Tan, Xi Yang^*, Qiufeng Wang, Anh Nguyen, Kaizhu Huang^*

^*Corresponding author for this work

Department of Intelligent Science

Research output: Contribution to journal › Conference article › peer-review

Abstract

Vision models excel in image classification but struggle to generalize to unseen data, such as classifying images from unseen domains or discovering novel categories. In this paper, we explore the relationship between logical reasoning and deep learning generalization in visual classification. A logical regularization termed L-Reg is derived which bridges a logical analysis framework to image classification. Our work reveals that L-Reg reduces the complexity of the model in terms of the feature distribution and classifier weights. Specifically, we unveil the interpretability brought by L-Reg, as it enables the model to extract the salient features, such as faces to persons, for classification. Theoretical analysis and experiments demonstrate that L-Reg enhances generalization across various scenarios, including multi-domain generalization and generalized category discovery. In complex real-world scenarios where images span unknown classes and unseen domains, L-Reg consistently improves generalization, highlighting its practical efficacy.

Original language	English
Journal	Advances in Neural Information Processing Systems
Volume	37
Publication status	Published - 2024
Event	38th Conference on Neural Information Processing Systems, NeurIPS 2024 - Vancouver, Canada
Duration	9 Dec 2024 → 15 Dec 2024

Cite this

@article{c31e0b6932664ff4b019b6b92c144d8d,

title = "Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification",

abstract = "Vision models excel in image classification but struggle to generalize to unseen data, such as classifying images from unseen domains or discovering novel categories. In this paper, we explore the relationship between logical reasoning and deep learning generalization in visual classification. A logical regularization termed L-Reg is derived which bridges a logical analysis framework to image classification. Our work reveals that L-Reg reduces the complexity of the model in terms of the feature distribution and classifier weights. Specifically, we unveil the interpretability brought by L-Reg, as it enables the model to extract the salient features, such as faces to persons, for classification. Theoretical analysis and experiments demonstrate that L-Reg enhances generalization across various scenarios, including multi-domain generalization and generalized category discovery. In complex real-world scenarios where images span unknown classes and unseen domains, L-Reg consistently improves generalization, highlighting its practical efficacy.",

author = "Zhaorui Tan and Xi Yang and Qiufeng Wang and Anh Nguyen and Kaizhu Huang",

note = "Publisher Copyright: {\textcopyright} 2024 Neural Information Processing Systems Foundation. All rights reserved.; 38th Conference on Neural Information Processing Systems, NeurIPS 2024; Conference date: 09-12-2024 through 15-12-2024",

year = "2024",

language = "English",

volume = "37",

journal = "Advances in Neural Information Processing Systems",

issn = "1049-5258",

}

TY - JOUR

T1 - Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification

T2 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024

AU - Tan, Zhaorui

AU - Yang, Xi

AU - Wang, Qiufeng

AU - Nguyen, Anh

AU - Huang, Kaizhu

PY - 2024

Y1 - 2024

AB - Vision models excel in image classification but struggle to generalize to unseen data, such as classifying images from unseen domains or discovering novel categories. In this paper, we explore the relationship between logical reasoning and deep learning generalization in visual classification. A logical regularization termed L-Reg is derived which bridges a logical analysis framework to image classification. Our work reveals that L-Reg reduces the complexity of the model in terms of the feature distribution and classifier weights. Specifically, we unveil the interpretability brought by L-Reg, as it enables the model to extract the salient features, such as faces to persons, for classification. Theoretical analysis and experiments demonstrate that L-Reg enhances generalization across various scenarios, including multi-domain generalization and generalized category discovery. In complex real-world scenarios where images span unknown classes and unseen domains, L-Reg consistently improves generalization, highlighting its practical efficacy.

UR - http://www.scopus.com/inward/record.url?scp=105000538379&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:105000538379

SN - 1049-5258

VL - 37

JO - Advances in Neural Information Processing Systems

JF - Advances in Neural Information Processing Systems

Y2 - 9 December 2024 through 15 December 2024

ER -
