AlexCapsNet: an integrated architecture for image classification with background noise

Muyi Bao; Nanlin Jin; Ming Xu

doi:10.1109/ACCESS.2025.3544661

Muyi Bao, Nanlin Jin^*, Ming Xu

^*Corresponding author for this work

Xi'an Jiaotong-Liverpool University

Research output: Contribution to journal › Article › peer-review

Abstract

Capsule networks (CapsNet) are a pioneering architecture that can encode image features into vectors rather than scalars, addressing the limitations of traditional Convolutional Neural Networks (CNNs). This encoding is achieved by the Dynamic Routing algorithm and can maintain the image’s spatial hierarchies. CapsNet has demonstrated state-of-the-art performance on simple datasets such as MNIST, but its performance degrades on more complex datasets. To solve this problem, this paper proposes the AlexCapsNet architecture, in which the classic classification model AlexNet is used as the feature extraction layer. This allows CapsNet to capture deeper and more semantic features. A comprehensive evaluation on four datasets shows that AlexCapsNet improves performance compared with the baseline and other CapsNet variants. In addition, our experiments on seven datasets show that the reconstruction module in CapsNet degrades performance on datasets with background noise. AlexCapsNet removes the reconstruction module and can therefore adapt to these complicated datasets. Our code is available at https://github.com/BaoBao0926/AlexCapsNet.
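The abstract names the Dynamic Routing algorithm without spelling it out. As a rough illustration only, the sketch below follows the generic routing-by-agreement procedure from the original CapsNet formulation, written in plain Python. It is not taken from the AlexCapsNet repository, and the function names (`squash`, `dynamic_routing`, `capsule_lengths`), the three-iteration default, and the toy input are assumptions made for this example.

```python
import math


def squash(s, eps=1e-9):
    """Shrink vector s to a length in [0, 1) while keeping its direction."""
    sq_norm = sum(x * x for x in s)
    scale = sq_norm / (1.0 + sq_norm) / math.sqrt(sq_norm + eps)
    return [scale * x for x in s]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_routing(u_hat, num_iterations=3):
    """Route predictions u_hat[i][j] (lower capsule i -> upper capsule j).

    Returns one output vector per upper capsule.
    """
    num_lower = len(u_hat)
    num_upper = len(u_hat[0])
    dim = len(u_hat[0][0])
    # Routing logits start at zero, so the first pass weights all routes equally.
    b = [[0.0] * num_upper for _ in range(num_lower)]
    v = []
    for _ in range(num_iterations):
        # Coupling coefficients: each lower capsule distributes itself over upper capsules.
        c = [softmax(row) for row in b]
        v = []
        for j in range(num_upper):
            s_j = [
                sum(c[i][j] * u_hat[i][j][d] for i in range(num_lower))
                for d in range(dim)
            ]
            v.append(squash(s_j))
        # Agreement update: raise the logit when a prediction aligns with the output.
        for i in range(num_lower):
            for j in range(num_upper):
                b[i][j] += sum(p * q for p, q in zip(u_hat[i][j], v[j]))
    return v


def capsule_lengths(v):
    """Vector length of each capsule, read as the presence score of its class."""
    return [math.sqrt(sum(x * x for x in cap)) for cap in v]


# Toy input (assumed): both lower capsules agree on upper capsule 0 and
# disagree on upper capsule 1, so capsule 0 should end up longer.
toy_u_hat = [
    [[1.0, 0.0], [0.5, 0.5]],
    [[1.0, 0.0], [-0.5, -0.5]],
]
print(capsule_lengths(dynamic_routing(toy_u_hat)))
```

In an AlexCapsNet-style pipeline, the prediction vectors would presumably be derived from AlexNet feature maps rather than written by hand; that wiring is not described in the abstract and is left out here.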

Original language	English
Article number	10900363
Pages (from-to)	37690-37702
Number of pages	14
Journal	IEEE Access
Volume	13
DOIs	https://doi.org/10.1109/ACCESS.2025.3544661
Publication status	Published - 4 Mar 2025

Keywords

AlexNet
Capsule Network
Deep learning
Image classification

Access to Document

10.1109/ACCESS.2025.3544661

Licence: CC BY

Cite this

@article{c24849d53254456bb53fdaf7d5157b18,

title = "AlexCapsNet: an integrated architecture for image classification with background noise",

abstract = "Capsule networks (CapsNet) are a pioneering architecture that can encode image features into vectors rather than scalars, addressing the limitations of traditional Convolutional Neural Networks (CNNs). This process is achieved by the Dynamic Routing algorithm and can maintain the image{\textquoteright}s spatial hierarchies. CapsNet has demonstrated the state-of-the-art performance in simple datasets such as MNIST, but its performance degrades in more complex datasets. To solve this problem, AlexCapsNet architecture is proposed in this paper, in which the classic classification model AlexNet is used as the feature extraction layer. This allows CapsNet to capture deeper and more semantic features. The comprehensive evaluation with four datasets shows AlexCapsNet has improved performance when compared with the baseline and other CapsNet variants. Besides, our experiments on seven datasets show the reconstruction module existing in the CapsNet degrades the performance in datasets with background noise. AlexCapsNet removes the reconstruction module and therefore can adapt to these complicated datasets. Our code is available at https://github.com/BaoBao0926/AlexCapsNet.",

keywords = "AlexNet, Capsule Network, Deep learning, Image classification",

author = "Muyi Bao and Nanlin Jin and Ming Xu",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2025",

month = mar,

day = "4",

doi = "10.1109/ACCESS.2025.3544661",

language = "English",

volume = "13",

pages = "37690--37702",

journal = "IEEE Access",

issn = "2169-3536",

}

TY - JOUR

T1 - AlexCapsNet: an integrated architecture for image classification with background noise

AU - Bao, Muyi

AU - Jin, Nanlin

AU - Xu, Ming

PY - 2025/3/4

Y1 - 2025/3/4

N2 - Capsule networks (CapsNet) are a pioneering architecture that can encode image features into vectors rather than scalars, addressing the limitations of traditional Convolutional Neural Networks (CNNs). This process is achieved by the Dynamic Routing algorithm and can maintain the image’s spatial hierarchies. CapsNet has demonstrated the state-of-the-art performance in simple datasets such as MNIST, but its performance degrades in more complex datasets. To solve this problem, AlexCapsNet architecture is proposed in this paper, in which the classic classification model AlexNet is used as the feature extraction layer. This allows CapsNet to capture deeper and more semantic features. The comprehensive evaluation with four datasets shows AlexCapsNet has improved performance when compared with the baseline and other CapsNet variants. Besides, our experiments on seven datasets show the reconstruction module existing in the CapsNet degrades the performance in datasets with background noise. AlexCapsNet removes the reconstruction module and therefore can adapt to these complicated datasets. Our code is available at https://github.com/BaoBao0926/AlexCapsNet.

AB - Capsule networks (CapsNet) are a pioneering architecture that can encode image features into vectors rather than scalars, addressing the limitations of traditional Convolutional Neural Networks (CNNs). This process is achieved by the Dynamic Routing algorithm and can maintain the image’s spatial hierarchies. CapsNet has demonstrated the state-of-the-art performance in simple datasets such as MNIST, but its performance degrades in more complex datasets. To solve this problem, AlexCapsNet architecture is proposed in this paper, in which the classic classification model AlexNet is used as the feature extraction layer. This allows CapsNet to capture deeper and more semantic features. The comprehensive evaluation with four datasets shows AlexCapsNet has improved performance when compared with the baseline and other CapsNet variants. Besides, our experiments on seven datasets show the reconstruction module existing in the CapsNet degrades the performance in datasets with background noise. AlexCapsNet removes the reconstruction module and therefore can adapt to these complicated datasets. Our code is available at https://github.com/BaoBao0926/AlexCapsNet.

KW - AlexNet

KW - Capsule Network

KW - Deep learning

KW - Image classification

UR - http://www.scopus.com/inward/record.url?scp=85219161095&partnerID=8YFLogxK

UR - https://github.com/BaoBao0926/AlexCapsNet

U2 - 10.1109/ACCESS.2025.3544661

DO - 10.1109/ACCESS.2025.3544661

M3 - Article

SN - 2169-3536

VL - 13

SP - 37690

EP - 37702

JO - IEEE Access

JF - IEEE Access

M1 - 10900363

ER -
