AlexCapsNet: an integrated architecture for image classification with background noise

Muyi Bao, Nanlin Jin*, Ming Xu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Capsule networks (CapsNet) are a pioneering architecture that can encode image features
into vectors rather than scalars, addressing the limitations of traditional Convolutional Neural Networks (CNNs). This process is achieved by the Dynamic Routing algorithm and can maintain the image’s spatial hierarchies. CapsNet has demonstrated the state-of-the-art performance in simple datasets such as MNIST, but its performance degrades in more complex datasets. To solve this problem, AlexCapsNet architecture is proposed in this paper, in which the classic classification model AlexNet is used as the feature extraction layer. This allows CapsNet to capture deeper and more semantic features. The comprehensive evaluation with four datasets shows AlexCapsNet has improved performance when compared with the baseline and other CapsNet variants. Besides, our experiments on seven datasets show the reconstruction module existing in the CapsNet degrades the performance in datasets with background noise. AlexCapsNet removes the reconstruction module and therefore can adapt to these complicated datasets. Our code is available at https://github.com/BaoBao0926/AlexCapsNet.
Original languageEnglish
JournalIEEE Access
Publication statusAccepted/In press - 16 Feb 2025

Keywords

  • image classification
  • Deep learning
  • Capsule Network
  • AlexNet

Fingerprint

Dive into the research topics of 'AlexCapsNet: an integrated architecture for image classification with background noise'. Together they form a unique fingerprint.

Cite this