Abstract
Capsule networks (CapsNet) are a pioneering architecture that can encode image features
into vectors rather than scalars, addressing the limitations of traditional Convolutional Neural Networks (CNNs). This process is achieved by the Dynamic Routing algorithm and can maintain the image’s spatial hierarchies. CapsNet has demonstrated the state-of-the-art performance in simple datasets such as MNIST, but its performance degrades in more complex datasets. To solve this problem, AlexCapsNet architecture is proposed in this paper, in which the classic classification model AlexNet is used as the feature extraction layer. This allows CapsNet to capture deeper and more semantic features. The comprehensive evaluation with four datasets shows AlexCapsNet has improved performance when compared with the baseline and other CapsNet variants. Besides, our experiments on seven datasets show the reconstruction module existing in the CapsNet degrades the performance in datasets with background noise. AlexCapsNet removes the reconstruction module and therefore can adapt to these complicated datasets. Our code is available at https://github.com/BaoBao0926/AlexCapsNet.
into vectors rather than scalars, addressing the limitations of traditional Convolutional Neural Networks (CNNs). This process is achieved by the Dynamic Routing algorithm and can maintain the image’s spatial hierarchies. CapsNet has demonstrated the state-of-the-art performance in simple datasets such as MNIST, but its performance degrades in more complex datasets. To solve this problem, AlexCapsNet architecture is proposed in this paper, in which the classic classification model AlexNet is used as the feature extraction layer. This allows CapsNet to capture deeper and more semantic features. The comprehensive evaluation with four datasets shows AlexCapsNet has improved performance when compared with the baseline and other CapsNet variants. Besides, our experiments on seven datasets show the reconstruction module existing in the CapsNet degrades the performance in datasets with background noise. AlexCapsNet removes the reconstruction module and therefore can adapt to these complicated datasets. Our code is available at https://github.com/BaoBao0926/AlexCapsNet.
Original language | English |
---|---|
Article number | 10900363 |
Pages (from-to) | 37690-37702 |
Number of pages | 14 |
Journal | IEEE Access |
Volume | 13 |
DOIs | |
Publication status | Published - 4 Mar 2025 |
Keywords
- AlexNet
- Capsule Network
- Deep learning
- Image classification