TY - GEN
T1 - Industrial Image Anomaly Detection Method Based on Improved MAE
AU - He, Hui
AU - Xu, Yuanping
AU - Zhang, Chaolong
AU - Guo, Benjun
AU - Xu, Zhijie
AU - Jin, Jin
AU - Kong, Chao
AU - Huang, Jian
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - This study explores an improved Masked Autoencoder (MAE) model for anomaly detection in common industrial scene images. Industrial image anomaly detection, as an important research topic in computer vision, can detect abnormal data deviations from normal expected behavior and ensure the normal operation of various systems. In actual industrial scenarios, the scarcity of abnormal samples, the cost of labeled data, and the lack of prior knowledge about anomalies make unsupervised learning methods widely used in the field of image anomaly detection. However, most unsupervised learning methods currently require the application of large-scale datasets. In order to make unsupervised learning applicable to small-scale industrial image datasets, we replaced the backbone network with Swin Transformer based on the Masked Autoencoder method, while improving its mask strategy and optimizer. The effectiveness and advantage of the proposed method are demonstrated through an experimental comparison with existing model, the results show that this method is better than MAE model pretrained on ImageNet, and outperforms other unsupervised learning models in multiple categories.
AB - This study explores an improved Masked Autoencoder (MAE) model for anomaly detection in common industrial scene images. Industrial image anomaly detection, as an important research topic in computer vision, can detect abnormal data deviations from normal expected behavior and ensure the normal operation of various systems. In actual industrial scenarios, the scarcity of abnormal samples, the cost of labeled data, and the lack of prior knowledge about anomalies make unsupervised learning methods widely used in the field of image anomaly detection. However, most unsupervised learning methods currently require the application of large-scale datasets. In order to make unsupervised learning applicable to small-scale industrial image datasets, we replaced the backbone network with Swin Transformer based on the Masked Autoencoder method, while improving its mask strategy and optimizer. The effectiveness and advantage of the proposed method are demonstrated through an experimental comparison with existing model, the results show that this method is better than MAE model pretrained on ImageNet, and outperforms other unsupervised learning models in multiple categories.
KW - Anomaly Detection
KW - Industrial Image
KW - MAE
KW - Swin Transformer
KW - Unsupervised Learning
UR - http://www.scopus.com/inward/record.url?scp=85175555445&partnerID=8YFLogxK
U2 - 10.1109/ICAC57885.2023.10275293
DO - 10.1109/ICAC57885.2023.10275293
M3 - Conference Proceeding
AN - SCOPUS:85175555445
T3 - ICAC 2023 - 28th International Conference on Automation and Computing
BT - ICAC 2023 - 28th International Conference on Automation and Computing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 28th International Conference on Automation and Computing, ICAC 2023
Y2 - 30 August 2023 through 1 September 2023
ER -