TY - JOUR
T1 - LIDD-YOLO
T2 - a lightweight industrial defect detection network
AU - Luo, Shen
AU - Xu, Yuanping
AU - Zhang, Chaolong
AU - Jin, Jin
AU - Kong, Chao
AU - Xu, Zhijie
AU - Guo, Benjun
AU - Tang, Dan
AU - Cao, Yanlong
N1 - Publisher Copyright:
© 2024 IOP Publishing Ltd. All rights, including for text and data mining, AI training, and similar technologies, are reserved.
PY - 2025/1
Y1 - 2025/1
N2 - Surface defect detection is crucial in industrial production, and due to the conveyor speed, real-time detection requires 30-60 frames per second (FPS), which exceeds the capability of most existing methods. This demand for high FPS has driven the need for lightweight detection models. Despite significant advancements in deep learning-based detection that have enabled single-stage models such as the you only look once (YOLO) series to achieve relatively fast detection, existing methods still face challenges in detecting multi-scale defects and tiny defects on complex surfaces while maintaining detection speed. This study proposes a lightweight single-stage detection model called lightweight industrial defect detection network with improved YOLO architecture (LIDD-YOLO) for high-precision and real-time industrial defect detection. Firstly, we propose the large separable kernel spatial pyramid pooling (SPP) module, which is a SPP structure with a separable large kernel attention mechanism, significantly improving the detection rate of multi-scale defects and enhancing the detection rate of small target defects. Secondly, we improved the Backbone and Neck structure of YOLOv8n with dual convolutional (Dual Conv) kernel convolution and enhanced the faster implementation of Cross Stage Partial Bottleneck with 2 Convolutions (C2f) module in the Neck structure with ghost convolution and decoupled fully connected (DFC) attention, reducing the computational and parameter overhead of the model while ensuring detection accuracy. Experimental results on the NEU-DET steel defect datasets and printed circuit board (PCB) defect datasets demonstrate that compared to YOLOv8n, LIDD-YOLO improves the recognition rate of multi-scale defects and small target defects while meeting lightweight requirements. LIDD-YOLO achieves a 3.2% increase in mean average precision (mAP) on the NEU-DET steel defect dataset, reaching 79.5%, and a 2.6% increase in mAP on the small target PCB defect dataset, reaching 93.3%. Moreover, it reduces the parameter count by 20.0% and floating point operations by 15.5%, further meeting the requirements for lightweight and high-precision industrial defect detection models.
AB - Surface defect detection is crucial in industrial production, and due to the conveyor speed, real-time detection requires 30-60 frames per second (FPS), which exceeds the capability of most existing methods. This demand for high FPS has driven the need for lightweight detection models. Despite significant advancements in deep learning-based detection that have enabled single-stage models such as the you only look once (YOLO) series to achieve relatively fast detection, existing methods still face challenges in detecting multi-scale defects and tiny defects on complex surfaces while maintaining detection speed. This study proposes a lightweight single-stage detection model called lightweight industrial defect detection network with improved YOLO architecture (LIDD-YOLO) for high-precision and real-time industrial defect detection. Firstly, we propose the large separable kernel spatial pyramid pooling (SPP) module, which is a SPP structure with a separable large kernel attention mechanism, significantly improving the detection rate of multi-scale defects and enhancing the detection rate of small target defects. Secondly, we improved the Backbone and Neck structure of YOLOv8n with dual convolutional (Dual Conv) kernel convolution and enhanced the faster implementation of Cross Stage Partial Bottleneck with 2 Convolutions (C2f) module in the Neck structure with ghost convolution and decoupled fully connected (DFC) attention, reducing the computational and parameter overhead of the model while ensuring detection accuracy. Experimental results on the NEU-DET steel defect datasets and printed circuit board (PCB) defect datasets demonstrate that compared to YOLOv8n, LIDD-YOLO improves the recognition rate of multi-scale defects and small target defects while meeting lightweight requirements. LIDD-YOLO achieves a 3.2% increase in mean average precision (mAP) on the NEU-DET steel defect dataset, reaching 79.5%, and a 2.6% increase in mAP on the small target PCB defect dataset, reaching 93.3%. Moreover, it reduces the parameter count by 20.0% and floating point operations by 15.5%, further meeting the requirements for lightweight and high-precision industrial defect detection models.
KW - LSKA
KW - PCB defect detection
KW - YOLO
KW - steel defect detection
KW - surface defect detection
UR - http://www.scopus.com/inward/record.url?scp=85215377456&partnerID=8YFLogxK
U2 - 10.1088/1361-6501/ad9d65
DO - 10.1088/1361-6501/ad9d65
M3 - Article
AN - SCOPUS:85215377456
SN - 0957-0233
VL - 36
JO - Measurement Science and Technology
JF - Measurement Science and Technology
IS - 1
M1 - 0161b5
ER -