TY - JOUR
T1 - Background suppression and comprehensive prototype pyramid distillation for few-shot object detection
AU - Li, Ning
AU - Wang, Mingliang
AU - Yang, Gaochao
AU - Li, Bo
AU - Yuan, Baohua
AU - Xu, Shoukun
AU - Qi, Jun
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2025/5
Y1 - 2025/5
N2 - Few-Shot Object Detection (FSOD) methods can achieve detection of novel classes with only a small number of annotated samples and have received widespread attention in recent years. Meta-learning has been proven to be a key technology for addressing few-shot problems. Typically, meta-learning-based methods require an additional support branch to extract class prototypes for the few-shot classes, and the detection head performs classification and detection by measuring the distance between the class prototypes and the query features. Since the input to the support branch is the object image annotated with a bounding box, it often contains a large amount of background information, which degrades the quality of the class prototypes. Through our meticulous observation, we found that the center of the bounding box is often the core feature area of the object. Based on this, we designed a lightweight Background Suppression (BS) module that suppresses background features by measuring the similarity between the peripheral and central features of the support features, thereby providing high-quality support features for class prototype extraction. Additionally, in terms of class prototype extraction, we designed a more robust Comprehensive Prototype Pyramid Distillation (CPPD) module. This module first captures the multi-scale feature information of the object from the background-suppressed support features, and then uses a pyramid structure to hierarchically distill the multi-scale features to extract more comprehensive and purer class prototypes. Extensive experimental results on the PASCAL VOC and COCO datasets show that compared to other models under the same architecture and techniques, we achieved the best results.
AB - Few-Shot Object Detection (FSOD) methods can achieve detection of novel classes with only a small number of annotated samples and have received widespread attention in recent years. Meta-learning has been proven to be a key technology for addressing few-shot problems. Typically, meta-learning-based methods require an additional support branch to extract class prototypes for the few-shot classes, and the detection head performs classification and detection by measuring the distance between the class prototypes and the query features. Since the input to the support branch is the object image annotated with a bounding box, it often contains a large amount of background information, which degrades the quality of the class prototypes. Through our meticulous observation, we found that the center of the bounding box is often the core feature area of the object. Based on this, we designed a lightweight Background Suppression (BS) module that suppresses background features by measuring the similarity between the peripheral and central features of the support features, thereby providing high-quality support features for class prototype extraction. Additionally, in terms of class prototype extraction, we designed a more robust Comprehensive Prototype Pyramid Distillation (CPPD) module. This module first captures the multi-scale feature information of the object from the background-suppressed support features, and then uses a pyramid structure to hierarchically distill the multi-scale features to extract more comprehensive and purer class prototypes. Extensive experimental results on the PASCAL VOC and COCO datasets show that compared to other models under the same architecture and techniques, we achieved the best results.
KW - Background suppression
KW - Comprehensive prototype pyramid distillation
KW - Few-shot object detection
KW - Object detection
UR - http://www.scopus.com/inward/record.url?scp=85217278057&partnerID=8YFLogxK
U2 - 10.1016/j.robot.2025.104938
DO - 10.1016/j.robot.2025.104938
M3 - Article
AN - SCOPUS:85217278057
SN - 0921-8890
VL - 187
JO - Robotics and Autonomous Systems
JF - Robotics and Autonomous Systems
M1 - 104938
ER -