TY - JOUR
T1 - Weight-guided loss for long-tailed object detection and instance segmentation
AU - Zhao, Xinqiao
AU - Xiao, Jimin
AU - Zhang, Bingfeng
AU - Zhang, Quan
AU - Waleed, Al Nuaimy
N1 - Funding Information:
The work was supported by National Key R&D Program of China (No. 2022Y FE0200300 ), National Natural Science Foundation of China under 61972323 , and Key Program Special Fund in XJTLU under KSF-T-02 , KSF-P-02 .
Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2023/1
Y1 - 2023/1
N2 - The long-tailed characteristic leads to a significant performance drop for various models on long-tailed distribution datasets. Existing works mainly focus on mitigating the data shortage in tail classes at dataset level by data re-sampling, loss re-weighting or knowledge transfer from head to tail. While in this paper, we focus on another perspective which is also related to the performance drop: the gap between total dataset class number and training batch size. To address this issue, we propose a Weight-Guided (WG) loss which utilizes the classifier weights as auxiliary tail samples. It can be easily deployed to different models. By simply adding WG loss to Mask R-CNN with ResNet-50 backbone, we improve the performance by (i) 0.5 box AP and 0.4 mask AP on COCO dataset, (ii) 0.4 box and mask AP (1.8 mask AP for rare classes) on LVIS v1.0 dataset. Codes will be released.
AB - The long-tailed characteristic leads to a significant performance drop for various models on long-tailed distribution datasets. Existing works mainly focus on mitigating the data shortage in tail classes at dataset level by data re-sampling, loss re-weighting or knowledge transfer from head to tail. While in this paper, we focus on another perspective which is also related to the performance drop: the gap between total dataset class number and training batch size. To address this issue, we propose a Weight-Guided (WG) loss which utilizes the classifier weights as auxiliary tail samples. It can be easily deployed to different models. By simply adding WG loss to Mask R-CNN with ResNet-50 backbone, we improve the performance by (i) 0.5 box AP and 0.4 mask AP on COCO dataset, (ii) 0.4 box and mask AP (1.8 mask AP for rare classes) on LVIS v1.0 dataset. Codes will be released.
KW - Instance segmentation
KW - Long-tail
KW - Object detection
UR - http://www.scopus.com/inward/record.url?scp=85140896191&partnerID=8YFLogxK
U2 - 10.1016/j.image.2022.116874
DO - 10.1016/j.image.2022.116874
M3 - Article
AN - SCOPUS:85140896191
SN - 0923-5965
VL - 110
JO - Signal Processing: Image Communication
JF - Signal Processing: Image Communication
M1 - 116874
ER -