MPSSD: Multi-Path Fusion Single Shot Detector

Shuyi Qu; Kaizhu Huang; Amir Hussain; Yannis Goulermas

doi:10.1109/IJCNN.2019.8852053

MPSSD: Multi-Path Fusion Single Shot Detector

Shuyi Qu, Kaizhu Huang, Amir Hussain, Yannis Goulermas

School of Advanced Technology

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

4 Citations (Scopus)

Abstract

Recent prevalent one stage detectors, such as single shot detector (SSD) and RetinaNet, are able to detect objects faster than two stage ones while maintaining comparable accuracy. To further boost the accuracy, many studies focus on enhancing the multi-scale feature pyramid. Most of these current proposals focus on strengthening features on one pyramid, ignoring the rich connection among different scale features. In contrast, we propose a novel multi-path design to fully utilize the localization and semantics information. First, we exploit the original SSD multi-scale features as our base pyramid. Then we fuse these features in different groups to generate multi-path feature pyramids. Finally, we combine these pyramids through a novel and effective aggregation module, to obtain the final informative pyramid for detection. Comparative experiments on benchmark PASCAL VOC and MS COCO datasets have shown that our proposed method outperforms many state-of-the-art detectors. As an illustrative example, for input image with size 512×512, we can achieve a mean Average Precision (mAP) of 81.8% on VOC2007 test and 33.1% mAP on COCO test-dev2015.

Original language	English
Title of host publication	2019 International Joint Conference on Neural Networks, IJCNN 2019
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781728119854
DOIs	https://doi.org/10.1109/IJCNN.2019.8852053
Publication status	Published - Jul 2019
Event	2019 International Joint Conference on Neural Networks, IJCNN 2019 - Budapest, Hungary Duration: 14 Jul 2019 → 19 Jul 2019

Publication series

Name	Proceedings of the International Joint Conference on Neural Networks
Volume	2019-July

Conference

Conference	2019 International Joint Conference on Neural Networks, IJCNN 2019
Country/Territory	Hungary
City	Budapest
Period	14/07/19 → 19/07/19

Keywords

Fusion
Multiple Path
Object Detection
SSD

Access to Document

10.1109/IJCNN.2019.8852053

Cite this

Qu, S., Huang, K., Hussain, A., & Goulermas, Y. (2019). MPSSD: Multi-Path Fusion Single Shot Detector. In 2019 International Joint Conference on Neural Networks, IJCNN 2019 Article 8852053 (Proceedings of the International Joint Conference on Neural Networks; Vol. 2019-July). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IJCNN.2019.8852053

@inproceedings{7fd4d9cf8bc74c27b6619066b27cc4b6,

title = "MPSSD: Multi-Path Fusion Single Shot Detector",

abstract = "Recent prevalent one stage detectors, such as single shot detector (SSD) and RetinaNet, are able to detect objects faster than two stage ones while maintaining comparable accuracy. To further boost the accuracy, many studies focus on enhancing the multi-scale feature pyramid. Most of these current proposals focus on strengthening features on one pyramid, ignoring the rich connection among different scale features. In contrast, we propose a novel multi-path design to fully utilize the localization and semantics information. First, we exploit the original SSD multi-scale features as our base pyramid. Then we fuse these features in different groups to generate multi-path feature pyramids. Finally, we combine these pyramids through a novel and effective aggregation module, to obtain the final informative pyramid for detection. Comparative experiments on benchmark PASCAL VOC and MS COCO datasets have shown that our proposed method outperforms many state-of-the-art detectors. As an illustrative example, for input image with size 512×512, we can achieve a mean Average Precision (mAP) of 81.8% on VOC2007 test and 33.1% mAP on COCO test-dev2015.",

keywords = "Fusion, Multiple Path, Object Detection, SSD",

author = "Shuyi Qu and Kaizhu Huang and Amir Hussain and Yannis Goulermas",

note = "Publisher Copyright: {\textcopyright} 2019 IEEE.; 2019 International Joint Conference on Neural Networks, IJCNN 2019 ; Conference date: 14-07-2019 Through 19-07-2019",

year = "2019",

month = jul,

doi = "10.1109/IJCNN.2019.8852053",

language = "English",

series = "Proceedings of the International Joint Conference on Neural Networks",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2019 International Joint Conference on Neural Networks, IJCNN 2019",

}

Qu, S, Huang, K, Hussain, A & Goulermas, Y 2019, MPSSD: Multi-Path Fusion Single Shot Detector. in 2019 International Joint Conference on Neural Networks, IJCNN 2019., 8852053, Proceedings of the International Joint Conference on Neural Networks, vol. 2019-July, Institute of Electrical and Electronics Engineers Inc., 2019 International Joint Conference on Neural Networks, IJCNN 2019, Budapest, Hungary, 14/07/19. https://doi.org/10.1109/IJCNN.2019.8852053

MPSSD: Multi-Path Fusion Single Shot Detector. / Qu, Shuyi; Huang, Kaizhu; Hussain, Amir et al.
2019 International Joint Conference on Neural Networks, IJCNN 2019. Institute of Electrical and Electronics Engineers Inc., 2019. 8852053 (Proceedings of the International Joint Conference on Neural Networks; Vol. 2019-July).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - MPSSD

T2 - 2019 International Joint Conference on Neural Networks, IJCNN 2019

AU - Qu, Shuyi

AU - Huang, Kaizhu

AU - Hussain, Amir

AU - Goulermas, Yannis

PY - 2019/7

Y1 - 2019/7

N2 - Recent prevalent one stage detectors, such as single shot detector (SSD) and RetinaNet, are able to detect objects faster than two stage ones while maintaining comparable accuracy. To further boost the accuracy, many studies focus on enhancing the multi-scale feature pyramid. Most of these current proposals focus on strengthening features on one pyramid, ignoring the rich connection among different scale features. In contrast, we propose a novel multi-path design to fully utilize the localization and semantics information. First, we exploit the original SSD multi-scale features as our base pyramid. Then we fuse these features in different groups to generate multi-path feature pyramids. Finally, we combine these pyramids through a novel and effective aggregation module, to obtain the final informative pyramid for detection. Comparative experiments on benchmark PASCAL VOC and MS COCO datasets have shown that our proposed method outperforms many state-of-the-art detectors. As an illustrative example, for input image with size 512×512, we can achieve a mean Average Precision (mAP) of 81.8% on VOC2007 test and 33.1% mAP on COCO test-dev2015.

AB - Recent prevalent one stage detectors, such as single shot detector (SSD) and RetinaNet, are able to detect objects faster than two stage ones while maintaining comparable accuracy. To further boost the accuracy, many studies focus on enhancing the multi-scale feature pyramid. Most of these current proposals focus on strengthening features on one pyramid, ignoring the rich connection among different scale features. In contrast, we propose a novel multi-path design to fully utilize the localization and semantics information. First, we exploit the original SSD multi-scale features as our base pyramid. Then we fuse these features in different groups to generate multi-path feature pyramids. Finally, we combine these pyramids through a novel and effective aggregation module, to obtain the final informative pyramid for detection. Comparative experiments on benchmark PASCAL VOC and MS COCO datasets have shown that our proposed method outperforms many state-of-the-art detectors. As an illustrative example, for input image with size 512×512, we can achieve a mean Average Precision (mAP) of 81.8% on VOC2007 test and 33.1% mAP on COCO test-dev2015.

KW - Fusion

KW - Multiple Path

KW - Object Detection

KW - SSD

UR - http://www.scopus.com/inward/record.url?scp=85073255773&partnerID=8YFLogxK

U2 - 10.1109/IJCNN.2019.8852053

DO - 10.1109/IJCNN.2019.8852053

M3 - Conference Proceeding

AN - SCOPUS:85073255773

T3 - Proceedings of the International Joint Conference on Neural Networks

BT - 2019 International Joint Conference on Neural Networks, IJCNN 2019

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 14 July 2019 through 19 July 2019

ER -

MPSSD: Multi-Path Fusion Single Shot Detector

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this