Region-guided network with visual cues correction for infrared small target detection

Junjie Zhang; Ding Li; Haoran Jiang; Dan Zeng

doi:10.1007/s00371-023-02892-0

Region-guided network with visual cues correction for infrared small target detection

Junjie Zhang, Ding Li, Haoran Jiang, Dan Zeng^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Infrared small target detection (IRSTD) has experienced fast developments in recent years and been widely applied in civilian and military fields. The long imaging distance and complex backgrounds of infrared images often make the interested targets present in small scales and lack of contour features, which poses great challenges for the detection. Though deep neural network-based methods have been thoroughly investigated in IRSTD, deep layers generally struggle to retain the visual details and positions of small targets, aggravating the miss detection and false alarms. To address the above issue, we propose a Region-Guided Network with visual cues correction (RGNet) for IRSTD. More specifically, we design a Region Guidance Module embedded in shallow layers to generate the foreground mask by leveraging rich visual details contained in low-level features. The obtained mask then guides the re-weighting of deep feature maps to highlight the targets for further localization. Considering noisy signals in backgrounds tend to increase the false alarms of small targets, we propose a Visual Cues Correction Module, which extracts the regional features from low-level features by referring to the predicted positions of initial results, and conducts a binary classification to rule out the negative detection. Since the open-sourced IRSTD datasets are limited, we utilize both public and collected data for the evaluation. Both multi-target and single-target cases are investigated, and comprehensive experimental results indicate that compared to state-of-art models, our method achieves the overall best performance in both scenarios.

Original language	English
Pages (from-to)	1915-1930
Number of pages	16
Journal	Visual Computer
Volume	40
Issue number	3
DOIs	https://doi.org/10.1007/s00371-023-02892-0
Publication status	Published - Mar 2024
Externally published	Yes

Keywords

Infrared image
Region-guided
Small targets
Visual cues

Access to Document

10.1007/s00371-023-02892-0

Cite this

@article{1a78538811ea4dfcbde7e31e89b24b6c,

title = "Region-guided network with visual cues correction for infrared small target detection",

abstract = "Infrared small target detection (IRSTD) has experienced fast developments in recent years and been widely applied in civilian and military fields. The long imaging distance and complex backgrounds of infrared images often make the interested targets present in small scales and lack of contour features, which poses great challenges for the detection. Though deep neural network-based methods have been thoroughly investigated in IRSTD, deep layers generally struggle to retain the visual details and positions of small targets, aggravating the miss detection and false alarms. To address the above issue, we propose a Region-Guided Network with visual cues correction (RGNet) for IRSTD. More specifically, we design a Region Guidance Module embedded in shallow layers to generate the foreground mask by leveraging rich visual details contained in low-level features. The obtained mask then guides the re-weighting of deep feature maps to highlight the targets for further localization. Considering noisy signals in backgrounds tend to increase the false alarms of small targets, we propose a Visual Cues Correction Module, which extracts the regional features from low-level features by referring to the predicted positions of initial results, and conducts a binary classification to rule out the negative detection. Since the open-sourced IRSTD datasets are limited, we utilize both public and collected data for the evaluation. Both multi-target and single-target cases are investigated, and comprehensive experimental results indicate that compared to state-of-art models, our method achieves the overall best performance in both scenarios.",

keywords = "Infrared image, Region-guided, Small targets, Visual cues",

author = "Junjie Zhang and Ding Li and Haoran Jiang and Dan Zeng",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023.",

year = "2024",

month = mar,

doi = "10.1007/s00371-023-02892-0",

language = "English",

volume = "40",

pages = "1915--1930",

journal = "Visual Computer",

issn = "0178-2789",

number = "3",

}

TY - JOUR

T1 - Region-guided network with visual cues correction for infrared small target detection

AU - Zhang, Junjie

AU - Li, Ding

AU - Jiang, Haoran

AU - Zeng, Dan

N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023.

PY - 2024/3

Y1 - 2024/3

N2 - Infrared small target detection (IRSTD) has experienced fast developments in recent years and been widely applied in civilian and military fields. The long imaging distance and complex backgrounds of infrared images often make the interested targets present in small scales and lack of contour features, which poses great challenges for the detection. Though deep neural network-based methods have been thoroughly investigated in IRSTD, deep layers generally struggle to retain the visual details and positions of small targets, aggravating the miss detection and false alarms. To address the above issue, we propose a Region-Guided Network with visual cues correction (RGNet) for IRSTD. More specifically, we design a Region Guidance Module embedded in shallow layers to generate the foreground mask by leveraging rich visual details contained in low-level features. The obtained mask then guides the re-weighting of deep feature maps to highlight the targets for further localization. Considering noisy signals in backgrounds tend to increase the false alarms of small targets, we propose a Visual Cues Correction Module, which extracts the regional features from low-level features by referring to the predicted positions of initial results, and conducts a binary classification to rule out the negative detection. Since the open-sourced IRSTD datasets are limited, we utilize both public and collected data for the evaluation. Both multi-target and single-target cases are investigated, and comprehensive experimental results indicate that compared to state-of-art models, our method achieves the overall best performance in both scenarios.

AB - Infrared small target detection (IRSTD) has experienced fast developments in recent years and been widely applied in civilian and military fields. The long imaging distance and complex backgrounds of infrared images often make the interested targets present in small scales and lack of contour features, which poses great challenges for the detection. Though deep neural network-based methods have been thoroughly investigated in IRSTD, deep layers generally struggle to retain the visual details and positions of small targets, aggravating the miss detection and false alarms. To address the above issue, we propose a Region-Guided Network with visual cues correction (RGNet) for IRSTD. More specifically, we design a Region Guidance Module embedded in shallow layers to generate the foreground mask by leveraging rich visual details contained in low-level features. The obtained mask then guides the re-weighting of deep feature maps to highlight the targets for further localization. Considering noisy signals in backgrounds tend to increase the false alarms of small targets, we propose a Visual Cues Correction Module, which extracts the regional features from low-level features by referring to the predicted positions of initial results, and conducts a binary classification to rule out the negative detection. Since the open-sourced IRSTD datasets are limited, we utilize both public and collected data for the evaluation. Both multi-target and single-target cases are investigated, and comprehensive experimental results indicate that compared to state-of-art models, our method achieves the overall best performance in both scenarios.

KW - Infrared image

KW - Region-guided

KW - Small targets

KW - Visual cues

UR - http://www.scopus.com/inward/record.url?scp=85160238602&partnerID=8YFLogxK

U2 - 10.1007/s00371-023-02892-0

DO - 10.1007/s00371-023-02892-0

M3 - Article

AN - SCOPUS:85160238602

SN - 0178-2789

VL - 40

SP - 1915

EP - 1930

JO - Visual Computer

JF - Visual Computer

IS - 3

ER -

Region-guided network with visual cues correction for infrared small target detection

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this