TY - JOUR
T1 - Salient object detection combining a self-attention module and a feature pyramid network
AU - Ren, Guangyu
AU - Dai, Tianhong
AU - Barmpoutis, Panagiotis
AU - Stathaki, Tania
N1 - Publisher Copyright:
© 2020 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2020/10
Y1 - 2020/10
N2 - Salient object detection has improved substantially with the use of Fully Convolutional Networks (FCNs). However, the FCN-based U-shape architecture may dilute high-level semantic information during the up-sampling operations in the top-down pathway, which weakens salient object localization and degrades boundaries. To overcome this limitation, we propose a novel pyramid self-attention module (PSAM) and adopt an independent feature-complementing strategy. In PSAM, self-attention layers are applied after multi-scale pyramid features to capture richer high-level features and enlarge the model's receptive field. In addition, a channel-wise attention module is employed to reduce redundant features in the feature pyramid network (FPN) and refine the results. Experimental analysis demonstrates that the proposed PSAM contributes effectively to the whole model, which outperforms state-of-the-art methods on five challenging datasets. Finally, quantitative results show that PSAM produces accurate predictions and integral saliency maps, which can further benefit other computer vision tasks, such as object detection and semantic segmentation.
AB - Salient object detection has improved substantially with the use of Fully Convolutional Networks (FCNs). However, the FCN-based U-shape architecture may dilute high-level semantic information during the up-sampling operations in the top-down pathway, which weakens salient object localization and degrades boundaries. To overcome this limitation, we propose a novel pyramid self-attention module (PSAM) and adopt an independent feature-complementing strategy. In PSAM, self-attention layers are applied after multi-scale pyramid features to capture richer high-level features and enlarge the model's receptive field. In addition, a channel-wise attention module is employed to reduce redundant features in the feature pyramid network (FPN) and refine the results. Experimental analysis demonstrates that the proposed PSAM contributes effectively to the whole model, which outperforms state-of-the-art methods on five challenging datasets. Finally, quantitative results show that PSAM produces accurate predictions and integral saliency maps, which can further benefit other computer vision tasks, such as object detection and semantic segmentation.
KW - Feature pyramid network
KW - Fully convolutional network
KW - Pyramid self-attention module
KW - Salient object detection
UR - http://www.scopus.com/inward/record.url?scp=85092550928&partnerID=8YFLogxK
U2 - 10.3390/electronics9101702
DO - 10.3390/electronics9101702
M3 - Article
AN - SCOPUS:85092550928
SN - 2079-9292
VL - 9
SP - 1
EP - 13
JO - Electronics (Switzerland)
JF - Electronics (Switzerland)
IS - 10
M1 - 1702
ER -
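
For readers of the abstract above, the following is a minimal PyTorch sketch of the pyramid self-attention idea it describes: non-local self-attention applied to multi-scale pooled copies of a high-level pyramid feature, fused back to the original resolution. The module names (SelfAttention2d, PyramidSelfAttention), pooling scales, and channel sizes are illustrative assumptions, not the authors' implementation; see the DOI above for the paper.

import torch
import torch.nn as nn
import torch.nn.functional as F


class SelfAttention2d(nn.Module):
    """Non-local style self-attention over an HxW feature map (assumed design)."""

    def __init__(self, channels: int):
        super().__init__()
        self.query = nn.Conv2d(channels, channels // 8, kernel_size=1)
        self.key = nn.Conv2d(channels, channels // 8, kernel_size=1)
        self.value = nn.Conv2d(channels, channels, kernel_size=1)
        # Learned residual weight; starts at 0 so the module begins as identity.
        self.gamma = nn.Parameter(torch.zeros(1))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        q = self.query(x).flatten(2).transpose(1, 2)   # (b, hw, c//8)
        k = self.key(x).flatten(2)                     # (b, c//8, hw)
        attn = F.softmax(q @ k, dim=-1)                # (b, hw, hw) affinities
        v = self.value(x).flatten(2)                   # (b, c, hw)
        out = (v @ attn.transpose(1, 2)).view(b, c, h, w)
        return self.gamma * out + x                    # residual connection


class PyramidSelfAttention(nn.Module):
    """Self-attention on multi-scale pooled copies of one pyramid feature,
    upsampled and fused back; the pooling scales are assumptions."""

    def __init__(self, channels: int, scales=(1, 2, 4)):
        super().__init__()
        self.scales = scales
        self.attns = nn.ModuleList([SelfAttention2d(channels) for _ in scales])
        self.fuse = nn.Conv2d(channels * len(scales), channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h, w = x.shape[-2:]
        outs = []
        for scale, attn in zip(self.scales, self.attns):
            # Pool to a coarser grid, attend, then restore the input size.
            y = F.adaptive_avg_pool2d(x, (max(h // scale, 1), max(w // scale, 1)))
            y = attn(y)
            outs.append(F.interpolate(y, size=(h, w), mode="bilinear",
                                      align_corners=False))
        return self.fuse(torch.cat(outs, dim=1))


if __name__ == "__main__":
    feat = torch.randn(1, 256, 16, 16)   # e.g. a top-level FPN feature map
    psam = PyramidSelfAttention(256)
    print(psam(feat).shape)              # torch.Size([1, 256, 16, 16])

The channel-wise attention over FPN features that the abstract also mentions could be added after fusion in a similar spirit (e.g., a squeeze-and-excitation style gate), but is omitted here for brevity.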