Attention-gating for improved radio galaxy classification

Micah Bowles; Anna M.M. Scaife; Fiona Porter; Hongming Tang; David J. Bastien

doi:10.1093/mnras/staa3946

Attention-gating for improved radio galaxy classification

Micah Bowles^*, Anna M.M. Scaife, Fiona Porter, Hongming Tang, David J. Bastien

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

29 Citations (Scopus)

Abstract

In this work, we introduce attention as a state-of-the-art mechanism for classification of radio galaxies, using convolutional neural networks. We present an attention-based model that performs on par with previous classifiers while using more than 50 per cent fewer parameters than the next smallest classic CNN application in this field. We demonstrate quantitatively how the selection of normalization and aggregation methods used in attention-gating can affect the output of individual models, and show that the resulting attention maps can be used to interpret the classification choices made by the model. We observe that the salient regions identified by the our model align well with the regions an expert human classifier would attend to make equivalent classifications. We show that while the selection of normalization and aggregation may only minimally affect the performance of individual models, it can significantly affect the interpretability of the respective attention maps and by selecting a model which aligns well with how astronomers classify radio sources by eye, a user can employ the model in a more effective manner.

Original language	English
Pages (from-to)	4579-4595
Number of pages	17
Journal	Monthly Notices of the Royal Astronomical Society
Volume	501
Issue number	3
DOIs	https://doi.org/10.1093/mnras/staa3946
Publication status	Published - 1 Mar 2021
Externally published	Yes

Keywords

methods: statistical
radio continuum: galaxies
techniques: image processing

Access to Document

10.1093/mnras/staa3946

Cite this

@article{a89a683403cf463a8ba371257586b7af,

title = "Attention-gating for improved radio galaxy classification",

abstract = "In this work, we introduce attention as a state-of-the-art mechanism for classification of radio galaxies, using convolutional neural networks. We present an attention-based model that performs on par with previous classifiers while using more than 50 per cent fewer parameters than the next smallest classic CNN application in this field. We demonstrate quantitatively how the selection of normalization and aggregation methods used in attention-gating can affect the output of individual models, and show that the resulting attention maps can be used to interpret the classification choices made by the model. We observe that the salient regions identified by the our model align well with the regions an expert human classifier would attend to make equivalent classifications. We show that while the selection of normalization and aggregation may only minimally affect the performance of individual models, it can significantly affect the interpretability of the respective attention maps and by selecting a model which aligns well with how astronomers classify radio sources by eye, a user can employ the model in a more effective manner.",

keywords = "methods: statistical, radio continuum: galaxies, techniques: image processing",

author = "Micah Bowles and Scaife, {Anna M.M.} and Fiona Porter and Hongming Tang and Bastien, {David J.}",

note = "Publisher Copyright: {\textcopyright} 2021 2020 The Author(s) Published by Oxford University Press on behalf of Royal Astronomical Society.",

year = "2021",

month = mar,

day = "1",

doi = "10.1093/mnras/staa3946",

language = "English",

volume = "501",

pages = "4579--4595",

journal = "Monthly Notices of the Royal Astronomical Society",

issn = "0035-8711",

number = "3",

}

TY - JOUR

T1 - Attention-gating for improved radio galaxy classification

AU - Bowles, Micah

AU - Scaife, Anna M.M.

AU - Porter, Fiona

AU - Tang, Hongming

AU - Bastien, David J.

PY - 2021/3/1

Y1 - 2021/3/1

N2 - In this work, we introduce attention as a state-of-the-art mechanism for classification of radio galaxies, using convolutional neural networks. We present an attention-based model that performs on par with previous classifiers while using more than 50 per cent fewer parameters than the next smallest classic CNN application in this field. We demonstrate quantitatively how the selection of normalization and aggregation methods used in attention-gating can affect the output of individual models, and show that the resulting attention maps can be used to interpret the classification choices made by the model. We observe that the salient regions identified by the our model align well with the regions an expert human classifier would attend to make equivalent classifications. We show that while the selection of normalization and aggregation may only minimally affect the performance of individual models, it can significantly affect the interpretability of the respective attention maps and by selecting a model which aligns well with how astronomers classify radio sources by eye, a user can employ the model in a more effective manner.

AB - In this work, we introduce attention as a state-of-the-art mechanism for classification of radio galaxies, using convolutional neural networks. We present an attention-based model that performs on par with previous classifiers while using more than 50 per cent fewer parameters than the next smallest classic CNN application in this field. We demonstrate quantitatively how the selection of normalization and aggregation methods used in attention-gating can affect the output of individual models, and show that the resulting attention maps can be used to interpret the classification choices made by the model. We observe that the salient regions identified by the our model align well with the regions an expert human classifier would attend to make equivalent classifications. We show that while the selection of normalization and aggregation may only minimally affect the performance of individual models, it can significantly affect the interpretability of the respective attention maps and by selecting a model which aligns well with how astronomers classify radio sources by eye, a user can employ the model in a more effective manner.

KW - methods: statistical

KW - radio continuum: galaxies

KW - techniques: image processing

UR - http://www.scopus.com/inward/record.url?scp=85101052122&partnerID=8YFLogxK

U2 - 10.1093/mnras/staa3946

DO - 10.1093/mnras/staa3946

M3 - Article

AN - SCOPUS:85101052122

SN - 0035-8711

VL - 501

SP - 4579

EP - 4595

JO - Monthly Notices of the Royal Astronomical Society

JF - Monthly Notices of the Royal Astronomical Society

IS - 3

ER -

Attention-gating for improved radio galaxy classification

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this