TY - JOUR
T1 - Attention-gating for improved radio galaxy classification
AU - Bowles, Micah
AU - Scaife, Anna M.M.
AU - Porter, Fiona
AU - Tang, Hongming
AU - Bastien, David J.
N1 - Publisher Copyright:
© 2021 2020 The Author(s) Published by Oxford University Press on behalf of Royal Astronomical Society.
PY - 2021/3/1
Y1 - 2021/3/1
N2 - In this work, we introduce attention as a state-of-the-art mechanism for classification of radio galaxies, using convolutional neural networks. We present an attention-based model that performs on par with previous classifiers while using more than 50 per cent fewer parameters than the next smallest classic CNN application in this field. We demonstrate quantitatively how the selection of normalization and aggregation methods used in attention-gating can affect the output of individual models, and show that the resulting attention maps can be used to interpret the classification choices made by the model. We observe that the salient regions identified by the our model align well with the regions an expert human classifier would attend to make equivalent classifications. We show that while the selection of normalization and aggregation may only minimally affect the performance of individual models, it can significantly affect the interpretability of the respective attention maps and by selecting a model which aligns well with how astronomers classify radio sources by eye, a user can employ the model in a more effective manner.
AB - In this work, we introduce attention as a state-of-the-art mechanism for classification of radio galaxies, using convolutional neural networks. We present an attention-based model that performs on par with previous classifiers while using more than 50 per cent fewer parameters than the next smallest classic CNN application in this field. We demonstrate quantitatively how the selection of normalization and aggregation methods used in attention-gating can affect the output of individual models, and show that the resulting attention maps can be used to interpret the classification choices made by the model. We observe that the salient regions identified by the our model align well with the regions an expert human classifier would attend to make equivalent classifications. We show that while the selection of normalization and aggregation may only minimally affect the performance of individual models, it can significantly affect the interpretability of the respective attention maps and by selecting a model which aligns well with how astronomers classify radio sources by eye, a user can employ the model in a more effective manner.
KW - methods: statistical
KW - radio continuum: galaxies
KW - techniques: image processing
UR - http://www.scopus.com/inward/record.url?scp=85101052122&partnerID=8YFLogxK
U2 - 10.1093/mnras/staa3946
DO - 10.1093/mnras/staa3946
M3 - Article
AN - SCOPUS:85101052122
SN - 0035-8711
VL - 501
SP - 4579
EP - 4595
JO - Monthly Notices of the Royal Astronomical Society
JF - Monthly Notices of the Royal Astronomical Society
IS - 3
ER -