TY - GEN
T1 - Bird sound detection based on binarized convolutional neural networks
AU - Song, Jianan
AU - Li, Shengchen
N1 - Publisher Copyright:
© Springer Nature Singapore Pte Ltd. 2019.
PY - 2019
Y1 - 2019
N2 - Bird Sound Detection (BSD) is helpful for monitoring biodiversity and in this regard, deep learning networks have shown good performance in BSD in recent years. However, such a complex network structure requires high memory resources and computing power at great cost for performing the extensive calculations required, which make it difficult to implement the hardware in BSD. Therefore, we designed an audio classification method for BSD using a Binarized Convolutional Neural Network (BCNN). The convolutional layers and fully connected layers of the original Convolutional Neural Network were binarized to two values. The Area Under ROC Curve (AUC) score of BCNN achieved comparable results with the CNN in an unseen evaluation. This paper proposes two networks (CNNs and BCNNs) for the BSD task of the IEEE AASP Challenge on the Detection and Classification of Acoustic Scenes and Events (DCASE2018). The Area Under ROC Curve (AUC) score of BCNN achieved comparable results with CNN on the unseen evaluation data. More importantly, the use of the BCNN could reduce the memory requirement and the hardware loss unit, which are of great significance to the hardware implementation of a bird sound detection system.
AB - Bird Sound Detection (BSD) is helpful for monitoring biodiversity and in this regard, deep learning networks have shown good performance in BSD in recent years. However, such a complex network structure requires high memory resources and computing power at great cost for performing the extensive calculations required, which make it difficult to implement the hardware in BSD. Therefore, we designed an audio classification method for BSD using a Binarized Convolutional Neural Network (BCNN). The convolutional layers and fully connected layers of the original Convolutional Neural Network were binarized to two values. The Area Under ROC Curve (AUC) score of BCNN achieved comparable results with the CNN in an unseen evaluation. This paper proposes two networks (CNNs and BCNNs) for the BSD task of the IEEE AASP Challenge on the Detection and Classification of Acoustic Scenes and Events (DCASE2018). The Area Under ROC Curve (AUC) score of BCNN achieved comparable results with CNN on the unseen evaluation data. More importantly, the use of the BCNN could reduce the memory requirement and the hardware loss unit, which are of great significance to the hardware implementation of a bird sound detection system.
KW - Binarized neural network
KW - Bird sound detection
KW - Convolutional neural networks
UR - http://www.scopus.com/inward/record.url?scp=85070752831&partnerID=8YFLogxK
U2 - 10.1007/978-981-13-8707-4_6
DO - 10.1007/978-981-13-8707-4_6
M3 - Conference Proceeding
AN - SCOPUS:85070752831
SN - 9789811387067
T3 - Lecture Notes in Electrical Engineering
SP - 63
EP - 71
BT - Proceedings of the 6th Conference on Sound and Music Technology, CSMT - Revised Selected Papers, 2018
A2 - Li, Wei
A2 - Li, Shengchen
A2 - Shao, Xi
A2 - Li, Zijin
PB - Springer Verlag
T2 - 6th Conference on Sound and Music Technology, CSMT 2018
Y2 - 24 November 2018 through 26 November 2018
ER -