Bird sound detection based on binarized convolutional neural networks

Jianan Song*, Shengchen Li

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

7 Citations (Scopus)

Abstract

Bird Sound Detection (BSD) is helpful for monitoring biodiversity and in this regard, deep learning networks have shown good performance in BSD in recent years. However, such a complex network structure requires high memory resources and computing power at great cost for performing the extensive calculations required, which make it difficult to implement the hardware in BSD. Therefore, we designed an audio classification method for BSD using a Binarized Convolutional Neural Network (BCNN). The convolutional layers and fully connected layers of the original Convolutional Neural Network were binarized to two values. The Area Under ROC Curve (AUC) score of BCNN achieved comparable results with the CNN in an unseen evaluation. This paper proposes two networks (CNNs and BCNNs) for the BSD task of the IEEE AASP Challenge on the Detection and Classification of Acoustic Scenes and Events (DCASE2018). The Area Under ROC Curve (AUC) score of BCNN achieved comparable results with CNN on the unseen evaluation data. More importantly, the use of the BCNN could reduce the memory requirement and the hardware loss unit, which are of great significance to the hardware implementation of a bird sound detection system.

Original languageEnglish
Title of host publicationProceedings of the 6th Conference on Sound and Music Technology, CSMT - Revised Selected Papers, 2018
EditorsWei Li, Shengchen Li, Xi Shao, Zijin Li
PublisherSpringer Verlag
Pages63-71
Number of pages9
ISBN (Print)9789811387067
DOIs
Publication statusPublished - 2019
Externally publishedYes
Event6th Conference on Sound and Music Technology, CSMT 2018 - Xiamen, China
Duration: 24 Nov 201826 Nov 2018

Publication series

NameLecture Notes in Electrical Engineering
Volume568
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference6th Conference on Sound and Music Technology, CSMT 2018
Country/TerritoryChina
CityXiamen
Period24/11/1826/11/18

Keywords

  • Binarized neural network
  • Bird sound detection
  • Convolutional neural networks

Cite this