Sub-spectrogram Segmentation for Environmental Sound Classification via Convolutional Recurrent Neural Network and Score Level Fusion

Tianhao Qiao, Shunqing Zhang, Zhichao Zhang, Shan Cao, Shugong Xu

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

13 Citations (Scopus)

Abstract

Environmental Sound Classification (ESC) is an important and challenging problem, and feature representation is a critical and even decisive factor in ESC. Feature representation ability directly affects the accuracy of sound classification. Therefore, the ESC performance is heavily dependent on the effectiveness of representative features extracted from the environmental sounds. In this paper, we propose a sub-spectrogram segmentation based ESC classification framework. In addition, we adopt the proposed Convolutional Recurrent Neural Network (CRNN) and score level fusion to jointly improve the classification accuracy. Extensive truncation schemes are evaluated to find the optimal number and the corresponding band ranges of sub-spectrograms. Based on the numerical experiments, the proposed framework can achieve 81.9% ESC classification accuracy on the public dataset ESC-50, which provides 9.1% accuracy improvement over traditional baseline schemes.

Original languageEnglish
Title of host publication2019 IEEE International Workshop on Signal Processing Systems, SiPS 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages318-323
Number of pages6
ISBN (Electronic)9781728119274
DOIs
Publication statusPublished - Oct 2019
Externally publishedYes
Event33rd IEEE International Workshop on Signal Processing Systems, SiPS 2019 - Nanjing, China
Duration: 20 Oct 201923 Oct 2019

Publication series

NameIEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation
Volume2019-October
ISSN (Print)1520-6130

Conference

Conference33rd IEEE International Workshop on Signal Processing Systems, SiPS 2019
Country/TerritoryChina
CityNanjing
Period20/10/1923/10/19

Keywords

  • convolutional recurrent neural network
  • Environmental sound classification
  • score level fusion
  • sub-spectrogram segmentation

Fingerprint

Dive into the research topics of 'Sub-spectrogram Segmentation for Environmental Sound Classification via Convolutional Recurrent Neural Network and Score Level Fusion'. Together they form a unique fingerprint.

Cite this