Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge

Christian Kroos; Oliver Bones; Yin Cao; Lara Harris; Philip J.B. Jackson; William J. Davies; Wenwu Wang; Trevor J. Cox; Mark D. Plumbley

doi:10.1109/ICASSP.2019.8683292

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

12 Citations (Scopus)

Abstract

Humans are able to identify a large number of environmental sounds and categorise them according to high-level semantic categories, e.g. urban sounds or music. They are also capable of generalising from past experience to new sounds when applying these categories. In this paper we report on the creation of a data set that is structured according to the top level of a taxonomy derived from human judgements and the design of an associated machine learning challenge, in which strong generalisation abilities are required to be successful. We introduce a baseline classification system, a deep convolutional network, which showed strong performance with an average accuracy on the evaluation data of 80.8%. The result is discussed in the light of two alternative explanations: an unlikely accidental category bias in the sound recordings or a more plausible true acoustic grounding of the high-level categories.

Original language	English
Title of host publication	2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	8082-8086
Number of pages	5
ISBN (Electronic)	9781479981311
DOIs	https://doi.org/10.1109/ICASSP.2019.8683292
Publication status	Published - May 2019
Externally published	Yes
Event	44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019, Brighton, United Kingdom; Duration: 12 May 2019 → 17 May 2019

Publication series

Name	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume	2019-May
ISSN (Print)	1520-6149

Conference

Conference	44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019
Country/Territory	United Kingdom
City	Brighton
Period	12/05/19 → 17/05/19

Keywords

Acoustic classification
convolutional neural network
deep learning
machine learning challenge
sound taxonomy

Cite this

Kroos, C., Bones, O., Cao, Y., Harris, L., Jackson, P. J. B., Davies, W. J., Wang, W., Cox, T. J., & Plumbley, M. D. (2019). Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge. In 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings (pp. 8082-8086). Article 8683292 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. 2019-May). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICASSP.2019.8683292

Kroos, Christian ; Bones, Oliver ; Cao, Yin et al. / Generalisation in Environmental Sound Classification : The 'Making Sense of Sounds' Data Set and Challenge. 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 8082-8086 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

@inproceedings{b397f47088f64408a9a4757ad17932e3,
title = "Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge",
abstract = "Humans are able to identify a large number of environmental sounds and categorise them according to high-level semantic categories, e.g. urban sounds or music. They are also capable of generalising from past experience to new sounds when applying these categories. In this paper we report on the creation of a data set that is structured according to the top-level of a taxonomy derived from human judgements and the design of an associated machine learning challenge, in which strong generalisation abilities are required to be successful. We introduce a baseline classification system, a deep convolutional network, which showed strong performance with an average accuracy on the evaluation data of 80.8%. The result is discussed in the light of two alternative explanations: An unlikely accidental category bias in the sound recordings or a more plausible true acoustic grounding of the high-level categories.",
keywords = "Acoustic classification, convolutional neural network, deep learning, machine learning challenge, sound taxonomy",
author = "Christian Kroos and Oliver Bones and Yin Cao and Lara Harris and Jackson, {Philip J.B.} and Davies, {William J.} and Wenwu Wang and Cox, {Trevor J.} and Plumbley, {Mark D.}",
note = "Publisher Copyright: {\textcopyright} 2019 IEEE.; 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 ; Conference date: 12-05-2019 Through 17-05-2019",
year = "2019",
month = may,
doi = "10.1109/ICASSP.2019.8683292",
language = "English",
series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "8082--8086",
booktitle = "2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings",
}

Kroos, C, Bones, O, Cao, Y, Harris, L, Jackson, PJB, Davies, WJ, Wang, W, Cox, TJ & Plumbley, MD 2019, Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge. in 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings., 8683292, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, vol. 2019-May, Institute of Electrical and Electronics Engineers Inc., pp. 8082-8086, 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019, Brighton, United Kingdom, 12/05/19. https://doi.org/10.1109/ICASSP.2019.8683292

Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge. / Kroos, Christian; Bones, Oliver; Cao, Yin et al.
2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. p. 8082-8086 8683292 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. 2019-May).

TY - GEN

T1 - Generalisation in Environmental Sound Classification

T2 - 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019

AU - Kroos, Christian

AU - Bones, Oliver

AU - Cao, Yin

AU - Harris, Lara

AU - Jackson, Philip J.B.

AU - Davies, William J.

AU - Wang, Wenwu

AU - Cox, Trevor J.

AU - Plumbley, Mark D.

PY - 2019/5

Y1 - 2019/5

N2 - Humans are able to identify a large number of environmental sounds and categorise them according to high-level semantic categories, e.g. urban sounds or music. They are also capable of generalising from past experience to new sounds when applying these categories. In this paper we report on the creation of a data set that is structured according to the top-level of a taxonomy derived from human judgements and the design of an associated machine learning challenge, in which strong generalisation abilities are required to be successful. We introduce a baseline classification system, a deep convolutional network, which showed strong performance with an average accuracy on the evaluation data of 80.8%. The result is discussed in the light of two alternative explanations: An unlikely accidental category bias in the sound recordings or a more plausible true acoustic grounding of the high-level categories.

AB - Humans are able to identify a large number of environmental sounds and categorise them according to high-level semantic categories, e.g. urban sounds or music. They are also capable of generalising from past experience to new sounds when applying these categories. In this paper we report on the creation of a data set that is structured according to the top-level of a taxonomy derived from human judgements and the design of an associated machine learning challenge, in which strong generalisation abilities are required to be successful. We introduce a baseline classification system, a deep convolutional network, which showed strong performance with an average accuracy on the evaluation data of 80.8%. The result is discussed in the light of two alternative explanations: An unlikely accidental category bias in the sound recordings or a more plausible true acoustic grounding of the high-level categories.

KW - Acoustic classification

KW - convolutional neural network

KW - deep learning

KW - machine learning challenge

KW - sound taxonomy

UR - http://www.scopus.com/inward/record.url?scp=85068989594&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2019.8683292

DO - 10.1109/ICASSP.2019.8683292

M3 - Conference Proceeding

AN - SCOPUS:85068989594

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - 8082

EP - 8086

BT - 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 12 May 2019 through 17 May 2019

ER -

Kroos C, Bones O, Cao Y, Harris L, Jackson PJB, Davies WJ et al. Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge. In 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2019. p. 8082-8086. 8683292. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). doi: 10.1109/ICASSP.2019.8683292
