TY - JOUR
T1 - ComSense-CNN: acoustic event classification via 1D convolutional neural network with compressed sensing
AU - Tan, Pooi Shiang
AU - Lim, Kian Ming
AU - Tan, Cheah Heng
AU - Lee, Chin Poo
AU - Kwek, Lee Chung
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
PY - 2023/4
Y1 - 2023/4
N2 - Sound plays an important role in human daily life as humans use sound to communicate with each other and to understand the events occurring in the surroundings. This has prompted the researchers to further study on how to automatically identify the event that is happening by analyzing the acoustic signal. This paper presents a deep learning model enhanced by compressed sensing techniques for acoustic event classification. The compressed sensing first transforms the input acoustic signal into a reconstructed signal to reduce the noise in the input acoustic signal. The reconstructed signals are then fed into a 1-dimensional convolutional neural network (1D-CNN) to train a deep learning model for the acoustic event classification. In addition, the dropout regularization is leveraged in the 1D-CNN to mitigate the overfitting problems. The proposed compressed sensing with 1D-CNN was evaluated on three benchmark datasets, namely Soundscapes1, Soundscapes2, and UrbanSound8K, and achieved F1-scores of 80.5%, 81.1%, and 69.2%, respectively.
AB - Sound plays an important role in human daily life as humans use sound to communicate with each other and to understand the events occurring in the surroundings. This has prompted the researchers to further study on how to automatically identify the event that is happening by analyzing the acoustic signal. This paper presents a deep learning model enhanced by compressed sensing techniques for acoustic event classification. The compressed sensing first transforms the input acoustic signal into a reconstructed signal to reduce the noise in the input acoustic signal. The reconstructed signals are then fed into a 1-dimensional convolutional neural network (1D-CNN) to train a deep learning model for the acoustic event classification. In addition, the dropout regularization is leveraged in the 1D-CNN to mitigate the overfitting problems. The proposed compressed sensing with 1D-CNN was evaluated on three benchmark datasets, namely Soundscapes1, Soundscapes2, and UrbanSound8K, and achieved F1-scores of 80.5%, 81.1%, and 69.2%, respectively.
KW - 1D convolutional neural network
KW - 1D-CNN
KW - Acoustic event classification
KW - Compressed sensing
UR - http://www.scopus.com/inward/record.url?scp=85132350112&partnerID=8YFLogxK
U2 - 10.1007/s11760-022-02281-5
DO - 10.1007/s11760-022-02281-5
M3 - Article
AN - SCOPUS:85132350112
SN - 1863-1703
VL - 17
SP - 735
EP - 741
JO - Signal, Image and Video Processing
JF - Signal, Image and Video Processing
IS - 3
ER -