TY - GEN
T1 - Singing voice detection using multi-feature deep fusion with CNN
AU - Zhang, Xulong
AU - Li, Shengchen
AU - Li, Zijin
AU - Chen, Shizhe
AU - Gao, Yongwei
AU - Li, Wei
N1 - Publisher Copyright:
© Springer Nature Singapore Pte Ltd 2020.
PY - 2020
Y1 - 2020
N2 - The problem of singing voice detection is to segment a song into vocal and non-vocal parts. Commonly used methods usually train a model on a set of frame-based features and then predict the unknown frames by the model. However, the multi-dimensional features are usually concatenated together for each frame, with little consideration of spatial information. Hence, a deep fusion method of the Multi-feature dimensions with Convolution Neural Networks (CNN) is proposed. A one dimension convolution is made on feature dimensions for each frames, then the high-level features obtained can be used for a direct binary classification. The performance of the proposed method is on par with the state-of-art methods on public dataset.
AB - The problem of singing voice detection is to segment a song into vocal and non-vocal parts. Commonly used methods usually train a model on a set of frame-based features and then predict the unknown frames by the model. However, the multi-dimensional features are usually concatenated together for each frame, with little consideration of spatial information. Hence, a deep fusion method of the Multi-feature dimensions with Convolution Neural Networks (CNN) is proposed. A one dimension convolution is made on feature dimensions for each frames, then the high-level features obtained can be used for a direct binary classification. The performance of the proposed method is on par with the state-of-art methods on public dataset.
KW - Convolution neural network (CNN)
KW - Deep learning
KW - Multi-feature fusion
KW - Singing voice detection (SVD)
UR - http://www.scopus.com/inward/record.url?scp=85078450640&partnerID=8YFLogxK
U2 - 10.1007/978-981-15-2756-2_4
DO - 10.1007/978-981-15-2756-2_4
M3 - Conference Proceeding
AN - SCOPUS:85078450640
SN - 9789811527555
T3 - Lecture Notes in Electrical Engineering
SP - 41
EP - 52
BT - Proceedings of the 7th Conference on Sound and Music Technology CSMT 2019, Revised Selected Papers
A2 - Li, Haifeng
A2 - Ma, Lin
A2 - Li, Shengchen
A2 - Fang, Chunying
A2 - Zhu, Yidan
PB - Springer
T2 - 7th Conference on Sound and Music Technology, CSMT 2019
Y2 - 26 December 2019 through 29 December 2019
ER -