Music Genre Classification with LSTM based on Time and Frequency Domain Features

Yinhui Yi, Xiaohui Zhu, Yong Yue, Wei Wang

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

12 Citations (Scopus)

Abstract

Deep features generated by deep learning models carry more information for music classification than short-term features. This paper uses a long short-term memory (LSTM) model to generate deep features for music genre classification. First, two short-term features, the zero-crossing rate (ZCR) and mel-frequency cepstral coefficients (MFCC), are extracted from digital music; ZCR is a time-domain feature and MFCC a frequency-domain feature. These two features are then fed to the LSTM to generate deep features. Finally, a support vector machine (SVM) and a k-nearest neighbors (KNN) classifier are each used to classify the music genre based on the deep features. Experimental results show that using the LSTM significantly increases the accuracy of music genre classification.
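The pipeline described in the abstract (short-term feature extraction, LSTM deep features, then SVM/KNN classification) can be sketched roughly as below. This is an illustrative reconstruction, not the authors' code: the librosa/Keras/scikit-learn APIs, feature dimensions, and layer sizes are assumptions.

```python
# Sketch of the abstract's pipeline: ZCR + MFCC -> LSTM deep features -> SVM/KNN.
# Feature sizes, embed_dim, and training settings are illustrative assumptions.
import numpy as np
import librosa
from tensorflow import keras
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier

def short_term_features(path, n_mfcc=13, sr=22050):
    """Return a (frames, n_mfcc + 1) sequence: MFCC (frequency domain) + ZCR (time domain)."""
    y, sr = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)   # shape (n_mfcc, frames)
    zcr = librosa.feature.zero_crossing_rate(y)              # shape (1, frames)
    return np.vstack([mfcc, zcr]).T                          # shape (frames, n_mfcc + 1)

def build_lstm(timesteps, n_features, n_genres, embed_dim=64):
    """LSTM classifier whose last hidden state is reused as the deep feature."""
    inputs = keras.Input(shape=(timesteps, n_features))
    deep = keras.layers.LSTM(embed_dim)(inputs)              # deep feature vector
    outputs = keras.layers.Dense(n_genres, activation="softmax")(deep)
    model = keras.Model(inputs, outputs)
    model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
    extractor = keras.Model(inputs, deep)                    # exposes the deep features
    return model, extractor

# With X of shape (samples, timesteps, n_features) and integer genre labels y,
# one plausible workflow is: train the LSTM end-to-end, then hand its hidden
# states to the classical classifiers:
#   model, extractor = build_lstm(timesteps, n_features, n_genres=10)
#   model.fit(X_train, y_train, epochs=50, batch_size=32)
#   F_train, F_test = extractor.predict(X_train), extractor.predict(X_test)
#   svm = SVC(kernel="rbf").fit(F_train, y_train)
#   knn = KNeighborsClassifier(n_neighbors=5).fit(F_train, y_train)
```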

Original language: English
Title of host publication: 2021 IEEE 6th International Conference on Computer and Communication Systems, ICCCS 2021
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 678-682
Number of pages: 5
ISBN (Electronic): 9780738126043
DOIs
Publication status: Published - 23 Apr 2021
Event: 6th IEEE International Conference on Computer and Communication Systems, ICCCS 2021 - Chengdu, China
Duration: 23 Apr 2021 – 26 Apr 2021

Publication series

Name: 2021 IEEE 6th International Conference on Computer and Communication Systems, ICCCS 2021

Conference

Conference: 6th IEEE International Conference on Computer and Communication Systems, ICCCS 2021
Country/Territory: China
City: Chengdu
Period: 23/04/21 – 26/04/21

Keywords

  • Deep features
  • LSTM
  • MFCC
  • Music classification
  • ZCR
