Data mining for news content: a case of Cantonese opera news topic modelling analysis

Bifeng Wang, Xiaotong Xu, Haocih Chen, Xinyi Xie, Jiawei Chen, Qian Liu, Yong Fu*

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

1 Citation (Scopus)

Abstract

Topic modelling approach is widely used for text data mining in NLP(Natural Language Processing). Text mining has been used for analysis of ICH (intangible cultural heritage), where Cantonese opera is a representative ICH of Lingnan culture. This study retrieved news content on Cantonese Opera and used machine learning analysis (LDA topic modelling) method to find out the distribution of the topics. Four main themes are concluded: the development, cooperation, and inheritance of Cantonese opera(taken up to 45.1% in all data); The traditional form(23.5%); Innovative forms(18.3%); Education and cultural inheritance of Cantonese opera(13.2%). This research further explored how to better promote Cantonese opera by analysing the topics as well as the data, and suggested that emphasis should be placed on the innovation of traditional elements in Cantonese opera, keeping them close to life, and education.

Original languageEnglish
Title of host publicationThird International Conference on Intelligent Computing and Human-Computer Interaction, ICHCI 2022
EditorsKannimuthu Subramanian
PublisherSPIE
ISBN (Electronic)9781510661301
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event3rd International Conference on Intelligent Computing and Human-Computer Interaction, ICHCI 2022 - Guangzhou, China
Duration: 12 Aug 202214 Aug 2022

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume12509
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference3rd International Conference on Intelligent Computing and Human-Computer Interaction, ICHCI 2022
Country/TerritoryChina
CityGuangzhou
Period12/08/2214/08/22

Keywords

  • Cantonese opera
  • Data mining
  • LDA
  • machine learning
  • NLP
  • topic modelling

Fingerprint

Dive into the research topics of 'Data mining for news content: a case of Cantonese opera news topic modelling analysis'. Together they form a unique fingerprint.

Cite this