Zero-Shot Text Classification via Knowledge Graph Embedding for Social Media Data

Qi Chen; Wei Wang; Kaizhu Huang; Frans Coenen

doi:10.1109/JIOT.2021.3093065

Zero-Shot Text Classification via Knowledge Graph Embedding for Social Media Data

Qi Chen, Wei Wang^*, Kaizhu Huang, Frans Coenen

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

26 Citations (Scopus)

Abstract

The idea of 'citizen sensing' and 'human as sensors' is crucial for social Internet of Things, an integral part of cyber-physical-social systems (CPSSs). Social media data, which can be easily collected from the social world, has become a valuable resource for research in many different disciplines, e.g., crisis/disaster assessment, social event detection, or the recent COVID-19 analysis. Useful information, or knowledge derived from social data, could better serve the public if it could be processed and analyzed in more efficient and reliable ways. Advances in deep neural networks have significantly improved the performance of many social media analysis tasks. However, deep learning models typically require a large amount of labeled data for model training, while most CPSS data is not labeled, making it impractical to build effective learning models using traditional approaches. In addition, the current state-of-the-art, pretrained natural language processing (NLP) models do not make use of existing knowledge graphs, thus often leading to unsatisfactory performance in real-world applications. To address the issues, we propose a new zero-shot learning method which makes effective use of existing knowledge graphs for the classification of very large amounts of social text data. Experiments were performed on a large, real-world tweet data set related to COVID-19, the evaluation results show that the proposed method significantly outperforms six baseline models implemented with state-of-the-art deep learning models for NLP.

Original language	English
Pages (from-to)	9205-9213
Number of pages	9
Journal	IEEE Internet of Things Journal
Volume	9
Issue number	12
DOIs	https://doi.org/10.1109/JIOT.2021.3093065
Publication status	Published - 15 Jun 2022

Keywords

Internet of Things (IoT)
knowledge graph
natural language processing (NLP)
social media data analysis
zero-shot learning

Access to Document

10.1109/JIOT.2021.3093065

Cite this

@article{e1f80eae41784a4c8835f7b5482eaaf6,

title = "Zero-Shot Text Classification via Knowledge Graph Embedding for Social Media Data",

abstract = "The idea of 'citizen sensing' and 'human as sensors' is crucial for social Internet of Things, an integral part of cyber-physical-social systems (CPSSs). Social media data, which can be easily collected from the social world, has become a valuable resource for research in many different disciplines, e.g., crisis/disaster assessment, social event detection, or the recent COVID-19 analysis. Useful information, or knowledge derived from social data, could better serve the public if it could be processed and analyzed in more efficient and reliable ways. Advances in deep neural networks have significantly improved the performance of many social media analysis tasks. However, deep learning models typically require a large amount of labeled data for model training, while most CPSS data is not labeled, making it impractical to build effective learning models using traditional approaches. In addition, the current state-of-the-art, pretrained natural language processing (NLP) models do not make use of existing knowledge graphs, thus often leading to unsatisfactory performance in real-world applications. To address the issues, we propose a new zero-shot learning method which makes effective use of existing knowledge graphs for the classification of very large amounts of social text data. Experiments were performed on a large, real-world tweet data set related to COVID-19, the evaluation results show that the proposed method significantly outperforms six baseline models implemented with state-of-the-art deep learning models for NLP.",

keywords = "Internet of Things (IoT), knowledge graph, natural language processing (NLP), social media data analysis, zero-shot learning",

author = "Qi Chen and Wei Wang and Kaizhu Huang and Frans Coenen",

note = "Publisher Copyright: {\textcopyright} 2014 IEEE.",

year = "2022",

month = jun,

day = "15",

doi = "10.1109/JIOT.2021.3093065",

language = "English",

volume = "9",

pages = "9205--9213",

journal = "IEEE Internet of Things Journal",

issn = "2327-4662",

publisher = "IEEE",

number = "12",

}

TY - JOUR

T1 - Zero-Shot Text Classification via Knowledge Graph Embedding for Social Media Data

AU - Chen, Qi

AU - Wang, Wei

AU - Huang, Kaizhu

AU - Coenen, Frans

PY - 2022/6/15

Y1 - 2022/6/15

N2 - The idea of 'citizen sensing' and 'human as sensors' is crucial for social Internet of Things, an integral part of cyber-physical-social systems (CPSSs). Social media data, which can be easily collected from the social world, has become a valuable resource for research in many different disciplines, e.g., crisis/disaster assessment, social event detection, or the recent COVID-19 analysis. Useful information, or knowledge derived from social data, could better serve the public if it could be processed and analyzed in more efficient and reliable ways. Advances in deep neural networks have significantly improved the performance of many social media analysis tasks. However, deep learning models typically require a large amount of labeled data for model training, while most CPSS data is not labeled, making it impractical to build effective learning models using traditional approaches. In addition, the current state-of-the-art, pretrained natural language processing (NLP) models do not make use of existing knowledge graphs, thus often leading to unsatisfactory performance in real-world applications. To address the issues, we propose a new zero-shot learning method which makes effective use of existing knowledge graphs for the classification of very large amounts of social text data. Experiments were performed on a large, real-world tweet data set related to COVID-19, the evaluation results show that the proposed method significantly outperforms six baseline models implemented with state-of-the-art deep learning models for NLP.

AB - The idea of 'citizen sensing' and 'human as sensors' is crucial for social Internet of Things, an integral part of cyber-physical-social systems (CPSSs). Social media data, which can be easily collected from the social world, has become a valuable resource for research in many different disciplines, e.g., crisis/disaster assessment, social event detection, or the recent COVID-19 analysis. Useful information, or knowledge derived from social data, could better serve the public if it could be processed and analyzed in more efficient and reliable ways. Advances in deep neural networks have significantly improved the performance of many social media analysis tasks. However, deep learning models typically require a large amount of labeled data for model training, while most CPSS data is not labeled, making it impractical to build effective learning models using traditional approaches. In addition, the current state-of-the-art, pretrained natural language processing (NLP) models do not make use of existing knowledge graphs, thus often leading to unsatisfactory performance in real-world applications. To address the issues, we propose a new zero-shot learning method which makes effective use of existing knowledge graphs for the classification of very large amounts of social text data. Experiments were performed on a large, real-world tweet data set related to COVID-19, the evaluation results show that the proposed method significantly outperforms six baseline models implemented with state-of-the-art deep learning models for NLP.

KW - Internet of Things (IoT)

KW - knowledge graph

KW - natural language processing (NLP)

KW - social media data analysis

KW - zero-shot learning

UR - http://www.scopus.com/inward/record.url?scp=85112209978&partnerID=8YFLogxK

U2 - 10.1109/JIOT.2021.3093065

DO - 10.1109/JIOT.2021.3093065

M3 - Article

AN - SCOPUS:85112209978

SN - 2327-4662

VL - 9

SP - 9205

EP - 9213

JO - IEEE Internet of Things Journal

JF - IEEE Internet of Things Journal

IS - 12

ER -

Zero-Shot Text Classification via Knowledge Graph Embedding for Social Media Data

Abstract

Keywords

Access to Document

Other files and links

Cite this