Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection

Jianfei He; Lilin Wang; Jiaying Wang; Zhenyu Liu; Hongbin Na; Zimu Wang; Wei Wang; Qi Chen

doi:10.1109/SWC62898.2024.00246

Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection

Jianfei He, Lilin Wang, Jiaying Wang, Zhenyu Liu, Hongbin Na, Zimu Wang, Wei Wang, Qi Chen^*

^*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

Identifying offensive language is essential for maintaining safety and sustainability in the social media era. Though large language models (LLMs) have demonstrated encouraging potential in social media analytics, they lack thorough evaluation when in offensive language detection, particularly in multilingual environments. We for the first time evaluate multilingual offensive language detection of LLMs in three languages: English, Spanish, and German with three LLMs, GPT-3.5, Flan-T5, and Mistral, in both monolingual and multilingual settings. We further examine the impact of different prompt languages and augmented translation data for the task in non-English contexts. Furthermore, we discuss the impact of the inherent bias in LLMs and the datasets in the mispredictions related to sensitive topics.

Original language	English
Title of host publication	Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1603-1608
Number of pages	6
ISBN (Electronic)	9798331520861
DOIs	https://doi.org/10.1109/SWC62898.2024.00246
Publication status	Published - 2024
Event	10th IEEE Smart World Congress, SWC 2024 - Nadi, Fiji Duration: 2 Dec 2024 → 7 Dec 2024

Publication series

Name	Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications

Conference

Conference	10th IEEE Smart World Congress, SWC 2024
Country/Territory	Fiji
City	Nadi
Period	2/12/24 → 7/12/24

Keywords

large language models
multilingual
Offensive language detection

Access to Document

10.1109/SWC62898.2024.00246

Cite this

He, J., Wang, L., Wang, J., Liu, Z., Na, H., Wang, Z., Wang, W., & Chen, Q. (2024). Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection. In Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications (pp. 1603-1608). (Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/SWC62898.2024.00246

He, Jianfei ; Wang, Lilin ; Wang, Jiaying et al. / Guardians of Discourse : Evaluating LLMs on Multilingual Offensive Language Detection. Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 1603-1608 (Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications).

@inproceedings{6de74e5bd97b45eda89338ae46b9053f,

title = "Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection",

abstract = "Identifying offensive language is essential for maintaining safety and sustainability in the social media era. Though large language models (LLMs) have demonstrated encouraging potential in social media analytics, they lack thorough evaluation when in offensive language detection, particularly in multilingual environments. We for the first time evaluate multilingual offensive language detection of LLMs in three languages: English, Spanish, and German with three LLMs, GPT-3.5, Flan-T5, and Mistral, in both monolingual and multilingual settings. We further examine the impact of different prompt languages and augmented translation data for the task in non-English contexts. Furthermore, we discuss the impact of the inherent bias in LLMs and the datasets in the mispredictions related to sensitive topics.",

keywords = "large language models, multilingual, Offensive language detection",

author = "Jianfei He and Lilin Wang and Jiaying Wang and Zhenyu Liu and Hongbin Na and Zimu Wang and Wei Wang and Qi Chen",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 10th IEEE Smart World Congress, SWC 2024 ; Conference date: 02-12-2024 Through 07-12-2024",

year = "2024",

doi = "10.1109/SWC62898.2024.00246",

language = "English",

series = "Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1603--1608",

booktitle = "Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications",

}

He, J, Wang, L, Wang, J, Liu, Z, Na, H, Wang, Z, Wang, W & Chen, Q 2024, Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection. in Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications. Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications, Institute of Electrical and Electronics Engineers Inc., pp. 1603-1608, 10th IEEE Smart World Congress, SWC 2024, Nadi, Fiji, 2/12/24. https://doi.org/10.1109/SWC62898.2024.00246

Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection. / He, Jianfei; Wang, Lilin; Wang, Jiaying et al.
Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications. Institute of Electrical and Electronics Engineers Inc., 2024. p. 1603-1608 (Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Guardians of Discourse

T2 - 10th IEEE Smart World Congress, SWC 2024

AU - He, Jianfei

AU - Wang, Lilin

AU - Wang, Jiaying

AU - Liu, Zhenyu

AU - Na, Hongbin

AU - Wang, Zimu

AU - Wang, Wei

AU - Chen, Qi

PY - 2024

Y1 - 2024

N2 - Identifying offensive language is essential for maintaining safety and sustainability in the social media era. Though large language models (LLMs) have demonstrated encouraging potential in social media analytics, they lack thorough evaluation when in offensive language detection, particularly in multilingual environments. We for the first time evaluate multilingual offensive language detection of LLMs in three languages: English, Spanish, and German with three LLMs, GPT-3.5, Flan-T5, and Mistral, in both monolingual and multilingual settings. We further examine the impact of different prompt languages and augmented translation data for the task in non-English contexts. Furthermore, we discuss the impact of the inherent bias in LLMs and the datasets in the mispredictions related to sensitive topics.

AB - Identifying offensive language is essential for maintaining safety and sustainability in the social media era. Though large language models (LLMs) have demonstrated encouraging potential in social media analytics, they lack thorough evaluation when in offensive language detection, particularly in multilingual environments. We for the first time evaluate multilingual offensive language detection of LLMs in three languages: English, Spanish, and German with three LLMs, GPT-3.5, Flan-T5, and Mistral, in both monolingual and multilingual settings. We further examine the impact of different prompt languages and augmented translation data for the task in non-English contexts. Furthermore, we discuss the impact of the inherent bias in LLMs and the datasets in the mispredictions related to sensitive topics.

KW - large language models

KW - multilingual

KW - Offensive language detection

UR - http://www.scopus.com/inward/record.url?scp=105002244485&partnerID=8YFLogxK

U2 - 10.1109/SWC62898.2024.00246

DO - 10.1109/SWC62898.2024.00246

M3 - Conference Proceeding

AN - SCOPUS:105002244485

T3 - Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications

SP - 1603

EP - 1608

BT - Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 2 December 2024 through 7 December 2024

ER -

He J, Wang L, Wang J, Liu Z, Na H, Wang Z et al. Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection. In Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications. Institute of Electrical and Electronics Engineers Inc. 2024. p. 1603-1608. (Proceedings - 2024 IEEE Smart World Congress, SWC 2024 - 2024 IEEE Ubiquitous Intelligence and Computing, Autonomous and Trusted Computing, Digital Twin, Metaverse, Privacy Computing and Data Security, Scalable Computing and Communications). doi: 10.1109/SWC62898.2024.00246

Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this