Skip to main navigation Skip to search Skip to main content

Lost in Pronunciation: Detecting Chinese Offensive Language Disguised by Phonetic Cloaking Replacement

  • Haotan Guo
  • , Jianfei He
  • , Jiayuan Ma
  • , Hongbin Na
  • , Zimu Wang*
  • , Haiyang Zhang
  • , Qi Chen
  • , Wei Wang
  • , Zijing Shi
  • , Tao Shen
  • , Ling Chen
  • *Corresponding author for this work
  • The University of Sydney
  • Hong Kong University of Science and Technology
  • University of Technology Sydney
  • Xi'an Jiaotong-Liverpool University

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

Warning: this paper contains content that may be offensive or upsetting. Phonetic Cloaking Replacement (PCR), defined as the deliberate use of homophonic or near-homophonic variants to hide toxic intent, has become a major obstacle to Chinese content moderation. While this problem is well-recognized, existing evaluations predominantly rely on rule-based, synthetic perturbations that ignore the creativity of real users. We organize PCR into a four-way surface-form taxonomy and compile PCR-ToxiCN, a dataset of 500 naturally occurring, phonetically cloaked offensive posts gathered from the RedNote platform. Benchmarking state-of-the-art LLMs on this dataset exposes a serious weakness: the best model reaches only an F1-score of 0.672, and zero-shot chain-of-thought prompting pushes performance even lower. Guided by error analysis, we revisit a Pinyin-based prompting strategy that earlier studies judged ineffective and show that it recovers much of the lost accuracy. This study offers the first comprehensive taxonomy of Chinese PCR, a realistic benchmark that reveals current detectors’ limits, and a lightweight mitigation technique that advances research on robust toxicity detection.

Original languageEnglish
Title of host publicationEMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Industry Track
EditorsSaloni Potdar, Lina Rojas-Barahona, Sebastien Montella
PublisherAssociation for Computational Linguistics (ACL)
Pages2538-2550
Number of pages13
ISBN (Electronic)9798891763333
DOIs
Publication statusPublished - 2025
Event2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, EMNLP 2025 - Suzhou, China
Duration: 4 Nov 20259 Nov 2025

Publication series

NameEMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Industry Track

Conference

Conference2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, EMNLP 2025
Country/TerritoryChina
CitySuzhou
Period4/11/259/11/25

Cite this