Guided Safe Diffusion: Prohibiting Diffusion Models from Generating Inappropriate Content

Sidong Jiang, Rui Zhang, Xi Yang, Bin Dong, Kaizhu Huang*

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

The increasing deployment of large generative models has heightened concerns over security and privacy, particularly regarding the generation of inappropriate content such as violent, explicit, or sensitive images, as well as the potential for creating fake images that spread misinformation and cause social problems. In this work, we propose Guided Safe Diffusion (GSD), an inference-time method specifically designed for diffusion models to prevent the generation of images with undesirable content as defined in a prohibited content list. Our method integrates safety guidance during the denoising steps of the model’s inference process, modifying the predicted noise to steer the generation process away from unwanted content. This approach allows the model to accept both an input image and a text description, facilitating controlled image generation. Unlike previous methods, our technique does not necessitate retraining or fine-tuning of the model. We conduct qualitative and quantitative experiments to assess the effectiveness of our method, demonstrating that GSD can remove the unwanted content while preserving unrelated content. The results validate our method’s ability to mitigate risks while maintaining the generative utility of diffusion models.

Original languageEnglish
Title of host publicationNeural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
EditorsMufti Mahmud, Maryam Doborjeh, Zohreh Doborjeh, Kevin Wong, Andrew Chi Sing Leung, M. Tanveer
PublisherSpringer Science and Business Media Deutschland GmbH
Pages153-164
Number of pages12
ISBN (Print)9789819670352
DOIs
Publication statusPublished - 19 Jul 2025
Event31st International Conference on Neural Information Processing, ICONIP 2024 - Auckland, New Zealand
Duration: 2 Dec 20246 Dec 2024

Publication series

NameCommunications in Computer and Information Science
Volume2297 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference31st International Conference on Neural Information Processing, ICONIP 2024
Country/TerritoryNew Zealand
CityAuckland
Period2/12/246/12/24

Keywords

  • AI ethics
  • Diffusion models
  • Image protection

Fingerprint

Dive into the research topics of 'Guided Safe Diffusion: Prohibiting Diffusion Models from Generating Inappropriate Content'. Together they form a unique fingerprint.

Cite this