Accelerating Attentional Generative Adversarial Networks with Sampling Blocks

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

402 Downloads (Pure)

Abstract

Synthesizing text-to-image models for high-quality images by guiding generative models through text descriptions is an innovative and challenging task. In recent years, AttnGAN has been proposed based on the Attention mechanism to guide GAN training, which improves the details and quality of images by stacking multiple generators and discriminators. However, the combination of multiple enhancements in GAN architecture introduces redundancy, hindering the practical application of the model. These redundancies adversely affect its performance, increasing inference time and space complexity. In this paper, we propose an Accelerated AttnGAN (AccAttnGAN) to optimize the structure and training efficiency of AttnGAN by (1) removing redundant structures and improving the backbone network of AttnGAN; (2) integrating and reconstructing multiple losses for the training of deep attention model. Experimental results show that AccAttnGAN significantly reduces the model’s space complexity and time complexity during inference while maintaining performance. Code is available at https://github.com/jmyissb/SEAttnGAN.

Original languageEnglish
Title of host publicationInternational Conference on Neural Information Processing 2024
Subtitle of host publicationICONIP
EditorsMufti Mahmud, Maryam Doborjeh, Zohreh Doborjeh, Kevin Wong, Andrew Chi Sing Leung, M. Tanveer
Place of PublicationSingapore
PublisherSpringer Nature Singapore
ChapterXVI
Pages122-137
Number of pages16
Volume2297
Edition1
ISBN (Electronic)978-981-96-7036-9
ISBN (Print)978-981-96-7035-2
DOIs
Publication statusPublished - 19 Jul 2025
Event31st International Conference on Neural Information Processing, ICONIP 2024 - Auckland, New Zealand
Duration: 2 Dec 20246 Dec 2024

Publication series

NameCommunications in Computer and Information Science
Volume2297 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference31st International Conference on Neural Information Processing, ICONIP 2024
Country/TerritoryNew Zealand
CityAuckland
Period2/12/246/12/24

Keywords

  • Alignment of Semantics and Images
  • AttnGAN
  • Efficiency
  • Sampling Blocks
  • Text-to-image

Fingerprint

Dive into the research topics of 'Accelerating Attentional Generative Adversarial Networks with Sampling Blocks'. Together they form a unique fingerprint.

Cite this