Abstract
Generative Adversarial Networks (GANs) are among the most popular and powerful models for learning complex high-dimensional distributions. However, they often suffer from instability and generalization issues that can lead to poor generation quality. Most existing works focus on stabilizing the training of GAN discriminators while ignoring the generalization issue. In this work, we aim to improve the generalization capability of GANs by promoting local robustness within a small neighborhood of the training samples, and we prove that such robustness leads to better generalization. Specifically, we design a new robust method, the Robust Generative Adversarial Network (RGAN), in which the generator and discriminator compete with each other in a worst-case setting within a small Wasserstein ball: the generator tries to map the worst-case input distribution (rather than the Gaussian distribution used in most GANs) to the real data distribution, while the discriminator attempts to distinguish the real and fake distributions under the worst perturbations. Intuitively, RGAN learns a generator and discriminator that perform well even on worst-case input points. Formally, we prove that, under mild assumptions, RGAN attains a tighter generalization upper bound than traditional GANs, ensuring its theoretical superiority. We apply the proposed method to five popular GAN models as baselines, and a series of experiments on the CIFAR-10, STL-10, and CelebA datasets shows that our robust frameworks outperform all five baselines substantially and consistently.
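As a rough, non-authoritative sketch of the worst-case training scheme described above, the Python snippet below approximates the inner maximization with a few projected gradient-ascent steps inside a small L2 ball around each input, a common local proxy for the small-Wasserstein-ball adversary. The ball radius, step counts, toy networks, and helper names (`worst_case_delta`, `rgan_step`) are all illustrative assumptions rather than the authors' implementation.

```python
# Illustrative sketch only: approximates RGAN-style worst-case training with a
# few projected gradient-ascent steps in a small L2 ball around each input.
# Radius, step counts, and the toy networks are assumptions, not the paper's setup.
import torch
import torch.nn as nn

def worst_case_delta(loss_fn, x, radius=0.05, steps=3, step_size=0.02):
    """Find an approximately worst perturbation of x within an L2 ball,
    i.e. a local proxy for the small-Wasserstein-ball adversary."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = loss_fn(x + delta)
        (grad,) = torch.autograd.grad(loss, delta)
        with torch.no_grad():
            delta += step_size * grad.sign()          # ascend the loss
            norms = delta.flatten(1).norm(dim=1).clamp(min=1e-12)
            scale = (radius / norms).clamp(max=1.0)   # project back into the ball
            delta *= scale.view(-1, *([1] * (x.dim() - 1)))
    return delta.detach()

# Toy 2-D generator/discriminator, stand-ins for the real architectures.
G = nn.Sequential(nn.Linear(8, 64), nn.ReLU(), nn.Linear(64, 2))
D = nn.Sequential(nn.Linear(2, 64), nn.ReLU(), nn.Linear(64, 1))
opt_g = torch.optim.Adam(G.parameters(), lr=1e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-4)
bce = nn.BCEWithLogitsLoss()

def rgan_step(x_real):
    n = x_real.size(0)
    ones, zeros = torch.ones(n, 1), torch.zeros(n, 1)

    # Discriminator: classify real and fake points under worst perturbations.
    x_fake = G(torch.randn(n, 8)).detach()
    d_real = worst_case_delta(lambda v: bce(D(v), ones), x_real)
    d_fake = worst_case_delta(lambda v: bce(D(v), zeros), x_fake)
    d_loss = bce(D(x_real + d_real), ones) + bce(D(x_fake + d_fake), zeros)
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Generator: map the worst-case latent input toward the data distribution.
    z = torch.randn(n, 8)
    dz = worst_case_delta(lambda v: bce(D(G(v)), ones), z)
    g_loss = bce(D(G(z + dz)), ones)
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
    return d_loss.item(), g_loss.item()

# Example: one update on a toy batch of 2-D "real" points.
# d_loss, g_loss = rgan_step(torch.randn(128, 2))
```

In this sketch the adversary runs only three ascent steps per update; a stronger inner maximization tightens the worst-case approximation at a proportional training cost.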
| Original language | English |
| --- | --- |
| Pages (from-to) | 5135–5161 |
| Number of pages | 27 |
| Journal | Machine Learning |
| Volume | 112 |
| Issue number | 12 |
| DOIs | |
| Publication status | Published - Dec 2023 |
Keywords
- Generalization
- Generative adversarial network
- Robustness