TY - JOUR
T1 - Improving generative adversarial networks with simple latent distributions
AU - Zhang, Shufei
AU - Huang, Kaizhu
AU - Qian, Zhuang
AU - Zhang, Rui
AU - Hussain, Amir
N1 - Funding Information:
The work was partially supported by the following: National Natural Science Foundation of China under No. 61876155; Jiangsu Science and Technology Programme (Natural Science Foundation of Jiangsu Province) under Nos. BE2020006-4, BK20181189; Key Program Special Fund in XJTLU under No. KSF-T-06.
Publisher Copyright:
© 2021, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
PY - 2021/10
Y1 - 2021/10
N2 - Generative Adversarial Networks (GANs) have drawn great attention recently, since they are powerful models for generating high-quality images. Although GANs have achieved great success, they usually suffer from unstable training and may consequently produce poor generations in some cases. This drawback is argued to stem mainly from the difficulty of measuring the divergence between the highly complicated real and fake data distributions, which normally lie in a high-dimensional space. To tackle this problem, previous researchers have attempted to find a proper divergence capable of measuring the departure between such complex distributions. In contrast, we alleviate this problem from a different perspective: while retaining as much information as possible from the original high-dimensional distributions, we learn and leverage an additional latent space in which simple distributions are defined in a low-dimensional space; as a result, the distance between the two simple distributions can be readily computed with an available divergence measure. Concretely, to retain the data information, the mutual information is maximized between the variables of the high-dimensional complex distributions and those of the low-dimensional simple distributions. The departure between the resulting simple distributions is then measured in the original way of GANs. Additionally, to simplify the optimization further, we directly optimize a lower bound on the mutual information. Termed SimpleGAN, the proposed approach is evaluated on several different baseline models, i.e., conventional GANs, DCGAN, WGAN-GP, WGAN-GP-res, and LSWGAN-GP, on the benchmark CIFAR-10 and STL-10 datasets. SimpleGAN shows clear superiority over these baseline models. Furthermore, in comparison with existing methods that measure the distribution departure directly in the high-dimensional space, our method clearly demonstrates its superiority. Finally, a series of experiments illustrate the advantages of the proposed SimpleGAN.
AB - Generative Adversarial Networks (GANs) have drawn great attention recently, since they are powerful models for generating high-quality images. Although GANs have achieved great success, they usually suffer from unstable training and may consequently produce poor generations in some cases. This drawback is argued to stem mainly from the difficulty of measuring the divergence between the highly complicated real and fake data distributions, which normally lie in a high-dimensional space. To tackle this problem, previous researchers have attempted to find a proper divergence capable of measuring the departure between such complex distributions. In contrast, we alleviate this problem from a different perspective: while retaining as much information as possible from the original high-dimensional distributions, we learn and leverage an additional latent space in which simple distributions are defined in a low-dimensional space; as a result, the distance between the two simple distributions can be readily computed with an available divergence measure. Concretely, to retain the data information, the mutual information is maximized between the variables of the high-dimensional complex distributions and those of the low-dimensional simple distributions. The departure between the resulting simple distributions is then measured in the original way of GANs. Additionally, to simplify the optimization further, we directly optimize a lower bound on the mutual information. Termed SimpleGAN, the proposed approach is evaluated on several different baseline models, i.e., conventional GANs, DCGAN, WGAN-GP, WGAN-GP-res, and LSWGAN-GP, on the benchmark CIFAR-10 and STL-10 datasets. SimpleGAN shows clear superiority over these baseline models. Furthermore, in comparison with existing methods that measure the distribution departure directly in the high-dimensional space, our method clearly demonstrates its superiority. Finally, a series of experiments illustrate the advantages of the proposed SimpleGAN.
KW - Deep generative model
KW - Deep learning
KW - Generation
KW - Generative adversarial network
KW - Information theory
UR - http://www.scopus.com/inward/record.url?scp=85104813288&partnerID=8YFLogxK
U2 - 10.1007/s00521-021-05946-3
DO - 10.1007/s00521-021-05946-3
M3 - Article
AN - SCOPUS:85104813288
SN - 0941-0643
VL - 33
SP - 13193
EP - 13203
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 20
ER -