TY - JOUR
T1 - TextFace
T2 - Text-to-Style Mapping Based Face Generation and Manipulation
AU - Hou, Xianxu
AU - Zhang, Xiaokang
AU - Li, Yudong
AU - Shen, Linlin
N1 - Publisher Copyright:
© 1999-2012 IEEE.
PY - 2023
Y1 - 2023
N2 - As a subtopic of text-to-image synthesis, text-to-face generation has great potential in face-related applications. In this paper, we propose a generic text-to-face framework, namely, TextFace, to achieve diverse and high-quality face image generation from text descriptions. We introduce text-to-style mapping, a novel method where the text description can be directly encoded into the latent space of a pretrained StyleGAN. Guided by our text-image similarity matching and face captioning-based text alignment, the textual latent code can be fed into the generator of a well-trained StyleGAN to produce diverse face images with high resolution (1024×1024). Furthermore, our model inherently supports semantic face editing using text descriptions. Finally, experimental results quantitatively and qualitatively demonstrate the superior performance of our model.
AB - As a subtopic of text-to-image synthesis, text-to-face generation has great potential in face-related applications. In this paper, we propose a generic text-to-face framework, namely, TextFace, to achieve diverse and high-quality face image generation from text descriptions. We introduce text-to-style mapping, a novel method where the text description can be directly encoded into the latent space of a pretrained StyleGAN. Guided by our text-image similarity matching and face captioning-based text alignment, the textual latent code can be fed into the generator of a well-trained StyleGAN to produce diverse face images with high resolution (1024×1024). Furthermore, our model inherently supports semantic face editing using text descriptions. Finally, experimental results quantitatively and qualitatively demonstrate the superior performance of our model.
KW - GANs
KW - cross modal
KW - text-guided semantic face manipulation
KW - text-to-face generation
KW - text-to-image generation
UR - http://www.scopus.com/inward/record.url?scp=85126695399&partnerID=8YFLogxK
U2 - 10.1109/TMM.2022.3160360
DO - 10.1109/TMM.2022.3160360
M3 - Article
AN - SCOPUS:85126695399
SN - 1520-9210
VL - 25
SP - 3409
EP - 3419
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
ER -