A Novel Encoder-Decoder Approach for Text-to-Face Conversion using Generative Adversarial Networks and Attention Mechanism
This repository contains a TensorFlow implementation for generating face images using the GAN-CLS Algorithm, as described in the paper [Generative Adversarial Text-to-Image Synthesis]1. The model is built upon the solid foundation of the [DCGAN in TensorFlow]2.
With the ability to handle input descriptions in over 100 languages, this implementation is highly versatile and adaptable. This means you can generate face images based on textual descriptions in various languages.
To train and evaluate the model, we utilize the large-scale CelebFaces Attributes (CelebA) dataset, which provides a diverse set of realistic human faces.
Image Source : [https://s3-us-west-1.amazonaws.com/udacity-dlnfd/datasets/celeba.zip]Caption source : [https://raw.githubusercontent.com/midas-research/text2facegan/master/data/caps.txt]
-The man sports a 5 o’clock shadow.He has big nose.The young attractive man is smiling.
##text-to-face evaluation (human evaluation) :
[Text2FaceGAN]3.
Model | FID |
---|---|
Cycle Text2Face | 1.20±0.081 |
Text2FaceGan | 1.4±0.7 |
Article cycle text2face: cycle text-to-face gan via transformers : [https://arxiv.org/abs/2206.04503]