Yatin-aggarwal/Pixel_Prompt

Pixel Prompt

About

Pixel Prompt is a generative AI project that generates anime character images from a prompt given by the user. It follows the DCGAN approach for image generation. An LSTM encodes the prompt, and the resulting encoding is concatenated with random noise and fed into the generator. The discriminator, in turn, receives both the generator output and the prompt encoding to decide whether the result is fake or real. The project uses the Hugging Face dataset "alfredplpl/anime-with-caption-cc0" for supervised learning. It is a dataset of nearly 21 GB with 15000 images and prompts. For the tokenizer, Pixel Prompt uses "openai-community/gpt2" from Hugging Face.
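The conditioning scheme described above can be sketched in PyTorch as follows. This is a minimal illustration, not the repository's actual code: the class names, layer sizes, the 32x32 output resolution, the noise dimension of 100, and the choice to inject the prompt encoding into the discriminator just before its last layer are all assumptions made for the example. Only the GPT-2 vocabulary size (50257) follows from the tokenizer named above.

```python
import torch
import torch.nn as nn


class PromptEncoder(nn.Module):
    """Hypothetical LSTM encoder: token ids -> fixed-size prompt encoding."""

    def __init__(self, vocab_size=50257, embed_dim=64, hidden_dim=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)

    def forward(self, token_ids):
        _, (h_n, _) = self.lstm(self.embed(token_ids))
        return h_n[-1]  # last hidden state, shape (batch, hidden_dim)


class Generator(nn.Module):
    """DCGAN-style generator fed with noise concatenated with the prompt encoding."""

    def __init__(self, noise_dim=100, cond_dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(noise_dim + cond_dim, 256, 4, 1, 0, bias=False),  # 1x1 -> 4x4
            nn.BatchNorm2d(256), nn.ReLU(True),
            nn.ConvTranspose2d(256, 128, 4, 2, 1, bias=False),  # 4x4 -> 8x8
            nn.BatchNorm2d(128), nn.ReLU(True),
            nn.ConvTranspose2d(128, 64, 4, 2, 1, bias=False),  # 8x8 -> 16x16
            nn.BatchNorm2d(64), nn.ReLU(True),
            nn.ConvTranspose2d(64, 3, 4, 2, 1, bias=False),  # 16x16 -> 32x32
            nn.Tanh(),
        )

    def forward(self, noise, cond):
        z = torch.cat([noise, cond], dim=1)  # (batch, noise_dim + cond_dim)
        return self.net(z[:, :, None, None])  # reshape to (batch, C, 1, 1) first


class Discriminator(nn.Module):
    """DCGAN-style discriminator that also sees the prompt encoding."""

    def __init__(self, cond_dim=128):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 64, 4, 2, 1, bias=False), nn.LeakyReLU(0.2, True),  # 32 -> 16
            nn.Conv2d(64, 128, 4, 2, 1, bias=False),  # 16 -> 8
            nn.BatchNorm2d(128), nn.LeakyReLU(0.2, True),
            nn.Conv2d(128, 256, 4, 2, 1, bias=False),  # 8 -> 4
            nn.BatchNorm2d(256), nn.LeakyReLU(0.2, True),
        )
        self.classify = nn.Conv2d(256 + cond_dim, 1, 4, 1, 0)  # 4x4 -> 1x1

    def forward(self, image, cond):
        feats = self.features(image)  # (batch, 256, 4, 4)
        # Broadcast the prompt encoding over the spatial grid and concatenate.
        cond_map = cond[:, :, None, None].expand(-1, -1, feats.size(2), feats.size(3))
        return self.classify(torch.cat([feats, cond_map], dim=1)).view(-1)  # raw logits


# Random token ids stand in for a tokenized prompt batch (2 prompts, 12 tokens each).
token_ids = torch.randint(0, 50257, (2, 12))
encoder, G, D = PromptEncoder(), Generator(), Discriminator()
cond = encoder(token_ids)
fake = G(torch.randn(2, 100), cond)
logits = D(fake, cond)
```

The discriminator returns raw logits here, so a loss such as `nn.BCEWithLogitsLoss` would be a natural fit; whether the project applies a sigmoid inside the discriminator instead is not stated.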

Result

The following are generated results alongside the training images for the corresponding prompts. The model was trained for only 70 epochs on an NVIDIA GTX 1650 (4 GB) GPU with 8 GB of system RAM. The weights of the generator and discriminator, along with the optimizer states, are provided so that anyone can further improve the results by training a bit longer.
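Resuming training from saved weights and optimizer states might look roughly like the sketch below. The checkpoint file name, the dictionary keys, and the Adam hyperparameters are hypothetical, and two tiny linear layers stand in for the real networks so the example stays self-contained. The files shipped with the repository may be organized differently.

```python
import os
import tempfile

import torch
import torch.nn as nn

# Tiny stand-in networks (assumption): replace with the real generator/discriminator.
generator = nn.Linear(8, 4)
discriminator = nn.Linear(4, 1)
# Adam with lr=2e-4 and beta1=0.5 is a common DCGAN setting, assumed here.
opt_g = torch.optim.Adam(generator.parameters(), lr=2e-4, betas=(0.5, 0.999))
opt_d = torch.optim.Adam(discriminator.parameters(), lr=2e-4, betas=(0.5, 0.999))


def save_checkpoint(path, epoch):
    """Store both networks, both optimizers, and the epoch counter in one file."""
    torch.save(
        {
            "epoch": epoch,
            "generator": generator.state_dict(),
            "discriminator": discriminator.state_dict(),
            "opt_g": opt_g.state_dict(),
            "opt_d": opt_d.state_dict(),
        },
        path,
    )


def load_checkpoint(path):
    """Restore everything in place and return the epoch to resume from."""
    ckpt = torch.load(path, map_location="cpu")
    generator.load_state_dict(ckpt["generator"])
    discriminator.load_state_dict(ckpt["discriminator"])
    opt_g.load_state_dict(ckpt["opt_g"])
    opt_d.load_state_dict(ckpt["opt_d"])
    return ckpt["epoch"]


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "checkpoint.pt")  # hypothetical file name
    saved_weight = generator.weight.detach().clone()
    save_checkpoint(path, epoch=70)

    # Simulate a fresh process by overwriting the weights, then restore them.
    with torch.no_grad():
        generator.weight.zero_()
    start_epoch = load_checkpoint(path)
    restored = torch.equal(generator.weight, saved_weight)
```

Restoring the optimizer states together with the weights keeps Adam's running moment estimates, so continued training picks up where the original run stopped instead of restarting the optimizer from scratch.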

Generated Images

Screenshot 2024-08-01 193653

Training Images

Screenshot 2024-08-01 193701
