Colab notebooks for text-to-image generators

User-friendly Colab notebooks for various text-prompt-steered synthetic image generators. These notebooks are not actively maintained, though issues and pull requests are welcome.

Available notebooks (latest to oldest):

Stable Diffusion (v1.4, v1.5, waifu diffusion)
DALL·E Mini (craiyon.ai)
Latent Diffusion LAION 400M
ruDALLE Malevich
JAX CLIP Guided Diffusion
CLIP+VQGAN
Latent Vision
Big Sleep

These techniques are all, more or less, descendants of Big Sleep (2021): the original concept of turning text into imagery by using CLIP (OpenAI) to steer BigGAN (Andrew Brock et al.), as introduced by Ryan Murdock in his original notebook. The technique has since branched and evolved into what we now know as Midjourney, DALL·E, Stable Diffusion, etc.
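The steering idea can be illustrated with a toy loop. This is only a sketch under loud assumptions: it uses neither CLIP nor BigGAN. A fixed random matrix `W` stands in for "generate an image from a latent, then embed it", a random vector stands in for the text embedding, and all names (`steer`, `embed`, `cosine`) are hypothetical. What it does show is the general shape of the method: repeatedly nudge the latent so that the image embedding becomes more similar to the text embedding.

```python
import math
import random


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def embed(W, z):
    """Toy stand-in for 'generate an image from z, then embed it': e = W z."""
    return [sum(w * x for w, x in zip(row, z)) for row in W]


def steer(W, text_emb, z, steps=500, lr=0.05):
    """Gradient ascent on cosine(embed(z), text_emb) with respect to z."""
    t_norm = math.sqrt(sum(t * t for t in text_emb))
    history = []
    for _ in range(steps):
        e = embed(W, z)
        sim = cosine(e, text_emb)
        history.append(sim)
        e_norm = math.sqrt(sum(x * x for x in e))
        # Gradient of the cosine similarity with respect to the embedding e.
        grad_e = [t / (e_norm * t_norm) - sim * x / (e_norm ** 2)
                  for x, t in zip(e, text_emb)]
        # Chain rule through the linear map: grad_z = W^T grad_e.
        grad_z = [sum(W[i][j] * grad_e[i] for i in range(len(W)))
                  for j in range(len(z))]
        z = [x + lr * g for x, g in zip(z, grad_z)]
    history.append(cosine(embed(W, z), text_emb))
    return z, history


rng = random.Random(0)
W = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(4)]  # toy generator + encoder
text_emb = [rng.gauss(0, 1) for _ in range(4)]               # toy "text embedding"
z0 = [rng.gauss(0, 1) for _ in range(8)]                     # initial latent

z, history = steer(W, text_emb, z0)
print(f"similarity: {history[0]:.3f} -> {history[-1]:.3f}")
```

In the real notebooks the linear map is replaced by a neural image generator followed by CLIP's image encoder, and the gradient comes from automatic differentiation rather than a hand-written formula.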

The point of this repository is to record the development history of image-generation AIs and, as long as they work, to provide user-friendly and quick-to-run UIs for the different implementations. Some basic features have been added to some of the notebooks, such as saving all or selected progression images, saving a progression video, and batch processing.
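As a rough illustration of the "all/selected progression images" option, the helper below picks which iterations would have their image written to disk. It is a minimal sketch, not code taken from the notebooks: the function name `frames_to_save` and its parameters `save_every` and `selected` are hypothetical.

```python
def frames_to_save(total_iterations, save_every=None, selected=None):
    """Return the sorted iteration numbers whose images should be saved.

    selected:   explicit iteration numbers to keep (out-of-range ones are dropped).
    save_every: keep every N-th iteration, plus the final one.
    With neither option set, only the final image is kept.
    """
    if selected is not None:
        return sorted({i for i in selected if 1 <= i <= total_iterations})
    if save_every is None or save_every < 1:
        return [total_iterations]
    frames = list(range(save_every, total_iterations + 1, save_every))
    if not frames or frames[-1] != total_iterations:
        frames.append(total_iterations)  # always keep the final result
    return frames


print(frames_to_save(400, save_every=100))
print(frames_to_save(400, selected=[400, 50, 999]))
```

The saved frames could then be joined into a progression video by whatever tool the notebook uses.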

I do not use these notebooks regularly, so I'm mostly unaware when a library update breaks a notebook. Please raise an issue should any problems occur.

Stable Diffusion by CompVis, Stability AI and LAION

This Colab requires registering with Hugging Face for a login token and accepting the model card terms to download the required models here and here.

"Satanism in the 1900s"

DALL·E Mini (craiyon.ai) by Boris Dayma

This Colab requires registering with wandb for an authentication key to download the required models.

"Satanism in the 1900s"

Latent Diffusion LAION 400M by CompVis

"Satanism in the 1900s" (upscaled)

ruDALLE Malevich by Sberbank-AI

"Satanism in the 1900s"

JAX CLIP Guided Diffusion by nshepperd

Method originally by Katherine Crowson

300 iterations of "Satanism in the 1900s".

CLIP+VQGAN by Katherine Crowson

Papers: 1) VQGAN by Patrick Esser et al., 2) CLIP by OpenAI

400 iterations of "Satanism in the 1900s".

Latent Vision by Ryan Murdock

Papers: 1) VQGAN by Patrick Esser et al., 2) CLIP by OpenAI

400 iterations of "Satanism in the 1900s".

Big Sleep by Phil Wang (more precisely this fork)

Papers: 1) BigGAN by Andrew Brock et al., 2) CLIP by OpenAI

Original Big Sleep method.

400 iterations of "Satanism in the 1900s".

