Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

📝 Meissonic Updates and Family Papers

🚀 Introduction

Meissonic is a non-autoregressive, masked image modeling (MIM) text-to-image synthesis model that generates high-resolution images. It is designed to run on consumer graphics cards.

Key Features:

🖼️ High-resolution image generation (up to 1024x1024)
💻 Designed to run on consumer GPUs
🎨 Versatile applications: text-to-image, image-to-image

🛠️ Prerequisites

Step 1: Clone the repository

git clone https://github.com/viiika/Meissonic/
cd Meissonic

Step 2: Create virtual environment

conda create --name meissonic python
conda activate meissonic
pip install -r requirements.txt

Step 3: Install diffusers

git clone https://github.com/huggingface/diffusers.git
cd diffusers
pip install -e .

💡 Inference Usage

Gradio Web UI

python app.py

Command-line Interface

Text-to-Image Generation

python inference.py --prompt "Your creative prompt here"
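To render several prompts in a row, the command above can be wrapped in a small driver script. The following is a minimal sketch, assuming it is run from the repository root and that `inference.py` accepts only the `--prompt` flag shown above; the helper names `build_command` and `run_prompts` are hypothetical and not part of the repository. Each call starts a fresh process, so the model is presumably reloaded for every prompt.

```python
import shlex
import subprocess
import sys


def build_command(prompt, script="inference.py"):
    """Build the argument list for one text-to-image run.

    Only the --prompt flag documented in this README is used.
    """
    return [sys.executable, script, "--prompt", prompt]


def run_prompts(prompts, dry_run=True):
    """Run (or, with dry_run=True, only print) one command per prompt."""
    commands = [build_command(p) for p in prompts]
    for cmd in commands:
        if dry_run:
            # shlex.join quotes the prompt so the line can be pasted into a shell.
            print(shlex.join(cmd))
        else:
            # check=True stops the loop if a run exits with an error.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    run_prompts([
        "A pillow with a picture of a Husky on it.",
        "A white coffee mug, a solid black background",
    ])
```

With `dry_run=True` (the default) nothing is executed, which makes it easy to inspect the commands before committing GPU time.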

Inpainting and Outpainting

python inpaint.py --mode inpaint --input_image path/to/image.jpg
python inpaint.py --mode outpaint --input_image path/to/image.jpg

Advanced: FP8 Quantization

Reduce inference memory usage with FP8 quantization (see Performance Benchmarks below).

Requirements:

CUDA 12.4
PyTorch 2.4.1
TorchAO

Note: Windows users can install TorchAO with:

pip install --pre torchao --index-url https://download.pytorch.org/whl/nightly/cpu
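Before running the FP8 scripts, the Python side of the requirements above can be checked with a short script. This is a sketch rather than an official check: it assumes the packages are installed under the usual distribution names `torch` and `torchao`, treats any TorchAO version as acceptable because the README does not pin one, and does not verify the CUDA toolkit version.

```python
from importlib import metadata

# Version pin taken from the requirements list above.
# None means "any installed version is accepted".
EXPECTED = {"torch": "2.4.1", "torchao": None}


def installed_version(dist_name):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def check_requirements(expected=EXPECTED):
    """Map each package name to a tuple (installed_version, ok)."""
    report = {}
    for name, wanted in expected.items():
        found = installed_version(name)
        # A local build tag such as "2.4.1+cu124" still counts as a match.
        ok = found is not None and (wanted is None or found.split("+")[0] == wanted)
        report[name] = (found, ok)
    return report


if __name__ == "__main__":
    for name, (found, ok) in check_requirements().items():
        status = "ok" if ok else "check"
        print(f"{name}: {found or 'not installed'} [{status}]")
```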

Command-line inference

python inference_fp8.py --quantization fp8

Gradio for FP8 (Select Quantization Method in Advanced settings)

python app_fp8.py

Performance Benchmarks

Settings: 64 steps, 1024x1024 resolution, batch size 1.

| Precision | Avg. Time | Memory Usage |
|-----------|-----------|--------------|
| FP32 | 13.32s | 12GB |
| FP16 | 12.35s | 9.5GB |
| FP8 | 12.93s | 8.7GB |
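To make the trade-off in the table easier to read, the snippet below computes time and memory savings relative to FP32. It is a small arithmetic sketch that uses only the figures reported above; the function name `relative_savings` is illustrative.

```python
# Figures copied from the benchmark table (64 steps, 1024x1024, batch size 1).
BENCHMARKS = {
    "FP32": {"seconds": 13.32, "memory_gb": 12.0},
    "FP16": {"seconds": 12.35, "memory_gb": 9.5},
    "FP8": {"seconds": 12.93, "memory_gb": 8.7},
}


def relative_savings(baseline="FP32"):
    """Return percentage time and memory saved by each precision vs. the baseline."""
    base = BENCHMARKS[baseline]
    savings = {}
    for name, row in BENCHMARKS.items():
        savings[name] = {
            "time_saved_pct": round(100 * (1 - row["seconds"] / base["seconds"]), 1),
            "memory_saved_pct": round(100 * (1 - row["memory_gb"] / base["memory_gb"]), 1),
        }
    return savings


if __name__ == "__main__":
    for name, row in relative_savings().items():
        print(f"{name}: time -{row['time_saved_pct']}%, memory -{row['memory_saved_pct']}%")
```

On these numbers, FP16 is the fastest setting (about 7.3% faster than FP32 with about 20.8% less memory), while FP8 uses the least memory (about 27.5% less than FP32) but is only about 2.9% faster.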

🎨 Showcase

"A pillow with a picture of a Husky on it."

"A white coffee mug, a solid black background"

🎓 Training

To train Meissonic, follow these steps:

Install dependencies:

cd train
pip install -r requirements.txt

Download the Meissonic base model from Hugging Face.
Prepare your dataset:
- Use the sample dataset: MeissonFlow/splash
- Or prepare your own dataset and dataset class, following the format at line 100 of dataset_utils.py and lines 656-680 of train_meissonic.py
- Modify train.sh with your dataset path
Start training:
```
bash train/train.sh
```

Note: For custom datasets, you'll likely need to implement your own dataset class.
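As a starting point, a custom dataset class for image/caption pairs might look like the sketch below. Everything about the data layout here is an assumption made for illustration: the `metadata.jsonl` manifest, the `image` and `caption` field names, and the returned dictionary keys are hypothetical and are not taken from dataset_utils.py, so they should be adapted to the format the training script actually expects. The class follows the map-style protocol (`__len__` and `__getitem__`) and deliberately avoids third-party imports; image decoding is delegated to an optional `transform` callable.

```python
import json
from pathlib import Path


class CaptionedImageDataset:
    """Map-style dataset over a JSON Lines manifest of image/caption pairs.

    Hypothetical layout: each non-empty line of the manifest looks like
    {"image": "relative/path.jpg", "caption": "some text"}.
    """

    def __init__(self, root, manifest="metadata.jsonl", transform=None):
        self.root = Path(root)
        self.transform = transform
        with open(self.root / manifest, encoding="utf-8") as f:
            self.records = [json.loads(line) for line in f if line.strip()]

    def __len__(self):
        return len(self.records)

    def __getitem__(self, index):
        record = self.records[index]
        image_path = self.root / record["image"]
        # Decoding is left to `transform` (for example an image loader that
        # returns a tensor); without one, the path itself is returned.
        image = self.transform(image_path) if self.transform else image_path
        return {"image": image, "caption": record["caption"]}
```

Keeping decoding in `transform` lets the same class be exercised without a GPU or image libraries, for example to validate a manifest before starting a long training run.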

📚 Citation

If you find this work helpful, please consider citing:

@article{bai2024meissonic,
  title={Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis},
  author={Bai, Jinbin and Ye, Tian and Chow, Wei and Song, Enxin and Chen, Qing-Guo and Li, Xiangtai and Dong, Zhen and Zhu, Lei and Yan, Shuicheng},
  journal={arXiv preprint arXiv:2410.08261},
  year={2024}
}

🙏 Acknowledgements

We thank the community and contributors for their invaluable support in developing Meissonic. We thank apolinario@multimodal.art for making the Meissonic demo. We thank @NewGenAI and @飛鷹しずか@自称文系プログラマの勉強 for making YouTube tutorials. We thank @pprp for adding FP8 and INT4 quantization. We thank @camenduru for making the Jupyter tutorial. We thank @chenxwh for making the Replicate demo and API. We thank Collov Labs for reproducing Monetico. We thank Shitong et al. for identifying effective design choices for enhancing visual quality.

Made with ❤️ by MeissonFlow Research

