
VRAM requirement to load ControlNet for inference? #99

Open · yuchen1984 opened this issue Sep 4, 2024 · 6 comments

yuchen1984 commented Sep 4, 2024

I was trying to load XLabs-AI/flux-controlnet-depth-v3 for inference, using the flux-dev-fp8 checkpoint with the "offload" switch. Image size 1024x512.

It still gives a CUDA OOM on an RTX 4090 (24GB VRAM). What is the minimal VRAM requirement to load a ControlNet for inference? Is there an FP8 version of the ControlNets, or is there any caveat to getting this to work? It feels outrageous to need an A100 just to run inference...

NB: without loading the ControlNet, inference is possible with 24GB VRAM; the observed peak VRAM usage is only about 14GB.
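(For anyone comparing numbers: peak figures like the ones above can be checked with PyTorch's built-in memory counters. A generic sketch, not code from this repo.)

```python
import torch

torch.cuda.reset_peak_memory_stats()

# ... run the text-to-image / ControlNet sampling call here ...

# max_memory_allocated tracks live tensors; max_memory_reserved is closer to what
# nvidia-smi (and CUDA OOM errors) report, since it includes the caching allocator.
print(f"peak allocated: {torch.cuda.max_memory_allocated() / 2**30:.1f} GiB")
print(f"peak reserved:  {torch.cuda.max_memory_reserved() / 2**30:.1f} GiB")
```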

Oguzhanercan commented Nov 21, 2024

Did you find a way to run inference with 24GB VRAM? @yuchen1984

yuchen1984 (Author) commented

> Did you find a way to run inference with 24GB VRAM? @yuchen1984

Nope. I ended up renting an A40 node on vast.ai at the time. The peak VRAM usage is about 27.5GB.

yuchen1984 (Author) commented

> Did you find a way to run inference with 24GB VRAM? @yuchen1984

Actually, it seems possible to make a small code change in xflux_pipeline.py so that the ControlNet can be offloaded to the CPU in --lowvram mode. This brings the peak VRAM below 24GB. I will create a PR a bit later.
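(Illustrative only: a minimal PyTorch sketch of the offload idea described above, with a placeholder module standing in for the actual ControlNet. This is not the code from the PR.)

```python
import torch
import torch.nn as nn

def offloaded_controlnet_forward(controlnet: nn.Module, x: torch.Tensor,
                                 device: str = "cuda") -> torch.Tensor:
    """Move the ControlNet to the GPU only for its forward pass, then park it
    back on the CPU so its weights do not stay resident in VRAM alongside the
    main transformer. Costs a host<->device copy per call, lowers the peak."""
    controlnet.to(device)
    try:
        with torch.no_grad():
            out = controlnet(x.to(device))
    finally:
        controlnet.to("cpu")
        torch.cuda.empty_cache()  # release the cached blocks that held the weights
    return out

if __name__ == "__main__" and torch.cuda.is_available():
    dummy = nn.Linear(64, 64)  # stand-in for the real ControlNet
    hints = offloaded_controlnet_forward(dummy, torch.randn(2, 64))
    print(hints.shape, f"{torch.cuda.max_memory_allocated() / 2**20:.1f} MiB peak")
```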

yuchen1984 (Author) commented

#138

Oguzhanercan commented Nov 22, 2024

Thanks for your PR. I solved it via sequential offload: 2GB VRAM required, but inference time doubled. How much does this solution slow down the pipeline? (transformer quantized to NF4)

yuchen1984 (Author) commented

> Thanks for your PR. I solved it via sequential offload: 2GB VRAM required, but inference time doubled. How much does this solution slow down the pipeline? (transformer quantized to NF4)

A slight slow-down, but definitely not as much as sequential offload, I believe (it does, of course, need a lot more than 2GB VRAM). I was running everything in FP8; peak VRAM is about 21GB.
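(For reference, a rough diffusers-based sketch of the sequential-offload route discussed above, rather than this repo's xflux_pipeline.py. The checkpoint id, the control_image argument, and diffusers' Flux ControlNet support are assumptions; the original XLabs-AI/flux-controlnet-depth-v3 checkpoint may need its diffusers-format release, and the NF4 quantization step the commenter used is left out here.)

```python
import torch
from diffusers import FluxControlNetModel, FluxControlNetPipeline
from diffusers.utils import load_image

# Assumed diffusers-format depth ControlNet checkpoint.
controlnet = FluxControlNetModel.from_pretrained(
    "XLabs-AI/flux-controlnet-depth-diffusers", torch_dtype=torch.bfloat16)

pipe = FluxControlNetPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", controlnet=controlnet,
    torch_dtype=torch.bfloat16)

# Stream submodules through the GPU one at a time: a few GB of VRAM,
# but markedly slower than keeping the models resident.
pipe.enable_sequential_cpu_offload()

depth = load_image("depth_map.png")  # hypothetical precomputed depth map
image = pipe("a mountain cabin at dusk", control_image=depth,
             width=1024, height=512, num_inference_steps=28,
             guidance_scale=3.5).images[0]
image.save("out.png")
```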
