Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Getting this working on 24gb vram? #34

Open
Thomas2419 opened this issue Nov 8, 2024 · 0 comments
Open

Getting this working on 24gb vram? #34

Thomas2419 opened this issue Nov 8, 2024 · 0 comments

Comments

@Thomas2419
Copy link

Thomas2419 commented Nov 8, 2024

Hello I'm looking for some opinions and directions on getting this working on 24 gb vram. I'm wondering if int8 quantization with torchao might be the best direction with splitting the pipeline up and clearing vram per step? Any suggestions or ideas?

I did try pipe.cpu offload but it didn't appear to help much still got consistent ooms.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant