Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multi-Node Quantization using Ray? #601

Open
paolovic opened this issue Aug 30, 2024 · 2 comments
Open

Multi-Node Quantization using Ray? #601

paolovic opened this issue Aug 30, 2024 · 2 comments

Comments

@paolovic
Copy link

paolovic commented Aug 30, 2024

Hi,

in theory I could get enough compute to host and quantize current models.

But it will be provided as multiple VMs, each with 2GPUs, each with 48GB VRAM.
Using these, I could create a Ray Cluster https://github.com/ray-project/ray with, e.g., 5 nodes, therefore in total 10 GPUs and 480 GB VRAM.

Is it possible to utilize this to quantize models with AutoAWQ?

Thank you very much!
Best regards

@casper-hansen
Copy link
Owner

Hi @paolovic, at the moment this is not something explicitly supported or even something that I have attempted. I suspect it could be possible, but it's not something that I have researched. If you do find the time, I would love to receive a PR with either code changes / docs on how to do this.

@paolovic
Copy link
Author

Hoi @casper-hansen ,
alright, as soon as I have time for that, I will dig into it.
First, I assume I'll have to master Ray.
Thank you for the quick response!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants