Grok | Mixture-of-Experts | Model Support #564
Grok's architecture and weights were just released. Does Petals support Grok and MoE (mixture-of-experts) models, or is support planned?
Having a first-in-class 314B-parameter model running on consumer hardware would be great!
Thanks in advance.
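For context, this is roughly what the Petals client flow looks like today for an architecture it already supports; a minimal sketch, assuming the `petals-team/StableBeluga2` checkpoint from the Petals README as an illustrative supported model. Grok would need its MoE blocks implemented server-side before anything analogous could work:

```python
# Sketch of the standard Petals client flow for an already-supported model.
# Grok is NOT supported yet; swapping in a Grok checkpoint here would fail
# until its MoE blocks are implemented in Petals.
from transformers import AutoTokenizer
from petals import AutoDistributedModelForCausalLM

model_name = "petals-team/StableBeluga2"  # illustrative supported model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoDistributedModelForCausalLM.from_pretrained(model_name)

# The transformer blocks run remotely on swarm peers; only the embeddings
# and LM head run locally, so local RAM requirements stay small.
inputs = tokenizer("What is a mixture-of-experts model?", return_tensors="pt")["input_ids"]
outputs = model.generate(inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0]))
```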
Comments
I was thinking roughly the same thing. Curious to see Grok running on Petals, if possible.
Related, regarding MoEs (mixture of experts): #548
There is a possibility to implement a distributed Grok model based on this unofficial Transformers implementation: https://huggingface.co/keyfan/grok-1-hf
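For anyone who wants to poke at that checkpoint locally before attempting a Petals port, the usual `transformers` entry point should apply. A minimal sketch, assuming the repo ships custom modeling code (hence `trust_remote_code=True`) and bf16 weights; both assumptions should be checked against the model card, and a 314B-parameter MoE will need a multi-GPU node rather than consumer hardware:

```python
# Hypothetical local smoke test of the unofficial Grok-1 checkpoint.
# trust_remote_code=True is an assumption (custom modeling code in the repo);
# loading the full model requires hundreds of GB of GPU/CPU memory.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "keyfan/grok-1-hf"
tokenizer = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    repo,
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,  # assumed weight dtype
    device_map="auto",           # shard across available GPUs / offload to CPU
)

inputs = tokenizer("Hello, Grok!", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(outputs[0]))
```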