Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

win10 support #154

Open
sipie800 opened this issue Jul 17, 2024 · 4 comments
Open

win10 support #154

sipie800 opened this issue Jul 17, 2024 · 4 comments
Assignees

Comments

@sipie800
Copy link

Feature request / 功能建议

It uses FastRotaryEmbedding, which relys on Triton. However triton is not available in win now. We may found some unofficial build Triton win10 wheels around. Tested and they doesn't support cogvlm(neither v1 and v2).
Many many applications need to be deployed on win. Linux is limited to some web applications.
Local LLM is the future. Please pay more attention to local deployment support.

Motivation / 动机

none

Your contribution / 您的贡献

none

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR self-assigned this Jul 17, 2024
@zRzRzRzRzRzRzR
Copy link
Member

To infer this model, the xformer library must be used, which poses a significant challenge for Windows users. For versions v1 and v2, it is not straightforward to remove xformer without compromising quality (as we have tested before). We will try removing this dependency in future versions of the model.

@sipie800
Copy link
Author

Thanks. I do have xformer installed. And use it in other models such as stable diffusion nicely. The issue is the FastRotaryEmbedding use triton. I've no idea what is the connection between xformer and triton, or can it simply go with another FastRoraryEmbedding impl rather than the triton one?

@sipie800
Copy link
Author

Thanks. I do have xformer installed. And use it in other models such as stable diffusion nicely. The issue is the FastRotaryEmbedding use triton. I've no idea what is the connection between xformer and triton, or can it simply go with another FastRotaryEmbedding impl rather than the triton one?

@FurkanGozukara
Copy link

same issue here. by the way v1 works perfect for me on windows even with 4bit

#169

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants