Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于注意力块权重如何分配? #224

Open
Yues007 opened this issue Oct 4, 2024 · 2 comments
Open

关于注意力块权重如何分配? #224

Yues007 opened this issue Oct 4, 2024 · 2 comments
Labels
question Further information is requested

Comments

@Yues007
Copy link

Yues007 commented Oct 4, 2024

很感谢贵团队开源了powerinfer的研究成果,我想知道powerinfer是怎样处理模型中注意力块的权重的?貌似论文里只提到对FFN进行分层load,希望可以解答一下我的疑惑

@Yues007 Yues007 added the question Further information is requested label Oct 4, 2024
@Paradise59
Copy link

你现在知道了吗,我也很好奇这个问题

@Yues007
Copy link
Author

Yues007 commented Jan 1, 2025 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants