-
Notifications
You must be signed in to change notification settings - Fork 127
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
qwen2.5不支持moe吗? #450
Comments
您好,在https://www.modelscope.cn/collections/Qwen25-dbc4d30adb768 并没有看到qwen2.5 moe的相关内容。考虑到当前qwen2.5实现后端调用的是qwen2的训练代码,如果您有自定义需求,可以先尝试修改入口脚本~ |
我这边更新了一下hf2mcore_qwen2_dense_and_moe_gqa.py,主要是添加了gate的初始化。 代码:
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
如题,如果不支持的话,后续是否有计划支持Megatron-Core-MoE?
The text was updated successfully, but these errors were encountered: