Curious about Zero1 Optimizer #226

Open
huyiwen opened this issue Jan 4, 2025 · 0 comments

huyiwen commented Jan 4, 2025

I noticed your team went with the ZeRO-1 optimizer instead of ZeRO-2. Were there any particular reasons or benefits you were aiming for? Also, how does this choice affect training models like MoE?
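
For context, the two stages differ in what gets sharded across data-parallel ranks: ZeRO-1 partitions only the optimizer states, while ZeRO-2 also partitions the gradients. Here is a rough sketch of the per-GPU model-state memory under each stage, assuming the usual mixed-precision Adam accounting (2 bytes per parameter for fp16 weights, 2 for fp16 gradients, 12 for fp32 optimizer states). The function name and the example sizes are hypothetical and not taken from this repository, and activations and communication buffers are ignored:

```python
def zero_model_state_bytes(num_params: int, dp_size: int, stage: int) -> float:
    """Approximate per-GPU bytes for model states under ZeRO stage 0, 1, or 2.

    Assumes mixed-precision Adam: fp16 weights (2 B/param), fp16 gradients
    (2 B/param), and fp32 optimizer states (12 B/param: master weights,
    momentum, variance).
    """
    if stage not in (0, 1, 2):
        raise ValueError("this sketch only covers ZeRO stages 0, 1, and 2")
    weights = 2 * num_params   # replicated on every rank in all three stages
    grads = 2 * num_params     # replicated unless stage 2
    optim = 12 * num_params    # replicated unless stage >= 1
    if stage >= 1:
        optim /= dp_size       # ZeRO-1: shard optimizer states
    if stage >= 2:
        grads /= dp_size       # ZeRO-2: additionally shard gradients
    return weights + grads + optim


if __name__ == "__main__":
    # Hypothetical example: 7.5B parameters on 64 data-parallel ranks.
    for stage in (0, 1, 2):
        gb = zero_model_state_bytes(7_500_000_000, 64, stage) / 1e9
        print(f"stage {stage}: {gb:.1f} GB per GPU")
```

Under these assumptions, the extra saving from ZeRO-2 is bounded by the gradient term (at most 2 bytes per parameter), so the gap between the two stages is smaller than the gap between no sharding and ZeRO-1.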

Thanks a lot for all your hard work! Looking forward to hearing back from you.
