You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I noticed your team went with the Zero1 optimizer instead of Zero 2. Just wondering, if there's any particular reasons or benefits you were aiming for? Also, how does this affect training models like MoE?
Thanks a lot for all your hard work! Looking forward to hearing back from you.
The text was updated successfully, but these errors were encountered:
I noticed your team went with the Zero1 optimizer instead of Zero 2. Just wondering, if there's any particular reasons or benefits you were aiming for? Also, how does this affect training models like MoE?
Thanks a lot for all your hard work! Looking forward to hearing back from you.
The text was updated successfully, but these errors were encountered: