Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[QUESTION] Question about ZeroBubbleVPipeScheduler schedule W #38

Open
duanjunwen opened this issue Aug 1, 2024 · 1 comment
Open

Comments

@duanjunwen
Copy link

duanjunwen commented Aug 1, 2024

Hello,
I had some questions while reading ZeroBubbleVPipeScheduler, especially schedule W. (As the title)
Can you plz explain exactly how this part is implemented ?
1722502584508
In detailed:

  1. Where it strips the weight grad ?
  2. Where is the weight grad calculated? (in schedule B, backward_step ?)
@duanjunwen duanjunwen changed the title [QUESTION] In ZeroBubbleVPipeScheduler, where is the calculation and communication of the weight grad done? [QUESTION] Question about ZeroBubbleVPipeScheduler schedule W Aug 1, 2024
@huanggx-sea
Copy link

The weight grad computation is stored in WeightGradStore and executed when scheduled.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants