May I ask which parts of Transformer Engine are causing issues?
This would be very helpful for applying Zero Bubble Pipeline Parallel to the current version of Megatron.
Thanks!
To support Transformer Engine, you would first need to modify the TransformerEngine code and then build it from source, so that Megatron can split the backward pass. It's doable if you want to try.
The reason we don't support Transformer Engine is not a technical issue; it's mainly that we don't want to make any code changes in dependencies.
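For anyone wondering what "splitting the backward pass" means here, below is a minimal sketch on a single linear layer `y = x @ W`. It is not Megatron or Transformer Engine code. The class `SplitLinear` and its methods `backward_input` / `backward_weight` are hypothetical names made up for illustration, and plain Python lists stand in for tensors. The idea is that the input gradient (needed right away by the previous pipeline stage) and the weight gradient (which can be deferred) are computed in two separate calls, so a scheduler can place the second one later.

```python
# Hypothetical illustration only; not Megatron or Transformer Engine code.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def transpose(m):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


class SplitLinear:
    """Linear layer y = x @ W whose backward pass is split in two steps."""

    def __init__(self, weight):
        self.weight = weight
        self.weight_grad = None
        self._saved_input = None
        self._pending = []  # (input, grad_output) pairs awaiting the weight step

    def forward(self, x):
        self._saved_input = x
        return matmul(x, self.weight)

    def backward_input(self, grad_output):
        """Step 1: gradient w.r.t. the input, dx = dy @ W^T.

        The previous pipeline stage needs this immediately. The data required
        for the weight gradient is queued instead of being used now.
        """
        self._pending.append((self._saved_input, grad_output))
        return matmul(grad_output, transpose(self.weight))

    def backward_weight(self):
        """Step 2: gradient w.r.t. the weight, dW = x^T @ dy.

        Nothing downstream waits on this, so it can run at a later time.
        """
        x, grad_output = self._pending.pop(0)
        grad = matmul(transpose(x), grad_output)
        if self.weight_grad is None:
            self.weight_grad = grad
        else:
            # Accumulate across micro-batches.
            self.weight_grad = [[g0 + g1 for g0, g1 in zip(r0, r1)]
                                for r0, r1 in zip(self.weight_grad, grad)]
        return self.weight_grad


layer = SplitLinear([[1, 2], [3, 4]])
y = layer.forward([[1, 2]])
dx = layer.backward_input([[1, 1]])   # can be sent upstream right away
dw = layer.backward_weight()          # can be scheduled later
```

A layer that computes both gradients in one fused backward call would have to expose them separately like this before such a schedule could be applied, which is presumably the kind of change in the dependency referred to above.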
overlap_grad_reduce and transformer_engine can both bring significant performance benefits. Are they still not supported?