Description
Add support for HashiCorp Nomad as a scheduler.
Motivation/Background
Nomad has a good scheduler, and PyTorch has good distributed training. However, Nomad starts the tasks of a batch job asynchronously rather than all at once, so TorchX is needed to coordinate training in an orderly way when not all resources are immediately available.
Proposal
Add a torchx.schedulers.Scheduler subclass for Nomad.
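To make the proposal a bit more concrete, here is a rough sketch of the translation step such a subclass would probably need: turning one role of an app into a Nomad batch job payload. It deliberately does not import torchx or implement the Scheduler interface. `RoleSketch` and `role_to_nomad_job` are hypothetical names made up for illustration, and the payload layout assumes Nomad's JSON job format with the Docker task driver.

```python
import json
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class RoleSketch:
    """Hypothetical stand-in for a TorchX role; NOT the real torchx type."""

    name: str
    image: str
    entrypoint: str
    args: List[str] = field(default_factory=list)
    num_replicas: int = 1
    cpu_mhz: int = 1000  # assumption: CPU expressed in MHz
    memory_mb: int = 1024


def role_to_nomad_job(
    app_name: str, role: RoleSketch, datacenters: List[str]
) -> Dict[str, Any]:
    """Build a Nomad batch job payload (as a dict) for a single role.

    Assumes the Docker task driver. Each replica becomes one instance
    of the task group via "Count".
    """
    task = {
        "Name": role.name,
        "Driver": "docker",
        "Config": {
            "image": role.image,
            "command": role.entrypoint,
            "args": list(role.args),
        },
        "Resources": {"CPU": role.cpu_mhz, "MemoryMB": role.memory_mb},
    }
    group = {"Name": role.name, "Count": role.num_replicas, "Tasks": [task]}
    return {
        "Job": {
            "ID": app_name,
            "Name": app_name,
            "Type": "batch",
            "Datacenters": list(datacenters),
            "TaskGroups": [group],
        }
    }


if __name__ == "__main__":
    # Illustrative values only; the image name and script are placeholders.
    role = RoleSketch(
        name="trainer",
        image="example/trainer:latest",
        entrypoint="python",
        args=["train.py"],
        num_replicas=4,
    )
    print(json.dumps(role_to_nomad_job("demo", role, ["dc1"]), indent=2))
```

A real subclass would presumably build something like this in its dry-run step and submit it to Nomad in its schedule step. Because Nomad may place the four replicas at different times, the entrypoint would still need a rendezvous mechanism so workers wait for each other before training starts.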