You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Current singularity HPO workflow calls train.sh, that downloads the data and sometimes when the data isn't present all processes involved in HPO download the data resulting into undesirable data race condition.
Possible Fix:
HPO spuns two jobs:
preprocess.sh
this runs on one processor or as the model data download in preprocess.sh is setup
with args to get and format data as per requirements
launch the HPO job
as per the procs and run configuration specified for running the HPO workflow
Note: Design should consider that: this infrastructure will be useful for the cross-study workflows,
where we run multiple HPO for each differing in the preprocess.sh inputs. cross-study workflow also has to run infer.sh on all of the combinations.
The text was updated successfully, but these errors were encountered:
Current singularity HPO workflow calls train.sh, that downloads the data and sometimes when the data isn't present all processes involved in HPO download the data resulting into undesirable data race condition.
Possible Fix:
HPO spuns two jobs:
Note: Design should consider that: this infrastructure will be useful for the
cross-study
workflows,where we run multiple HPO for each differing in the preprocess.sh inputs.
cross-study
workflow also has to runinfer.sh
on all of the combinations.The text was updated successfully, but these errors were encountered: