Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Avoid data race condition and formalize preprocess.sh #108

Open
rajeeja opened this issue Sep 20, 2023 · 0 comments
Open

Avoid data race condition and formalize preprocess.sh #108

rajeeja opened this issue Sep 20, 2023 · 0 comments
Assignees

Comments

@rajeeja
Copy link
Collaborator

rajeeja commented Sep 20, 2023

Current singularity HPO workflow calls train.sh, that downloads the data and sometimes when the data isn't present all processes involved in HPO download the data resulting into undesirable data race condition.

Possible Fix:
HPO spuns two jobs:

  1. preprocess.sh
    • this runs on one processor or as the model data download in preprocess.sh is setup
    • with args to get and format data as per requirements
  2. launch the HPO job
    • as per the procs and run configuration specified for running the HPO workflow

Note: Design should consider that: this infrastructure will be useful for the cross-study workflows,
where we run multiple HPO for each differing in the preprocess.sh inputs. cross-study workflow also has to run infer.sh on all of the combinations.

@rajeeja rajeeja self-assigned this Sep 20, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant