support pmix, openmpi5 #97
Comments
Well no, I take it back. It's set in shifter:
|
I think one difference is that PMIX doesn't set PMI_FD. If I set it myself to 11, I can see the duped FD as we expect:
But actually running an MPI test hangs. We'll have to strace it.
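For anyone repeating the PMI_FD check inside the container, here is a rough Python sketch. It is not the command used above. It assumes only that `PMI_FD`, when set, holds an integer file descriptor number. The helper names `describe_pmi_fd` and `pmi_environment` are made up for illustration.

```python
import os
import stat


def describe_pmi_fd(environ=None):
    """Report whether PMI_FD is set and whether it names an open descriptor."""
    environ = os.environ if environ is None else environ
    raw = environ.get("PMI_FD")
    if raw is None:
        return "PMI_FD is not set"
    try:
        fd = int(raw)
    except ValueError:
        return f"PMI_FD={raw!r} is not an integer"
    try:
        # fstat fails with EBADF if the descriptor was not inherited.
        mode = os.fstat(fd).st_mode
    except (OSError, OverflowError):
        return f"PMI_FD={fd} is set but the descriptor is not open"
    kind = "socket" if stat.S_ISSOCK(mode) else "non-socket"
    return f"PMI_FD={fd} is open ({kind})"


def pmi_environment(environ=None):
    """Return the PMI_* and PMIX_* variables visible to this process."""
    environ = os.environ if environ is None else environ
    return {
        name: value
        for name, value in sorted(environ.items())
        if name.startswith(("PMI_", "PMIX_"))
    }


if __name__ == "__main__":
    print(describe_pmi_fd())
    for name, value in pmi_environment().items():
        print(f"{name}={value}")
```

Running this as the containerized task under srun, once with pmi2 and once with pmix, should show whether the descriptor survived the container launch and which variables each plugin exports.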
|
I've crashed my node trying to strace this. Some of the messages that Chris saw while trying to debug:
and
|
Update: with
|
Added some initial support in #96. We still need to figure out what is going wrong with shared-run, so I'll leave this open. With
|
Testing with Howard Pritchard. He suggested mpirun:
|
Compare to case with
|
|
- Rebuild image with 4.1.6 with both pmi2 and pmix; make sure we can toggle
- Focus on srun, not mpirun
- Openmpi 4 will ship with pmix3; we need to override with external pmix 4 if possible
|
I think we've achieved this with the following recipe:
but I'll have to test. |
This recipe did produce a working image build. Now testing with it. pmi2 appears to work and pmix fails. pmi2:
pmix with userns=keep-id, with user namespace complaint
pmix without userns=keep-id
|
Our recent work will help support pmi2/openmpi4 and older.
Of course we need to support newer pmix and openmpi5.
Looks like pmix doesn't use file descriptors the way pmi2 did. Here's some diagnostic output: