-
Notifications
You must be signed in to change notification settings - Fork 63
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Delta GPUs results give NaNs in the viscous sub-grid bubble benchmark case whereas Phoenix GPUs do not #396
Comments
Update: This issue is associated with |
I'm not sure if this is still "broken" or not. |
Update: This is still broken. Related to PR #425 Update 2: This does not fail when case optimization is disabled. It only fails with case optimization enabled (on non-Phoenix computers). I get the feeling that this line is not actually invoking case optimization.... MFC/.github/workflows/phoenix/bench.sh Line 12 in 4f89f33
Update 3: Update 2 is incorrect and case optimization is not relevant |
@sbryngelson I'm pretty sure |
The logs indicate that case optimization is enabled on Phoenix for the benchmarking. There's recompilation of code in cases that I would expect to see recompilation due to case optimization. |
Nevermind, you're both right and it fails with and without case optimization on Delta (and presumably other computers). |
Delta GPUs results give NaNs in the viscous sub-grid bubble benchmark case whereas Phoenix GPUs do not.
This is regardless of "memory size" (I've checked 4gb).
I've tested A100s and A40s on Delta, both give the issue discussed further on Slack.
I tested A100s and V100s on Phoenix, both of which do not give the issue.
Both computers use NVHPC 22.11.
Error is this:
One can run this case via something like
./mfc.sh run benchmarks/viscous_weno5_sgb_mono/case.py 4 -t pre_process simulation -c delta --gpu
if you are already on a node with GPUs and have loaded the appropriate modules.
The text was updated successfully, but these errors were encountered: