Suitable example for In-situ techniques and GPU Utilization #761
-
Hello, can you recommend me a suitable example for in-situ techniques? In my specific case, I need to link Neko with ADIOS2 and Catalyst to do Synchronous and asynchronous Image Generation. I tried this on tgv case but it doesn't seem well, it's kind of small. I want to find an example on 4 GPUs with about 10 minutes and should run faster on the gpu than the CPU(which is not good in tgv). Besides, I'm a little confused with how the Neko uses GPU, such as visible devices. If you have many nodes and many GPUs, how do you control which devices your application uses? Thanks for your work. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Hi, You can easily increase the amount of work for the tgv example by using one of the larger meshes in the example folder ( For multiple GPUs per node, you need to set the visible devices ( |
Beta Was this translation helpful? Give feedback.
Hi,
You can easily increase the amount of work for the tgv example by using one of the larger meshes in the example folder (
mesh_file
in the case file), and the length can be controlled by reducingT_end
. However, when increasing the problem size, you might need to reducedt
as well to avoid divergence (see for example https://github.com/ExtremeFLOW/MSA-tests/blob/main/tgv/tgv.case). The32768.nmsh
mesh should be enough to keep two - four GPUs busy, while262144.nmsh
can be used to scale out from 8 - 16 GPUs to a couple of hundreds of devices.For multiple GPUs per node, you need to set the visible devices (
CUDA_VISIBLE_DEVICES
orROCR_VISIBLE_DEVICES
) in the job script, for example, to$…