Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error when running GCHP container v14.4.0 with Singularity #439

Open
cora579 opened this issue Aug 29, 2024 · 3 comments
Open

Error when running GCHP container v14.4.0 with Singularity #439

cora579 opened this issue Aug 29, 2024 · 3 comments
Assignees
Labels
category: Question Further information is requested topic: Containers Relating to environment containers topic: Runtime Related to runtime issues (e.g. simulation stops with error)

Comments

@cora579
Copy link

cora579 commented Aug 29, 2024

Your name

Coralina Hernández Trujillo

Your affiliation

CIEMAT, Madrid, Spain

Please provide a clear and concise description of your question or discussion topic.

I have been trying to run GCHP container (version 14.4.0) with Singularity on Slurm as @mgarciavi mentioned in #438 . I think I'm been able to do the set up succesfully. The issue arise when I run the model, I get the following error message just after srun is called:

"./gchp: error while loading shared libraries: libnetcdf.so.19: cannot open shared object file: No such file or directory"

Besides the slurm output (slurm-1844748.txt) where you can see above error, I'm attaching the run script (lanza-gchp.batch_job.txt) with the bash script used to configure settings (setCommonRunSettings.txt).

Because the error seems to be related to loading the internal environment of the container, I'm also attaching the internal_exec file (internal_exec.txt) and the .container_run.rc (container_run_rc.txt) in case it may help.

Thank you for your help!

container_run_rc.txt
internal_exec.txt
lanza-gchp.batch_job.txt
setCommonRunSettings.txt
slurm-1844747.txt

@cora579 cora579 added the category: Question Further information is requested label Aug 29, 2024
@yantosca
Copy link
Contributor

Tagging @yidant @lizziel

@yantosca yantosca added topic: Runtime Related to runtime issues (e.g. simulation stops with error) topic: Containers Relating to environment containers labels Aug 29, 2024
@lizziel
Copy link
Contributor

lizziel commented Sep 26, 2024

@yidant, would you be able to look into this issue?

@yidant yidant self-assigned this Sep 26, 2024
@yidant
Copy link
Contributor

yidant commented Oct 2, 2024

Hi @cora579, sorry that I didn't have a similar cluster environment right now to test this, but I can provide some of my thoughts.

I checked in this container and found there's libnetcdf.so.19 under this path /opt/spack/opt/spack/linux-ubuntu20.04-cascadelake/gcc-10.2.0/netcdf-c-4.9.2-nwa3dhld7f57eewtz2uji2iop4gra5wt/lib. According to the error message, it is likely that the library path is not set up properly to include this path in the runtime environment. To resolve this, you need to source global environment file (/etc/bashrc) in the runtime environment.

Could you please try adding this line to your internal_exec.txt before the line cd /rundir and try again:

. /etc/bashrc

This file should have contained all the environment settings for GCHP. Hope this helps!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: Question Further information is requested topic: Containers Relating to environment containers topic: Runtime Related to runtime issues (e.g. simulation stops with error)
Projects
None yet
Development

No branches or pull requests

4 participants