If you set the image to gcr.io/cos-cloud/cos-gpu-installer:latest and set the NVIDIA_DRIVER_VERSION environment variable to the driver version you want, it should work. It works for me with 470.82.01.
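For anyone following along, here is a rough sketch of what that change might look like in the daemonset manifest. Only the image and the NVIDIA_DRIVER_VERSION variable come from the comment above. The container name and its placement under initContainers are assumptions, so check them against the actual daemonset-nvidia-v450.yaml before applying anything.

```yaml
# Excerpt only, not a complete manifest. Keep the rest of the daemonset
# (volumes, securityContext, tolerations, and so on) as it is in the repo.
spec:
  template:
    spec:
      initContainers:                       # assumed location of the installer container
      - name: nvidia-driver-installer       # assumed name; match the one in your manifest
        image: gcr.io/cos-cloud/cos-gpu-installer:latest
        env:
        - name: NVIDIA_DRIVER_VERSION
          value: "470.82.01"                # quoted so YAML treats it as a string
```

The installer may only support certain driver versions on a given COS release, so a version other than the one reported working here might not install.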
Also, I don't think the entrypoint and Dockerfile for Ubuntu are valid anymore. I've attempted the install steps in the script manually on an Ubuntu node and it doesn't work.
Would it be possible for the repo maintainers to provide the Dockerfile and any scripts used to build the image that this daemonset uses? https://github.com/GoogleCloudPlatform/container-engine-accelerators/blob/master/nvidia-driver-installer/cos/daemonset-nvidia-v450.yaml
The reason for this request is that I'd like to install a specific version (470.57.02) of the NVIDIA drivers on a GKE cluster running Container-Optimized OS with containerd. The official GKE documentation provides this daemonset, which installs an older driver version. I assume daemonset-nvidia-v450.yaml in this repo can be modified to install a specific driver by changing this line to an appropriate image:
However, I cannot find the source code for this image, so it is not clear how I can install a different Nvidia driver version.
For example, for GKE Ubuntu images, this repo provides the Dockerfile and entrypoint.sh source code. Would it be possible to share the COS equivalent?