-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Release-1.31] - Nvidia operator not working correctly #11088
Comments
##Environment Details Infrastructure
Node(s) CPU architecture, OS, and version: Linux 6.4.0-150600.23.17-default x86_64 GNU/Linux Cluster Configuration:
Config.yaml:
Reproduction && Validation
Results: Before from existing release v1.31.1+k3s1 $ sudo cat /var/lib/rancher/k3s/agent/etc/containerd/config.toml
Newest COMMIT ID installation now shows additional nvidia-cdi entries on config.toml
Seems to be required now but isn't documented well on the k3s side yet $ cat operator.yaml
$ cat cuda-add.yaml
$ cat nvidia-pod.yaml
$ cat pytorch-gpu.yaml
$ k exec --stdin --tty pytorch-test -- /bin/bash
|
Backport fix for Nvidia operator not working correctly
The text was updated successfully, but these errors were encountered: