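What happened: I set up the Kubernetes GPU environment with GPU Operator and then installed the HAMi plugin. The services installed normally, but the GPU count still shows 1, and the GPU is not split inside containers either.
What you expected to happen: The GPU count shows 10 shares (the default), and resources are limited inside containers.
How to reproduce it (as minimally and precisely as possible): Set up the Kubernetes GPU environment with GPU Operator.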
Anything else we need to know?:
nvidia-smi -a on your host
Your docker or containerd configuration file (e.g: /etc/docker/daemon.json): the configuration has no problems
The hami-device-plugin container logs
sudo journalctl -r -u kubelet
dmesg
Environment:
docker version: Docker version 20.10.24, build 297e128
uname -a: ubuntu 5.15.0-124-generic
Have you uninstalled nvidia-k8s-device-plugin before installing HAMi?
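One way to answer that question is to look for a leftover stock device plugin DaemonSet next to HAMi's own. The sketch below is only a heuristic under stated assumptions: it matches DaemonSets by name (the substring `nvidia-device-plugin`, and skipping anything with `hami` in the name), which may not fit every install, and the helper names `find_nvidia_device_plugins` and `load_daemonsets` are made up for illustration.

```python
import json
import subprocess


def find_nvidia_device_plugins(daemonset_list):
    """Return 'namespace/name' for DaemonSets that look like the stock NVIDIA device plugin.

    `daemonset_list` is the parsed JSON of `kubectl get daemonsets -A -o json`.
    """
    matches = []
    for item in daemonset_list.get("items", []):
        meta = item.get("metadata", {})
        name = meta.get("name", "")
        # Assumption: the stock plugin is recognisable by name, and HAMi's
        # own plugin carries "hami" in its name, so it is skipped.
        if "nvidia-device-plugin" in name and "hami" not in name:
            matches.append(f"{meta.get('namespace', '')}/{name}")
    return matches


def load_daemonsets():
    """Fetch all DaemonSets from the current cluster (requires kubectl access)."""
    result = subprocess.run(
        ["kubectl", "get", "daemonsets", "-A", "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(result.stdout)
```

On a machine with cluster access, `find_nvidia_device_plugins(load_daemonsets())` lists the candidates. An empty list only means nothing matched the name pattern, not that no other plugin is registered.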
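I fixed it by uninstalling HAMi with helm uninstall hami -n kube-system and then reinstalling it. The GPU information is now visible, and the isolation information also shows up inside the container.
I found a new problem, possibly caused by powering the host off and on to add and remove GPUs: after a GPU is added or removed, HAMi stops working.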
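To check whether splitting is in effect after a reinstall or a hardware change, the node's allocatable GPU count can be compared with the number of physical GPUs times the split count. This is a minimal sketch, assuming the resource is exposed as `nvidia.com/gpu` and the split count is the default of 10 mentioned above; both are parameters here, and the function names are hypothetical.

```python
def allocatable_gpus(node, resource="nvidia.com/gpu"):
    """Read the allocatable count for `resource` from a parsed node object.

    `node` is the parsed JSON of `kubectl get node <name> -o json`.
    Returns 0 when the resource is not advertised at all.
    """
    allocatable = node.get("status", {}).get("allocatable", {})
    return int(allocatable.get(resource, "0"))


def splitting_active(node, physical_gpus, split_count=10, resource="nvidia.com/gpu"):
    """True when the node advertises physical_gpus * split_count shares."""
    return allocatable_gpus(node, resource) == physical_gpus * split_count


# Illustrative node objects (not taken from the reporter's cluster).
broken_node = {"status": {"allocatable": {"nvidia.com/gpu": "1"}}}
fixed_node = {"status": {"allocatable": {"nvidia.com/gpu": "10"}}}

print(splitting_active(broken_node, physical_gpus=1))  # False
print(splitting_active(fixed_node, physical_gpus=1))   # True
```

Re-running the check with the new `physical_gpus` value after adding or removing a card shows whether the advertised count followed the hardware change.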