dasaint
Cadet
- Joined
- Jan 26, 2021
- Messages
- 8
Hello,
hope someone can help me out here and maybe this is my own stupidity but i wanted to get clarification on this.
I have a X9SRI-3F Motherboard with a E5-2660 V2 Chip that has an Onboard VGA GPU (Matrox Electronics MGA G200eW) and its set as the priority device in the BIOS. I have added the Telsa P4 to the system and can see the nvidia-smi pickup the unit no problem but the kubectl is not recognizing the GPU... What could i be missing here?? is it because i don't have 2x Telsa P4s i saw it said i needed 2 GPUs which i do have just not similar. i Picked the Telsa P4 b/c it has 2xNVENC Encoders and lets be honest i got it for a heck of a good deal!
My Endgame is to use the Tesla P4 as a Transcoder for Plex Containers. Is there some output I'm missing that might help troubleshoot this please forgive kinda N00b at the new Scale OS so any help would be greatly appreciated.
Output of nvidia-smi
Output of k3s kubectl describe nodes
hope someone can help me out here and maybe this is my own stupidity but i wanted to get clarification on this.
I have a X9SRI-3F Motherboard with a E5-2660 V2 Chip that has an Onboard VGA GPU (Matrox Electronics MGA G200eW) and its set as the priority device in the BIOS. I have added the Telsa P4 to the system and can see the nvidia-smi pickup the unit no problem but the kubectl is not recognizing the GPU... What could i be missing here?? is it because i don't have 2x Telsa P4s i saw it said i needed 2 GPUs which i do have just not similar. i Picked the Telsa P4 b/c it has 2xNVENC Encoders and lets be honest i got it for a heck of a good deal!
My Endgame is to use the Tesla P4 as a Transcoder for Plex Containers. Is there some output I'm missing that might help troubleshoot this please forgive kinda N00b at the new Scale OS so any help would be greatly appreciated.
Output of nvidia-smi
Code:
truenas# nvidia-smi Sat Jul 24 13:05:18 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 460.73.01 Driver Version: 460.73.01 CUDA Version: 11.2 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |===============================+======================+======================| | 0 Tesla P4 Off | 00000000:06:00.0 Off | 0 | | N/A 31C P8 7W / 75W | 0MiB / 7611MiB | 0% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=============================================================================| | No running processes found | +-----------------------------------------------------------------------------+
Output of k3s kubectl describe nodes
Code:
truenas# k3s kubectl describe nodes Name: ix-truenas Roles: control-plane,master Labels: beta.kubernetes.io/arch=amd64 beta.kubernetes.io/os=linux kubernetes.io/arch=amd64 kubernetes.io/hostname=ix-truenas kubernetes.io/os=linux node-role.kubernetes.io/control-plane=true node-role.kubernetes.io/master=true openebs.io/nodeid=ix-truenas openebs.io/nodename=ix-truenas Annotations: csi.volume.kubernetes.io/nodeid: {"zfs.csi.openebs.io":"ix-truenas"} k3s.io/node-args: ["server","--flannel-backend","none","--disable","traefik,metrics-server,local-storage","--disable-kube-proxy","--disable-network-policy",... k3s.io/node-config-hash: BIUIMAIT5RSP6VDWUFNWJISDQUIHGP2EW7PO65M4ARMY7EHXKKGA==== k3s.io/node-env: {"K3S_DATA_DIR":"/mnt/Test-Pool/ix-applications/k3s/data/1fda8eac79455ae721508123989e095a50c209cf7965df5630549292f7916941"} node.alpha.kubernetes.io/ttl: 0 volumes.kubernetes.io/controller-managed-attach-detach: true CreationTimestamp: Sat, 24 Jul 2021 11:53:53 -0700 Taints: <none> Unschedulable: false Lease: HolderIdentity: ix-truenas AcquireTime: <unset> RenewTime: Sat, 24 Jul 2021 13:08:29 -0700 Conditions: Type Status LastHeartbeatTime LastTransitionTime Reason Message ---- ------ ----------------- ------------------ ------ ------- MemoryPressure False Sat, 24 Jul 2021 13:08:09 -0700 Sat, 24 Jul 2021 11:53:51 -0700 KubeletHasSufficientMemory kubelet has sufficient memory available DiskPressure False Sat, 24 Jul 2021 13:08:09 -0700 Sat, 24 Jul 2021 11:53:51 -0700 KubeletHasNoDiskPressure kubelet has no disk pressure PIDPressure False Sat, 24 Jul 2021 13:08:09 -0700 Sat, 24 Jul 2021 11:53:51 -0700 KubeletHasSufficientPID kubelet has sufficient PID available Ready True Sat, 24 Jul 2021 13:08:09 -0700 Sat, 24 Jul 2021 13:03:06 -0700 KubeletReady kubelet is posting ready status. AppArmor enabled Addresses: InternalIP: 192.168.10.196 Hostname: ix-truenas Capacity: cpu: 16 ephemeral-storage: 43673984Ki hugepages-1Gi: 0 hugepages-2Mi: 0 memory: 65827544Ki pods: 110 Allocatable: cpu: 16 ephemeral-storage: 42486051602 hugepages-1Gi: 0 hugepages-2Mi: 0 memory: 65827544Ki pods: 110 System Info: Machine ID: 740a67cc6e0240219481b6a240ee3837 System UUID: 00000000-0000-0000-0000-0cc47a496124 Boot ID: 4331e4d6-8526-4ed9-b268-6ca88e0db3ce Kernel Version: 5.10.42+truenas OS Image: Debian GNU/Linux 11 (bullseye) Operating System: linux Architecture: amd64 Container Runtime Version: docker://20.10.6 Kubelet Version: v1.21.0-k3s1 Kube-Proxy Version: v1.21.0-k3s1 PodCIDR: 172.16.0.0/16 PodCIDRs: 172.16.0.0/16 Non-terminated Pods: (4 in total) Namespace Name CPU Requests CPU Limits Memory Requests Memory Limits Age --------- ---- ------------ ---------- --------------- ------------- --- kube-system openebs-zfs-node-mdfp8 0 (0%) 0 (0%) 0 (0%) 0 (0%) 74m kube-system coredns-7448499f4d-x5bmn 100m (0%) 0 (0%) 70Mi (0%) 170Mi (0%) 74m kube-system openebs-zfs-controller-0 0 (0%) 0 (0%) 0 (0%) 0 (0%) 74m ix-testplex testplex-7dd95c96b8-7644j 0 (0%) 0 (0%) 0 (0%) 0 (0%) 42m Allocated resources: (Total limits may be over 100 percent, i.e., overcommitted.) Resource Requests Limits -------- -------- ------ cpu 100m (0%) 0 (0%) memory 70Mi (0%) 170Mi (0%) ephemeral-storage 0 (0%) 0 (0%) hugepages-1Gi 0 (0%) 0 (0%) hugepages-2Mi 0 (0%) 0 (0%) Events: Type Reason Age From Message ---- ------ ---- ---- ------- Normal Starting 74m kubelet Starting kubelet. Normal NodeHasSufficientMemory 74m (x2 over 74m) kubelet Node ix-truenas status is now: NodeHasSufficientMemory Normal NodeHasNoDiskPressure 74m (x2 over 74m) kubelet Node ix-truenas status is now: NodeHasNoDiskPressure Normal NodeHasSufficientPID 74m (x2 over 74m) kubelet Node ix-truenas status is now: NodeHasSufficientPID Normal NodeAllocatableEnforced 74m kubelet Updated Node Allocatable limit across pods Normal NodeReady 74m kubelet Node ix-truenas status is now: NodeReady Normal Starting 17m kubelet Starting kubelet. Normal NodeHasSufficientMemory 17m kubelet Node ix-truenas status is now: NodeHasSufficientMemory Normal NodeHasNoDiskPressure 17m kubelet Node ix-truenas status is now: NodeHasNoDiskPressure Normal NodeHasSufficientPID 17m kubelet Node ix-truenas status is now: NodeHasSufficientPID Normal NodeAllocatableEnforced 17m kubelet Updated Node Allocatable limit across pods Normal NodeNotReady 17m kubelet Node ix-truenas status is now: NodeNotReady Warning Rebooted 17m kubelet Node ix-truenas has been rebooted, boot id: c1f9007c-2dd8-497e-911a-446ea15d12b6 Normal NodeReady 17m kubelet Node ix-truenas status is now: NodeReady Normal Starting 5m44s kubelet Starting kubelet. Normal NodeHasSufficientMemory 5m44s kubelet Node ix-truenas status is now: NodeHasSufficientMemory Normal NodeHasNoDiskPressure 5m44s kubelet Node ix-truenas status is now: NodeHasNoDiskPressure Normal NodeHasSufficientPID 5m44s kubelet Node ix-truenas status is now: NodeHasSufficientPID Normal NodeAllocatableEnforced 5m44s kubelet Updated Node Allocatable limit across pods Warning Rebooted 5m42s kubelet Node ix-truenas has been rebooted, boot id: 4331e4d6-8526-4ed9-b268-6ca88e0db3ce Normal NodeNotReady 5m41s kubelet Node ix-truenas status is now: NodeNotReady Normal NodeReady 5m31s kubelet Node ix-truenas status is now: NodeReady # truenas#