NVIDIA
View GPU utilization (Tesla M10 + VMware)
I have 3 Tesla M10 GPUs on three Cisco UCS hosts. They are being used in one pool in Horizon 7 across 80+ separate VMs. I have been tasked with seeing how much the GPUs are being utilized. When I run nvidia-smi it shows everything at 100%, but I have a hard time believing that, considering many of the VMs are not currently running. Is there something I'm missing?

#1
Posted 01/23/2018 07:22 PM   
Where do you run nvidia-smi? Host or guest? Which profile are you using?
I don't believe that all of your GPUs are running at 100% all the time...
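
If it helps to compare numbers between host and guest, here is a minimal Python sketch that wraps nvidia-smi's CSV query mode and prints nothing on its own; it just returns one record per GPU. The `--query-gpu` and `--format=csv,noheader,nounits` options are standard nvidia-smi options, but the function names (`parse_gpu_csv`, `query_gpus`) and the choice of fields are my own for illustration, and it assumes Python and nvidia-smi are both available wherever you run it.

```python
import csv
import io
import subprocess

# Fields requested from nvidia-smi; this selection is an illustrative choice.
QUERY_FIELDS = ["index", "name", "utilization.gpu", "memory.used", "memory.total"]


def _to_int(value):
    """Return an int, or None when nvidia-smi reports a non-numeric value such as [N/A]."""
    value = value.strip()
    return int(value) if value.isdigit() else None


def parse_gpu_csv(text):
    """Parse text produced with --format=csv,noheader,nounits into a list of dicts."""
    rows = []
    for record in csv.reader(io.StringIO(text), skipinitialspace=True):
        if len(record) != len(QUERY_FIELDS):
            continue  # skip blank or malformed lines
        index, name, util, mem_used, mem_total = record
        rows.append({
            "index": _to_int(index),
            "name": name.strip(),
            "util_pct": _to_int(util),
            "mem_used_mib": _to_int(mem_used),
            "mem_total_mib": _to_int(mem_total),
        })
    return rows


def query_gpus():
    """Run nvidia-smi once and return the parsed per-GPU records."""
    cmd = [
        "nvidia-smi",
        "--query-gpu=" + ",".join(QUERY_FIELDS),
        "--format=csv,noheader,nounits",
    ]
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return parse_gpu_csv(result.stdout)
```

Calling `query_gpus()` on the host and again inside a guest should make it easier to see whether the 100% figure is reported per physical GPU or only from one side.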

#2
Posted 01/29/2018 09:43 PM   
Have you tried installing and running the nice free utility, GPUProfiler?

-=Tobias

#3
Posted 01/31/2018 11:41 PM   
As you're running VMware, if you look in the GRID Software Portal (where you downloaded the software from), under Product Information > 5.2 > ... There's an "NVIDIA Virtual GPU Management Pack for vRealize Operations".

This will give you some great monitoring capabilities if you're running vROps. Otherwise, as Tobias mentions, GPUProfiler on GitHub is very good as well!

Regards

#4
Posted 02/07/2018 07:52 PM   
Have you figured this one out? I am having problems with my P40 cards doing the same thing.

#5
Posted 02/24/2018 01:57 AM   
Let me guess: you are running PCoIP with Horizon?

See release notes:
http://docs.nvidia.com/grid/5.0/grid-vgpu-release-notes-vmware-vsphere/index.html#bug-1735009-high-gpu-use-active-horizon-sessions

This is a known issue from VMware that has been open for 3 years now. They just won't fix it, as they no longer seem to invest in PCoIP.

Regards

Simon

#6
Posted 02/24/2018 08:39 AM   
Yes, we are using PCoIP. Sounds like that entry in those release notes you linked could be the issue.

#7
Posted 02/28/2018 02:28 PM   