Unclear values for vmware_vm_cpu_usage_average metric #3

pryorda · 2018-06-29T00:24:32Z

From @dannyk81 on February 10, 2018 1:53

I can't quite figure out the values of vmware_vm_cpu_usage_average metric, for example:

vmware_vm_cpu_usage_average{instance="<vcenter>",job="vmware-exporter",vm_name="xyz1"} | 202
vmware_vm_cpu_usage_average{instance="<vcenter>",job="vmware-exporter",vm_name="xyz2"} | 225
vmware_vm_cpu_usage_average{instance="<vcenter>",job="vmware-exporter",vm_name="xyz3"} | 4015
vmware_vm_cpu_usage_average{instance="<vcenter>",job="vmware-exporter",vm_name="xyz4"} | 207
vmware_vm_cpu_usage_average{instance="<vcenter>",job="vmware-exporter",vm_name="xyz5"} | 209

according to this https://www.vmware.com/support/developer/converter-sdk/conv61_apireference/cpu_counters.html

The description of this counter is Amount of actively used virtual CPU, as a percentage of total available CPU, but the values I'm seeing do not seem like percentages.

Any clues?

Copied from original issue: rverchere/vmware_exporter#29

The text was updated successfully, but these errors were encountered:

pryorda · 2018-06-29T00:24:33Z

From @dannyk81 on February 15, 2018 17:47

@rverchere

So, seems like dividing the value by 100 gets the correct result 😄 (compared to figures we see in vCenter)

Perhaps this is due to converting the value to float here: https://github.com/rverchere/vmware_exporter/blob/aeccb035d368dcc8e6bc52628d7eef786345725b/vmware_exporter/vmware_exporter.py#L386

pryorda · 2018-06-29T00:24:34Z

From @dannyk81 on February 15, 2018 17:54

same issue with vmware_vm_mem_usage_average metric, need to divide by 100 the value to get correct result.

dannyk81 · 2018-10-07T18:49:16Z

@pryorda was this actually fixed in #16?

pryorda · 2018-10-07T19:15:29Z

If it it's suppose to be divided by 100 I don't think so. Sent from ProtonMail mobile

…

-------- Original Message --------

On Oct 7, 2018, 12:49 PM, Danny Kulchinsky wrote: ***@***.***(https://github.com/pryorda) was this actually fixed in [#16](#16)? — You are receiving this because you were mentioned. Reply to this email directly, [view it on GitHub](#3 (comment)), or [mute the thread](https://github.com/notifications/unsubscribe-auth/AFYgHBqKYjJqqsvw7nsO_xdTlA9LiU_Xks5uikytgaJpZM4U8Q1_).

dannyk81 · 2018-10-11T03:04:18Z

@pryorda

Here's a sample:

# HELP vmware_vm_cpu_usage_average vmware_vm_cpu_usage_average
# TYPE vmware_vm_cpu_usage_average gauge
vmware_vm_cpu_usage_average{cluster_name="MAD-PROD",dc_name="MAD",host_name="esx-prod-4.foo.bar",vm_name="ELASTICDATA-03.PRD.MOVES.MAD"} 533.0
vmware_vm_cpu_usage_average{cluster_name="MAD-PROD",dc_name="MAD",host_name="esx-prod-4.foo.bar",vm_name="BLUE-KUBM-01.PRD.MOVES.MAD"} 774.0
vmware_vm_cpu_usage_average{cluster_name="MAD-PROD",dc_name="MAD",host_name="esx-prod-4.foo.bar",vm_name="PROMETHEUS-01.MAD"} 1474.0
vmware_vm_cpu_usage_average{cluster_name="MAD-PROD",dc_name="MAD",host_name="esx-prod-4.foo.bar",vm_name="BLUE-KUBW-01.PRD.MOVES.MAD"} 888.0
vmware_vm_cpu_usage_average{cluster_name="MAD-PROD",dc_name="MAD",host_name="esx-prod-4.foo.bar",vm_name="KAF-05.PRD.MOVES.MAD"} 175.0

Above should be percentages, only dividing them by 100 do I get a meaningful value.

Can you confirm it's the same in your case?

pryorda · 2018-10-20T20:07:57Z

Being divided by 100 def looks better. I'll add a PR later tonight.

pryorda · 2018-12-27T17:40:10Z

Ugh, never got around to this but based on the verbiage in here: https://www.vmware.com/support/developer/converter-sdk/conv61_apireference/cpu_counters.html

VM - Amount of actively used virtual CPU, as a percentage of total available CPU. This is the host's view of the CPU usage, not the guest operating system view. It is the average CPU utilization over all available virtual CPUs in the virtual machine. For example, if a virtual machine with one virtual CPU is running on a host that has four physical CPUs and the CPU usage is 100%, the virtual machine is using one physical CPU completely.

and

Memory usage as percentage of total configured or available memory

I'm wondering if we should create some kind of mapping to get the correct values?

Something like:

mem_usage == percent. 
cpu_usage == percent. 

if type ==  percent:
  value / 100

dannyk81 · 2018-12-27T20:24:10Z

The perf metrics data object returned should include the Unit information which can be used to normalize the values.

https://www.vmware.com/support/developer/converter-sdk/conv61_apireference/vim.PerformanceManager.CounterInfo.Unit.html

This way, we should be able to use a generic function to check the Unit of the metric and apply any relevant normalization.

pryorda · 2019-06-19T23:55:55Z

According to this: https://www.vmware.com/support/developer/converter-sdk/conv61_apireference/cpu_counters.html

This is how it gets the value: virtual CPU usage = usagemhz / (# of virtual CPUs x core frequency)

jdelvecchio · 2019-06-24T09:01:31Z

Thanks for you reply ! However, this is what I get running a few tests.

Example for a VM :
Usagemhz : 18953
Number of virtual CPUs : 8
Core frequency : 2593.993 MHz

vmware_vm_cpu_usage_average = 18953 / (8*2593.993) = 0.9133

Then it is multiplied by 10 000 because the value I get in prometheus is 9133 so the correct formula is :
vmware_vm_cpu_usage_average = usagemhz / (# virtual CPUs * core frequency) * 10 000

Or I'm getting the wrong unit in core frequency, because 18953 / (8 * 0.2593993) = 9133

pryorda · 2019-06-26T05:41:14Z

I'm not sure. I dont think we do any mangling of that, but I can double check. I "assume" its the second formula.

jdelvecchio · 2019-06-26T16:16:40Z

Has someone found a way to use this value ? Like how to convert it to %cpu used ?
I don't seem to get anything from it apart from a number that indicates a cpu workload without any real unit.

Would be helpful!

pryorda · 2019-06-27T04:41:24Z

I usually just graph all the vms and find the outliers. I don't alert on cpu usage just load.

dannyk81 · 2019-06-28T17:31:42Z

@jdelvecchio I use this metric in various dashboards and simply divide the value by 100.

running vmware_vm_cpu_usage_average /100>100 returns no data for all our deployments (~1000 VMs), so value is always 0~100.

I wonder if this has something to do with sockets/core? (though it shouldn't) in our case the cores per socket is always 1, how about you?

jdelvecchio · 2019-07-04T14:48:02Z

@dannyk81 running vmware_vm_cpu_usage_average /100>100 also returns no data for me.

I got my maths wrong, it seems to be %used. Thanks to both of you for the details and the help.

As for sockets/core it depends on the VM, we have a bit of both.

dannyk81 · 2019-07-04T16:16:25Z

indeed, this metrics is average %used.

however the value returned does not have a decimal point, hence the need to divide by 100.

pryorda mentioned this issue Jun 29, 2018

Unclear values for vmware_vm_cpu_usage_average metric rverchere/vmware_exporter#29

Open

pryorda added bug Something isn't working help wanted Extra attention is needed labels Sep 11, 2018

pryorda closed this as completed Mar 11, 2021

pryorda mentioned this issue Nov 16, 2021

Can I change the query interval? #299

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unclear values for vmware_vm_cpu_usage_average metric #3

Unclear values for vmware_vm_cpu_usage_average metric #3

pryorda commented Jun 29, 2018

pryorda commented Jun 29, 2018

pryorda commented Jun 29, 2018

dannyk81 commented Oct 7, 2018

pryorda commented Oct 7, 2018 via email

dannyk81 commented Oct 11, 2018

pryorda commented Oct 20, 2018

pryorda commented Dec 27, 2018

dannyk81 commented Dec 27, 2018 •

edited

Loading

pryorda commented Jun 19, 2019

jdelvecchio commented Jun 24, 2019

pryorda commented Jun 26, 2019

jdelvecchio commented Jun 26, 2019 •

edited

Loading

pryorda commented Jun 27, 2019

dannyk81 commented Jun 28, 2019 •

edited

Loading

jdelvecchio commented Jul 4, 2019

dannyk81 commented Jul 4, 2019

Unclear values for vmware_vm_cpu_usage_average metric #3

Unclear values for vmware_vm_cpu_usage_average metric #3

Comments

pryorda commented Jun 29, 2018

pryorda commented Jun 29, 2018

pryorda commented Jun 29, 2018

dannyk81 commented Oct 7, 2018

pryorda commented Oct 7, 2018 via email

dannyk81 commented Oct 11, 2018

pryorda commented Oct 20, 2018

pryorda commented Dec 27, 2018

dannyk81 commented Dec 27, 2018 • edited Loading

pryorda commented Jun 19, 2019

jdelvecchio commented Jun 24, 2019

pryorda commented Jun 26, 2019

jdelvecchio commented Jun 26, 2019 • edited Loading

pryorda commented Jun 27, 2019

dannyk81 commented Jun 28, 2019 • edited Loading

jdelvecchio commented Jul 4, 2019

dannyk81 commented Jul 4, 2019

dannyk81 commented Dec 27, 2018 •

edited

Loading

jdelvecchio commented Jun 26, 2019 •

edited

Loading

dannyk81 commented Jun 28, 2019 •

edited

Loading