Epic: Enable overcommit #517

vadim2404 · 2023-09-04T15:34:50Z

Motivation

Look at the image:

Resource	Requested	Used
CPU	121	6
RAM	457	53

Let's have at least 2x overcommit (because of the maximum number of pods that can be up and running)
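
As a rough check on the numbers above, the ratio of requested to used resources can be computed directly. This is only an illustrative sketch: the table gives no units, and the helper name `request_to_use_ratio` is made up for this example.

```python
# Requested vs. used figures copied from the table above (units are not stated there).
usage = {
    "CPU": {"requested": 121, "used": 6},
    "RAM": {"requested": 457, "used": 53},
}


def request_to_use_ratio(requested: float, used: float) -> float:
    """How many times larger the reservation is than the actual usage."""
    return requested / used


ratios = {name: request_to_use_ratio(**row) for name, row in usage.items()}
for name, ratio in ratios.items():
    print(f"{name}: requested is {ratio:.1f}x actual usage")
```

Both ratios come out well above 2, which is consistent with asking for at least 2x overcommit.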

DoD

Implementation ideas

TODO

Tasks

List tasks as they're created for this Epic

Other related tasks, Epics, and links

sharnoff · 2023-10-29T18:53:35Z

Here are some design ideas I've considered. I believe the 3rd solution is probably the best one.

1. Implement exclusively within k8s scheduler plugin

Idea: Change the scheduler plugin so that it actually allows resource usage from

Good parts:

Only requires changes in one component; other components don't need to know about it

Bad parts:

If usage from regular pods fills up a node, the base scheduler + cluster-autoscaler will still interpret that correctly.
May have strange consequences with cluster-autoscaler being overly willing to scale up, because its internal scheduler simulation is less permissive than our plugin
Setting would be global, increasing risk of volatility during rollout or revert

2. Implement within neonvm-controller/neonvm-runner

Idea: When a VM is set to a certain amount of memory and/or CPU, we actually set QEMU to use some fixed multiple of that

Good parts:

Has no effect on scheduler or cluster-autoscaler

Bad parts:

Regular pods unaffected
Counterintuitive (e.g. "why does setting it to 3 CPUs actually give it 6?")
Will have knock-on effects on autoscaler-agent's scaling; it must be made aware of this global overcommit factor (even though it's not possible to atomically deploy changes to multiple components)
Requires restart or migration of all VMs in order to get updated neonvm-runner pods
Setting would be global, increasing risk of volatility during rollout or revert
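
To make idea 2 concrete, here is a minimal sketch of what "set QEMU to use some fixed multiple" could look like. Everything named here (`GLOBAL_OVERCOMMIT_FACTOR`, `VmResources`, `qemu_resources`) is hypothetical and not taken from neonvm-controller or neonvm-runner; the factor of 2 is chosen only to match the "3 CPUs actually gives 6" example.

```python
from dataclasses import dataclass

# Hypothetical global setting; the thread does not name it or fix a value.
GLOBAL_OVERCOMMIT_FACTOR = 2.0


@dataclass(frozen=True)
class VmResources:
    cpus: float
    memory_bytes: int


def qemu_resources(requested: VmResources,
                   factor: float = GLOBAL_OVERCOMMIT_FACTOR) -> VmResources:
    """Resources the VM would actually be started with under idea 2."""
    return VmResources(
        cpus=requested.cpus * factor,
        memory_bytes=int(requested.memory_bytes * factor),
    )


requested = VmResources(cpus=3, memory_bytes=12 * 2**30)
actual = qemu_resources(requested)
print(f"requested {requested.cpus} CPUs, QEMU gets {actual.cpus}")
```

Because the multiplication happens below the level the autoscaler-agent sees, the agent would need to know the same factor to reason about the VM's real size, which is the knock-on effect listed above.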

3. Implement via "overcommit" factor per VM

Idea: Add a new VM setting determining an "overcommit" factor that both the scheduler and cluster-autoscaler respect. Kind of a combination of ideas 1 and 2, but per-VM rather than global.

Good parts:

Rollout is gradual, does not require restarting any VMs

Bad parts:

Blocked on #590 (Cleanup: Scheduler should refer to memory in bytes, not memory slots)
Regular pods unaffected
Requires changes in NeonVM CRD, scheduler plugin, cluster-autoscaler, and control plane
- The changes to the CRD are particularly high risk; this is a good motivating example for #580 (Internal feature: Backwards compatibility testing for API changes)
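
One way to picture idea 3 is a per-VM factor that shrinks what the scheduler counts against the node. The sketch below is an assumption-laden illustration, not the design from this Epic: the `overcommit` field, the `reserved_cpus` rule (requested divided by the factor), and the `fits` check are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Vm:
    name: str
    cpus: float
    # Hypothetical per-VM setting; 1.0 means no overcommit.
    overcommit: float = 1.0

    def reserved_cpus(self) -> float:
        """CPU counted against the node (assumption: requested / factor)."""
        return self.cpus / self.overcommit


def fits(node_cpus: float, placed: list[Vm], candidate: Vm) -> bool:
    """True if the candidate's reservation fits in what the placed VMs leave free."""
    reserved = sum(vm.reserved_cpus() for vm in placed)
    return reserved + candidate.reserved_cpus() <= node_cpus


placed = [Vm("a", cpus=4), Vm("b", cpus=4, overcommit=2.0)]
# "a" reserves 4 and "b" reserves 2, so 6 of 8 node CPUs count as reserved.
print(fits(8, placed, Vm("c", cpus=4)))                  # needs 4 more
print(fits(8, placed, Vm("d", cpus=4, overcommit=2.0)))  # needs 2 more
```

Since the factor lives on each VM, it can be changed for a few VMs at a time, which matches the gradual rollout listed under the good parts.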

stradig · 2024-04-09T15:24:27Z

We discussed this today and decided to start increasing the overcommit factor gradually over the next few weeks, in increments of 0.1. We will watch for negative effects and will probably ramp up only to 1.5. To go beyond that, we will prioritize neondatabase/cloud#14114 as a prerequisite, to allow faster reaction to node failures.
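
The ramp described above can be written out as a short list of factors. This is a sketch only: the starting factor of 1.0 (no overcommit) is an assumption, and `ramp_steps` is a hypothetical helper, not part of any rollout tooling.

```python
def ramp_steps(start: float, target: float, step: float = 0.1) -> list[float]:
    """Factors to roll out in order, from the first increase up to and including target."""
    # Count whole steps first to avoid floating-point drift from repeated addition.
    count = round((target - start) / step)
    return [round(start + i * step, 10) for i in range(1, count + 1)]


# Assumed starting point of 1.0; the thread only states the increment and the likely ceiling.
for factor in ramp_steps(1.0, 1.5):
    print(f"set overcommit factor to {factor}, then observe before continuing")
```

Under that assumption the ramp takes five steps, each followed by an observation period before the next increase.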

sharnoff · 2024-04-15T04:18:26Z

Made an initial implementation in #905, still need to test it and self-review. Opened a handful of other PRs while I was poking around in the area:

sharnoff · 2024-06-10T15:11:55Z

Status: #936 is waiting on my response to review; after that, I will need to create a new PR rebasing #905 onto #936.

Found myself wanting these as part of #517, and it's been on the todo list for a while from #764. Some of the tests currently aren't working as expected. They are configured to pass, acting as if the undesired outcome is expected. The bugfix will be in a separate PR.

vadim2404 added the t/Epic Issue type: Epic label Sep 4, 2023

sharnoff mentioned this issue Oct 23, 2023

Epic: Scheduler-triggered migration informed by CPU/memory/disk metrics #581

Open

sharnoff added the t/feature Issue type: feature, for new features or requests label Oct 29, 2023

sharnoff mentioned this issue Oct 29, 2023

Cleanup: Scheduler should refer to memory in bytes, not memory slots #590

Closed

sharnoff mentioned this issue Oct 30, 2023

Bug: VMs have low-priority CPU use, distributed equally under load (not based on VM size!) #591

Open

sharnoff mentioned this issue Apr 14, 2024

agent,api: Remove custom Format()s for complex types #902

Merged

sharnoff self-assigned this Apr 15, 2024

This was referenced Apr 15, 2024

plugin: Replace readClusterState with existing watch events #904

Merged

Add VirtualMachine overcommit factors #905

Closed

sharnoff mentioned this issue May 20, 2024

plugin: Transaction-based speculative reserve and logic unification #936

Merged

sharnoff mentioned this issue Aug 6, 2024

plugin: Add basic unit tests for resourceTransitioner #1023

Merged

sharnoff mentioned this issue Feb 27, 2025

Add VirtualMachine overcommit factors #1289

Merged

sharnoff closed this as completed in #1289 Mar 7, 2025
