
Add support for partition based scaling on the kafka scaler #6558

Open
patelvp wants to merge 1 commit into main from add-partition-based-scaling-kafka-scaler

Conversation


@patelvp patelvp commented Feb 20, 2025

Issue: #2581

Scaling Kafka consumers should be done in factors of the partition count on the topic. This ensures that the partitions are evenly spread across all consumers. If the partitions are not evenly spread, we run the risk of some partitions being consumed faster than others. This PR adds a new property on the Kafka scaler, ensureEvenDistributionOfPartitions. When this property is set to true, the scaler ensures that the number of pods always evenly divides the number of partitions on the topic.
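To illustrate the idea, a minimal sketch (not the PR's actual code; only the FindFactors name comes from the review discussion below): restricting the replica count to a factor of the partition count guarantees that every pod owns the same number of partitions.

package main

import (
	"fmt"
	"sort"
)

// FindFactors returns all factors of n in ascending order.
// Sketch only; the PR's actual implementation may differ.
func FindFactors(n int64) []int64 {
	if n <= 0 {
		return nil
	}
	var factors []int64
	for i := int64(1); i*i <= n; i++ {
		if n%i == 0 {
			factors = append(factors, i)
			if i != n/i {
				factors = append(factors, n/i)
			}
		}
	}
	sort.Slice(factors, func(a, b int) bool { return factors[a] < factors[b] })
	return factors
}

func main() {
	// A topic with 12 partitions can be consumed evenly by
	// 1, 2, 3, 4, 6, or 12 pods (e.g. 3 pods -> 4 partitions each).
	fmt.Println(FindFactors(12)) // [1 2 3 4 6 12]
}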

Checklist

Fixes #2581

QA:
Tested this locally by pushing an image to a local kind cluster. The Kafka consumers and producers run outside the cluster, to allow granular and quick control over production and consumption rates.
Plotted the pod count and Kafka partition lag in Grafana.
(Screenshot, 2025-02-19: Grafana plot of pod count and Kafka partition lag.)

@patelvp patelvp requested a review from a team as a code owner February 20, 2025 06:26
@patelvp patelvp force-pushed the add-partition-based-scaling-kafka-scaler branch from a548ac9 to c46e703 on February 20, 2025 17:37
@JorTurFer (Member)

@dttung2905 @zroubalik, you are the Kafka experts xD
Does this make sense?

@dttung2905 dttung2905 (Contributor) left a comment

Thank you very much for this PR. Personally, I like the direction this PR is heading. Just one small comment for my understanding.

Comment on lines +1017 to +1022
for _, factor := range factors {
	// factors comes back from FindFactors in ascending order, so this
	// returns the smallest factor whose capacity covers the total lag.
	if factor*lagThreshold >= totalLag {
		return factor
	}
}
// Fallback: cap at one pod per partition.
return totalTopicPartitions
@dttung2905 (Contributor):

Just trying to understand and confirm the logic here. Are we trying to get the smallest number of pods that satisfies the condition factor*lagThreshold >= totalLag? I ask because the factors array is sorted in ascending order by FindFactors.

A follow-up question: what if it were sorted in descending order, to provide a more aggressive strategy to reduce lag 🤔?

@patelvp (Author):

> Just trying to understand and confirm the logic here. Are we trying to get the smallest number of pods that satisfies the condition factor*lagThreshold >= totalLag? I ask because the factors array is sorted in ascending order by FindFactors.

Yep, just trying to find the smallest number of pods that will satisfy the condition.

> What if it were sorted in descending order, to provide a more aggressive strategy to reduce lag?

In that case we would eventually either have to flip the conditional factor*lagThreshold >= totalLag, which ends up giving the same number of pods as the proposed logic, or we would risk running far more pods than we ideally want. Consider a topic with 100 partitions: a descending scan would immediately scale to 100 pods even for a lag of 100 and a lagThreshold of 10.
Not sure if I am missing anything in what you mean by an aggressive strategy?
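A small sketch of that arithmetic (replicasFor is a hypothetical name, not the PR's function, and it assumes the FindFactors sketch shown earlier):

// Hypothetical helper applying the quoted loop to the ascending factors.
// Assumes FindFactors from the earlier sketch is in the same package.
func replicasFor(totalLag, lagThreshold, totalTopicPartitions int64) int64 {
	for _, factor := range FindFactors(totalTopicPartitions) {
		if factor*lagThreshold >= totalLag {
			return factor // smallest factor whose capacity covers the lag
		}
	}
	return totalTopicPartitions
}

// replicasFor(100, 10, 100) == 10: the factors of 100 ascend
// 1, 2, 4, 5, 10, 20, 25, 50, 100, and 10 is the first with
// 10*10 >= 100. A descending scan would return 100 immediately.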

Development
Successfully merging this pull request may close these issues: Kafka Partitions Per Pod Scaling
3 participants