[BREAKING] Only store hash of distributions over time #12

dee-kryvenko · 2025-02-18T22:17:16Z

The Evaluation resource has been storing the list of replicas in each
history record. That still blows up the size of the object when the
number of replicas is large. This change removes the distribution from
history records on evaluations. The idea is to store the last known
projected winning distribution and compare it to the current one; if the
current one becomes the winner, we store it instead. At no point should
we need to know any non-winning distribution beyond its hash, how many
times we have seen it, and when we last saw it.
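
A minimal sketch of the idea, not the actual implementation. The shape of
a distribution (a mapping of replica to shards), the names
`distribution_hash`, `HistoryRecord`, `EvaluationStatus` and `observe`,
and the tie-breaking rule (highest seen count, then most recent) are all
assumptions made for illustration:

```python
import copy
import hashlib
import json
from dataclasses import dataclass, field
from datetime import datetime
from typing import Optional

# Hypothetical shape of a distribution: replica name -> list of shards.
Distribution = dict[str, list[str]]


def distribution_hash(distribution: Distribution) -> str:
    """Stable hash of a distribution; key order does not affect the result."""
    payload = json.dumps(distribution, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


@dataclass
class HistoryRecord:
    # Only the hash and counters are kept, never the distribution itself.
    hash: str
    seen_times: int
    last_seen: datetime


@dataclass
class EvaluationStatus:
    history: dict[str, HistoryRecord] = field(default_factory=dict)
    # The only full distribution that is stored: the last known winner.
    projection: Optional[Distribution] = None

    def observe(self, current: Distribution, now: datetime) -> None:
        """Record one sighting of `current` and update the stored winner."""
        h = distribution_hash(current)
        record = self.history.get(h)
        if record is None:
            record = HistoryRecord(hash=h, seen_times=0, last_seen=now)
            self.history[h] = record
        record.seen_times += 1
        record.last_seen = now

        # Assumed rule: highest seen count wins, ties go to the most recent.
        winner = max(
            self.history.values(),
            key=lambda r: (r.seen_times, r.last_seen),
        )
        # The full distribution is only available for the current one,
        # so the stored projection changes only when the current one wins.
        if winner.hash == h:
            self.projection = copy.deepcopy(current)
```

With this layout the size of each history record stays constant
regardless of the number of replicas; only `projection` grows with it.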

There is an edge case: some time ago there was a heavily favored
distribution that is no longer wanted, but its total seen count is still
higher than that of any current distribution. When that large part of
the history falls out of bounds and is erased, we might not have a
current projection anymore. In that case the evaluator goes into a
not-ready state for a while, until the new projected winning
distribution is clear.
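
The edge case can be illustrated with a small self-contained sketch. The
age-based pruning rule, the `prune` and `resolve_projection` helpers, and
the way readiness is derived are assumptions for illustration only; the
actual evaluator may bound and evaluate its history differently:

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class HistoryRecord:
    hash: str
    seen_times: int
    last_seen: datetime


def prune(
    history: list[HistoryRecord], now: datetime, max_age: timedelta
) -> list[HistoryRecord]:
    """Drop records last seen outside the retention window (assumed rule)."""
    return [r for r in history if now - r.last_seen <= max_age]


def resolve_projection(
    history: list[HistoryRecord], stored_projection_hash: Optional[str]
) -> tuple[bool, Optional[str]]:
    """Return (ready, winner_hash).

    The evaluator is ready only when the winner in the remaining history
    is the distribution whose full content is actually stored.
    """
    if not history:
        return False, None
    winner = max(history, key=lambda r: (r.seen_times, r.last_seen))
    return winner.hash == stored_projection_hash, winner.hash
```

In this sketch, once the record of the old dominant distribution is
pruned, the winner's hash no longer matches the stored projection, so the
result is not ready until the current distribution wins and is stored.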

This change updates the CRDs in place by removing fields, which is OK
because this is still in beta.

dee-kryvenko added 3 commits February 18, 2025 14:35

Add a test case to validate the edge case scenario with unknown projection

8dee454

Add a note to the design doc about the new behavior

ab78a72

dee-kryvenko merged commit b0a7e00 into main Feb 18, 2025
6 checks passed

dee-kryvenko deleted the compact-evaluation-history branch February 18, 2025 22:42
