Description
Is your feature request related to a problem?
This feature request is part of the leading segment merge problem.
At the moment, incremental insertion of new nodes during graph construction in a merge is done by choosing a leading segment and then adding the new nodes to its existing graph.
However, this approach currently has a couple of limitations:
- Deletes
- Quantization
The quantization case is especially problematic, because quantization is important for constructing graphs in a RAM-constrained environment.
What solution would you like?
We would like to see incremental graph construction with a leading segment on PQ-quantized graphs, using the following technique:
- Checkpoint the PQ codebooks after each merge.
- Determine whether the drift of the codebook centroids is substantial or whether the codebooks can be reused.
- If the codebook drift is significant, reconstruct the graph.
We believe that, for the most part, after the initial construction of a large graph (1B, for example), adding smaller batches will not cause substantial drift in the PQ codebooks.
In that case the codebooks can be reused and the approach above works.
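
To make the drift check concrete, here is a minimal Python/NumPy sketch of one possible way to measure it. It is not this project's API: the function names, the `(M, K, dsub)` codebook layout, the idea of storing per-centroid counts in the checkpoint, and the `0.05` threshold are all assumptions for illustration. The sketch applies one running-mean k-means update with the new batch and reports the mean centroid displacement relative to the mean centroid norm.

```python
import numpy as np

def assign(codebooks, vectors):
    """Nearest-centroid code per subspace. codebooks: (M, K, dsub)."""
    m, k, dsub = codebooks.shape
    # Split each vector into M subvectors: (M, N, dsub).
    subs = vectors.reshape(len(vectors), m, dsub).transpose(1, 0, 2)
    # Squared distances from every subvector to every centroid: (M, N, K).
    d2 = ((subs[:, :, None, :] - codebooks[:, None, :, :]) ** 2).sum(-1)
    return d2.argmin(-1), subs

def updated_codebooks(codebooks, counts, vectors):
    """One running-mean update of the checkpointed codebooks.

    counts (M, K) holds how many vectors each centroid represented at
    checkpoint time (an assumed extra field in the checkpoint).
    """
    codes, subs = assign(codebooks, vectors)
    sums = codebooks * counts[:, :, None]
    new_counts = counts.copy()
    for j in range(codebooks.shape[0]):
        np.add.at(sums[j], codes[j], subs[j])   # accumulate new subvectors
        np.add.at(new_counts[j], codes[j], 1)   # and their counts
    safe = np.maximum(new_counts, 1)[:, :, None]
    # Centroids that never received a vector keep their old position.
    new = np.where(new_counts[:, :, None] > 0, sums / safe, codebooks)
    return new, new_counts

def codebook_drift(old, new):
    """Mean centroid displacement relative to the mean centroid norm."""
    shift = np.linalg.norm(new - old, axis=-1).mean()
    scale = np.linalg.norm(old, axis=-1).mean()
    return shift / max(scale, 1e-12)

def needs_rebuild(old, new, threshold=0.05):
    """Hypothetical policy: rebuild the graph when drift exceeds threshold."""
    return codebook_drift(old, new) > threshold
```

With a large leading segment the stored counts dominate the running mean, so a small batch moves the centroids very little, which matches the expectation stated above. The threshold would need to be tuned against recall measurements.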
What alternatives have you considered?
NA
Do you have any additional context?
NA