Skip to content

Optimize StoredFields merge strategy based on chunk deletion ratio #16045

@kkewwei

Description

@kkewwei

Description

For StoredField merging, the merge strategy has to switch from the BULK mode to the DOC mode even if only a document is deleted, which significantly increases merging overhead.

StoredField is compressed at the chunk granularity I propose introducing a new merge strategy:

  • For the chunk with no deleted documents, retain copyChunk during merging;

  • For the chunk containing deleted documents, fall back to copyOneDoc.

Furthermore, we can leverage the segment deletion ratio (e.g., a threshold below 5%) as a condition to enable or disable this optimized merging logic.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions