Skip to content

Make model training fully deterministic #373

@jeancochrane

Description

@jeancochrane

Our investigation in https://github.com/ccao-data/enterprise-intelligence/pull/258 revealed that two model training runs with identical hyperpameters can produce slightly different tree structures, with enough variation to lead to ~80 cards with significant prediction differences.

We should already be using lightgbm in a deterministic fashion, so there are two main possibilities I see:

I think the trickiest part of this issue will be generating a reproducible example that we can use to test and confirm that our fix worked.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions