Make model training fully deterministic

Our investigation in https://github.com/ccao-data/enterprise-intelligence/pull/258 revealed that two model training runs with identical hyperpameters can produce slightly different tree structures, with enough variation to lead to ~80 cards with significant prediction differences.

We should already be using lightgbm in a deterministic fashion, so there are two main possibilities I see:

* Our determinism configuration is not quite correct, and needs to be tweaked (see https://github.com/microsoft/LightGBM/issues/6683)
* There's a bug in lightgbm core

I think the trickiest part of this issue will be generating a reproducible example that we can use to test and confirm that our fix worked.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make model training fully deterministic #373

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Make model training fully deterministic #373

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions