PipeEncodeImpact: Add CV #423
Labels
Status: Needs Discussion
We still need to think about what the solution should look like
Comments
sumny added the Status: Needs Discussion label on May 28, 2020
Can we have some discussion about how to implement this? "Training" the encoding happens here, and imo it is not straightforward to swap that out for a resampled/cross-validated version.
In principle what we'd basically do:
Alternatively, this thread advocates adding a small amount of noise to the encoding to avoid overfitting, and it contains some additional interesting info.
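A minimal sketch of the noise idea, in plain Python rather than mlr3pipelines code. The linked thread's exact scheme is not reproduced here: Gaussian noise, the `sd` value, and the helper name `add_encoding_noise` are all assumptions made for illustration. The noise would be applied to the training rows only, never at prediction time.

```python
import random


def add_encoding_noise(encoded, sd=0.1, seed=0):
    """Return a copy of the encoded training column with Gaussian jitter added.

    `sd` and `seed` are hypothetical parameters chosen for this sketch.
    """
    rng = random.Random(seed)  # local generator, so results are reproducible
    return [value + rng.gauss(0.0, sd) for value in encoded]


# Impact values for six training rows (made-up numbers).
train_encoding = [-3.0, -3.0, -3.0, 3.0, 3.0, 3.0]
noisy_train_encoding = add_encoding_noise(train_encoding, sd=0.1, seed=42)
```

The jitter makes it harder for a downstream learner to treat the encoded column as an exact lookup of the target, but unlike cross-validation it does not remove the leakage itself, and `sd` would need tuning.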
This was referenced Aug 11, 2020
See vtreat Webinar for more info.
Concretely, this would mean that we swap the standard learner for a cross-validated learner.
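To make the idea concrete, here is a rough sketch of a cross-validated (out-of-fold) impact encoding in plain Python. It is not the mlr3pipelines implementation: it assumes a regression target, defines impact as the per-level mean of the target minus the global mean, and uses a simple modulo fold assignment. The names `fit_impact` and `cross_fitted_impact` are hypothetical.

```python
from collections import defaultdict


def fit_impact(levels, y):
    """Map each level to mean(y | level) - mean(y), estimated on the given rows."""
    global_mean = sum(y) / len(y)
    sums, counts = defaultdict(float), defaultdict(int)
    for lvl, target in zip(levels, y):
        sums[lvl] += target
        counts[lvl] += 1
    return {lvl: sums[lvl] / counts[lvl] - global_mean for lvl in sums}


def cross_fitted_impact(levels, y, n_folds=3):
    """Encode every training row with a mapping fitted on the other folds only."""
    n = len(y)
    encoded = [0.0] * n
    for fold in range(n_folds):
        holdout = [i for i in range(n) if i % n_folds == fold]
        rest = [i for i in range(n) if i % n_folds != fold]
        mapping = fit_impact([levels[i] for i in rest], [y[i] for i in rest])
        for i in holdout:
            # Levels unseen in the other folds get a neutral impact of 0.
            encoded[i] = mapping.get(levels[i], 0.0)
    return encoded


levels = ["a", "a", "a", "b", "b", "b"]
y = [1.0, 2.0, 3.0, 7.0, 8.0, 9.0]

# Mapping fitted on all rows: what new data would be encoded with at prediction time.
full_fit = fit_impact(levels, y)
# Out-of-fold encoding: what the downstream learner would see during training.
train_encoding = cross_fitted_impact(levels, y, n_folds=3)
```

The point of the design is that a row's own target value never contributes to that row's encoding during training, which is what removes the leakage. At prediction time the mapping fitted on the full training data is used.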