Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ImbalanceComparator should also check imbalance compared to entire dataset #72

Open
kwinkunks opened this issue Sep 25, 2023 · 0 comments
Labels
enhancement New feature or request

Comments

@kwinkunks
Copy link
Member

It would be interesting to compare the imbalance in train and in test to the imbalance in the combined train+test dataset, to make sure they are more or less the same.

E.g. imagine both train and test were imbalanced, but in a different way (a) to each other and (b) to the combined dataset.

Storing the histogram should enable this, if we don't already do it.

@kwinkunks kwinkunks added the enhancement New feature or request label Sep 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant