Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove "suspicious data" functionality from model exploration #178

Merged
merged 6 commits into from
Dec 11, 2024

Conversation

riley-harper
Copy link
Contributor

This is for #176.

I've removed all of the code for keeping track of and saving the suspicious data, and I've removed output_suspicious_TD from the docs. I took this opportunity to rewrite _get_confusion_matrix() to use a single select() instead of 4 filter() + count()s. I did a little bit of initial profiling, and I think this should be a pretty significant speedup.

I renamed some variables which had capital letters like TP and FN, replacing those with their lowercase counterparts, or spelling out "true_positives" and "false_negatives", etc.

@riley-harper riley-harper requested a review from ccdavis December 10, 2024 21:36
@riley-harper
Copy link
Contributor Author

The tests were failing because of an update to scikit-learn 1.6.0, which came out yesterday. It's only a problem with xgboost, so I've added an additional requirement to the xgboost extra, and that seems to have fixed things.

Copy link

@ccdavis ccdavis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, just what I expected. The get_confusion_matrix() is much better.

@riley-harper riley-harper merged commit c1f0d8c into v4-dev Dec 11, 2024
6 checks passed
@riley-harper riley-harper deleted the no-suspicious-data branch December 11, 2024 17:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants