Reprex for producing a labelled dataset such as fake_1000_labels.csv #2082
Unanswered
aalexandersson
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
How to create and use labelled data is an important use case to me. While truth_space_table_from_labels_column has bug #2059, we rely on linker.truth_space_table_from_labels_table for labelled data.
It means that we must compute the score for each labelled pair, and use that as the basis for TP, TN, FP, FN. A good example for how to use labelled data is charts/threshold_selection_tool_from_labels_table. It uses splink_dataset_labels for accessing the fake_1000_labels.csv dataset, However, there is no reproducible example (reprex) for how to actually produce it or any other labelled dataset.
Beta Was this translation helpful? Give feedback.
All reactions