U and M training #2249
deanditton started this conversation in General
Replies: 1 comment 1 reply
-
Some suggestions:
# Estimate the prior from a list of deterministic rules; each rule should capture true matches and exclude non-matches
linker.estimate_probability_two_random_records_match(det_rules, recall=0.6)
# Use a bigger sample size for the u estimates
linker.estimate_u_using_random_sampling(max_pairs=1e7)
# Estimate the m values last, once the other parameters anchor the model
for col in columns:
    blocking_rule_for_training = block_on([col])
    linker.estimate_parameters_using_expectation_maximisation(blocking_rule_for_training)
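To make the first two steps concrete, here is a minimal standard-library sketch of the ideas behind them. It is not Splink's implementation: the toy records and the function names `estimate_u` and `prior_from_deterministic_rules` are made up for illustration. It assumes that u is the probability that two non-matching records agree on a column, that random pairs are almost all non-matches, and that the prior is roughly the deterministic-rule match count scaled up by recall and divided by the total number of comparisons.

```python
import itertools
import random

# Hypothetical toy records, for illustration only.
records = [
    {"first_name": "anna", "city": "leeds"},
    {"first_name": "anna", "city": "york"},
    {"first_name": "ben", "city": "leeds"},
    {"first_name": "cara", "city": "leeds"},
]


def estimate_u(records, column, max_pairs, seed=0):
    """Share of randomly sampled record pairs that agree on `column`.

    Random pairs are overwhelmingly non-matches in real data, so this
    share approximates u = P(agreement | non-match). A larger max_pairs
    gives a less noisy estimate.
    """
    rng = random.Random(seed)
    all_pairs = list(itertools.combinations(records, 2))
    sample = rng.sample(all_pairs, min(max_pairs, len(all_pairs)))
    agreements = sum(a[column] == b[column] for a, b in sample)
    return agreements / len(sample)


def prior_from_deterministic_rules(n_rule_matches, recall, n_comparisons):
    """Rough prior that two random records match.

    Assumes the deterministic rules find only true matches and find a
    fraction `recall` of all true matches.
    """
    estimated_true_matches = n_rule_matches / recall
    return estimated_true_matches / n_comparisons


print(estimate_u(records, "city", max_pairs=100))
print(prior_from_deterministic_rules(6, recall=0.6, n_comparisons=1000))
```

With the u values and the prior fixed this way, the expectation-maximisation passes only have the m values left to estimate, which is why the suggestion runs them last.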
-
Hello all,
I would like to ask for some feedback on how we are approaching training the U and M values, if that is okay. Please let me know if this is not the right place for it.
Does my current approach make sense, or am I missing something?
Currently we are linking across two databases with these columns:
This is my training function: