Expand model correctness tests #15

Closed
3 tasks done
null-a opened this issue Oct 1, 2019 · 1 comment

null-a commented Oct 1, 2019

  • Check that mu is computed as the correct function of latents & data? (Add tests to check correctness of model's mu computation #52)
  • Check that the response distribution has the correct parameters? (test_expected_response_codegen might already do this to some degree. Perhaps it could be reworked along the lines of test_mu_correctness -- using .fitted('expectation') to generate actual values and comparing them against the output of the mean method of a Pyro distribution; a rough sketch follows this list.)
  • Extend code gen tests to check that the response is observed, and that it comes from the expected family?
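
A rough sketch of what the reworked test for the second item might look like. This is only an illustration: the 'expectation' mode of fitted, the Gaussian response family and the 'sigma' parameter name are assumptions here rather than the actual API, and fitted / get_scalar_param are the helpers used in the snippets in the comment below.

import torch
import pyro.distributions as dist

def check_response_mean(fit):
    # What the package reports as the expected response.
    actual = torch.tensor(fitted(fit, what='expectation')[0])
    # Rebuild the response distribution from the fitted parameters and compare
    # against its mean. A Gaussian response with a scalar 'sigma' parameter is
    # assumed here.
    mu = torch.tensor(fitted(fit, what='linear')[0])
    sigma = torch.tensor(get_scalar_param(fit, 'sigma'))
    expected = dist.Normal(mu, sigma).mean
    assert torch.allclose(actual, expected)
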
null-a added this to the v0.0.1 milestone Oct 10, 2019

null-a commented Oct 11, 2019

Check that mu is computed as the correct function of latents & data?

Here's one way we might do this: define test cases that contain something like the following:

import pandas as pd

# `defm` comes from the package under test; its import is assumed here.
df = pd.DataFrame({
    'y': [0., 0.],
    'a': pd.Categorical(['a0', 'a1']),
    'b': pd.Categorical(['b0', 'b1']),
})
model = defm('y ~ 1 | a:b', df)

# Expected value of mu, written directly as a function of the data and the
# sampled coefficients.
def expected(df, coef):
    return (((df['a'] == 'a0') & (df['b'] == 'b0')) * coef('r_a:b[a0_b0,intercept]') +
            ((df['a'] == 'a1') & (df['b'] == 'b1')) * coef('r_a:b[a1_b1,intercept]'))

... which would allow us to check that generated models correctly compute the location parameter of the response distribution with something like:

from functools import partial
import numpy as np

# `fitted`, `get_scalar_param` and the `numpyro` backend handle come from the package under test.
fit = model.generate(backend=numpyro).prior(num_samples=1)
actual_mu = fitted(fit, what='linear')[0]
expected_mu = expected(df, partial(get_scalar_param, fit)).to_numpy()
print(np.all(np.equal(actual_mu, expected_mu)))

I like this because such tests are essentially deterministic, and they're easy to write. While it wouldn't guarantee that generated models have the correct semantics, it would give us confidence that all backends compute mu in the same way, and provide reassurance when making changes to things like code generation (e.g. #10).
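
To make the cross-backend point concrete, cases like the one above could be collected and parametrized over backends, roughly as follows. This is a sketch only: it assumes pytest, a pyro backend handle alongside numpyro, and that defm, fitted and get_scalar_param can be imported from the package.

from functools import partial

import numpy as np
import pandas as pd
import pytest

# Each case pairs a formula and a data frame with a function computing the
# expected mu from the data and the sampled coefficients.
cases = [
    ('y ~ 1 | a:b',
     pd.DataFrame({
         'y': [0., 0.],
         'a': pd.Categorical(['a0', 'a1']),
         'b': pd.Categorical(['b0', 'b1']),
     }),
     lambda df, coef: (((df['a'] == 'a0') & (df['b'] == 'b0')) * coef('r_a:b[a0_b0,intercept]') +
                       ((df['a'] == 'a1') & (df['b'] == 'b1')) * coef('r_a:b[a1_b1,intercept]'))),
]

# `pyro` and `numpyro` are assumed backend handles; `defm`, `fitted` and
# `get_scalar_param` are assumed to be importable from the package under test.
@pytest.mark.parametrize('backend', [pyro, numpyro])
@pytest.mark.parametrize('formula, df, expected', cases)
def test_mu_correctness(formula, df, expected, backend):
    fit = defm(formula, df).generate(backend=backend).prior(num_samples=1)
    actual_mu = fitted(fit, what='linear')[0]
    expected_mu = expected(df, partial(get_scalar_param, fit)).to_numpy()
    assert np.allclose(actual_mu, expected_mu)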

Eventually we might even consider generating the expected functions from the model definition itself. I guess this would be of most interest if we were also generating model descriptions in statistical notation (#33). If these shared a common implementation (you'd need to generate something like the expected function when generating the math description), then these tests would help convince us that the math and the code we generate are consistent.
