Bug Fixes in sampler.py to correct sequence score mismatch #14

carlossolugen · 2024-10-29T17:20:19Z

I identified and fixed several bugs in the sampler.py file which resulted in:

Accepting proposed sequences even when accepted is False.
Lost alignment between the sequences in the chains and their scores.
In combination, these issues resulted in a mismatch between the scores returned by the sampler, and those obtained by rescoring the same sequences with the model used for sampling.

The fixes make sure that only proposed sequences for which accepted is True are accepted, and that the sampler state is correctly updated to reflect this, ensuring ongoing alignment between the sequences and their scores throughout the optimization.

To reproduce the issue, it suffices to run EvoProtGrad with any expert, e.g., ESM2, to produce all the sequences and scores found, and then rescoring the sequences with the same expert. You will find that a decent percent of sequences end up with non-matching scores.

I will create a pull request shortly with my bugfixes to sampler.py.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug Fixes in sampler.py to correct sequence score mismatch #14

Bug Fixes in sampler.py to correct sequence score mismatch #14

carlossolugen commented Oct 29, 2024

Bug Fixes in sampler.py to correct sequence score mismatch #14

Bug Fixes in sampler.py to correct sequence score mismatch #14

Comments

carlossolugen commented Oct 29, 2024