Skip to content

Decoy Implementation and Additional Selection Methods in Snekmer Learn/Apply #127

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 10 commits into
base: main
Choose a base branch
from

Conversation

jjacobson95
Copy link
Collaborator

This PR brings Snekmer Learn/Apply up to date with numerous developments that have occurred since the last update.

These updates include:

  • Reverse Sequence Decoys
  • Family Specific Thresholds
  • Additional Selection methods (such as 'greatest_distance')
  • Added an output directory from Learn called "apply_inputs" that consolidates the 3 key files needed for Apply.
  • Additional misc bug fixes.

To do: use snakefmt for linting - I did a quick test and it broke formatting, so I'll come back to this when we want to merge the PR.


Additional possible updates in this or a future PR include:

  • Many to Many 'Apply': allow users to use sequences for hits instead of families.
  • Inclusion of values below the decoy threshold. Currently these are excluded, we should instead add a flag and let the user decide if they want to use them.
  • Multi-hit function: Allow there to be multiple hits for a query sequence. This may be done by utilizing the family specific cutoffs and allowing all scores above a threshold to pass.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant