-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Impact of score_threshold on results #76
Comments
Yes, that's correct. The output will be a de-replicated set of MAGs with score above 0 and MAGs with a score below 0 will not be reported. Following up with CheckM is a good idea. |
Great, thanks. I'll follow up with some alternative binners first before resorting to this approach. |
One additional question: DAS Tool is still removing bins that are presumably contaminated or megabins even with the score threshold of zero. I am guessing that behavior is probably what I want. How should I interpret the negative scores though? |
Basically, a negative score means a high fraction of redundant single copy genes (SCGs). The score is calculated based on the number of unique- and redundant SCGs per bin. In case of a high number of redundant SCGs, the number of unique SCGs may get outweighed and the score can turn negative. You can always check the |
I have the same question. My main goal is to dereplicate and not necessarily score the quality as I do that with CheckM post hoc. Would you recommend setting the score to 0 in this case? |
Hi,
Thanks for your work on DAS_Tool, I have found it very useful in the past! I do have one question about the impact of the score threshold on the output. I noticed that DAS Tool is (correctly) discarding a lot of low quality bins for some of my samples. I generally like to do post-hoc assembly quality filtering with CheckM, so I was wondering if setting the score_threshold = 0 means that DAS Tool effectively just dereplicates the input MAGs only + discards any MAG with a negative score?
Thanks for your help!
The text was updated successfully, but these errors were encountered: