Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ensure preservation of RX tag; UMI-aware ref-based assembly #164

Open
tomkinsc opened this issue Oct 15, 2020 · 3 comments
Open

ensure preservation of RX tag; UMI-aware ref-based assembly #164

tomkinsc opened this issue Oct 15, 2020 · 3 comments

Comments

@tomkinsc
Copy link
Member

When demultiplexed with a read structure indicating presence of UMIs (including M; ex. 146T8B9M8B146T for a 9bp UMI), the resulting bam files include per-read UMI sequences via the RX tag. Our current picard-based demultiplexing handles UMIs as expected (the RX tags are present), but the ref-based assembly yields aligned bams with the RX tags conspicuously missing. We should figure at what stage the RX tags are initially lost and find a way to preserve them or re-annotate the reads in the output bam file.

Related to the above, we should have a UMI-aware ref-based assembly pipeline that makes use of picard's UmiAwareMarkDuplicatesWithMateCigar to deduplicate reads, taking UMIs into account. In this pipeline, the reads will need to be aligned to the reference once to determine alignment coordinates, then deduplicated via UmiAwareMarkDuplicatesWithMateCigar, then aligned again. The ultimate output should contain UMI and position-distinct aligned reads (while tolerating some level of mismatch in the UMIs).

This all is in support of tiled amplicon sequencing and iSNV analysis.

@tomkinsc
Copy link
Member Author

UMI-tools may also be useful for some of the UMI-related processing tasks

@tomkinsc
Copy link
Member Author

for collapsing UMIs for unmapped reads:
https://github.com/abossers-uu/dedupUMI

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant