ensure preservation of RX tag; UMI-aware ref-based assembly #164

tomkinsc · 2020-10-15T22:42:56Z

When reads are demultiplexed with a read structure that indicates the presence of UMIs (one that includes M; e.g. 146T8B9M8B146T for a 9bp UMI), the resulting bam files include per-read UMI sequences via the RX tag. Our current picard-based demultiplexing handles UMIs as expected (the RX tags are present), but the ref-based assembly yields aligned bams with the RX tags conspicuously missing. We should figure out at what stage the RX tags are first lost and find a way to preserve them or re-annotate the reads in the output bam file.
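One way to narrow down where the tags disappear is to measure, after each pipeline stage, what fraction of records still carry RX. The sketch below is only an illustration and not part of the existing pipeline: it assumes the stage output has been converted to SAM text (for example with `samtools view`), and the helper names `rx_tag_of` and `rx_fraction` are hypothetical.

```python
import re

# Optional SAM fields have the form TAG:TYPE:VALUE; the UMI is stored as RX:Z:<sequence>.
RX_FIELD = re.compile(r"^RX:Z:(\S+)$")


def rx_tag_of(sam_line):
    """Return the RX (UMI) value of one SAM alignment line, or None if absent."""
    fields = sam_line.rstrip("\n").split("\t")
    # Columns 1-11 are mandatory; optional fields start at column 12.
    for field in fields[11:]:
        match = RX_FIELD.match(field)
        if match:
            return match.group(1)
    return None


def rx_fraction(sam_lines):
    """Fraction of alignment records (header and blank lines skipped) that carry RX."""
    total = 0
    tagged = 0
    for line in sam_lines:
        if not line.strip() or line.startswith("@"):
            continue
        total += 1
        if rx_tag_of(line) is not None:
            tagged += 1
    return tagged / total if total else 0.0
```

Running this on the output of each intermediate step (demultiplexed bam, post-alignment bam, and so on) should show the first stage at which the fraction drops to zero.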

Related to the above, we should have a UMI-aware ref-based assembly pipeline that uses picard's UmiAwareMarkDuplicatesWithMateCigar to deduplicate reads, taking UMIs into account. In this pipeline, the reads will need to be aligned to the reference once to determine alignment coordinates, then deduplicated via UmiAwareMarkDuplicatesWithMateCigar, then aligned again. The ultimate output should contain aligned reads that are distinct by UMI and position (while tolerating some level of mismatch in the UMIs).
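To make the intended end result concrete, here is a toy sketch of "distinct by UMI and position, tolerating UMI mismatches". It is not how UmiAwareMarkDuplicatesWithMateCigar works internally; it is a simplified greedy clustering by Hamming distance, and the `Read` record, the `dedup` function, and the `max_mismatches` parameter are all hypothetical names chosen for illustration.

```python
from collections import defaultdict, namedtuple

# Hypothetical minimal read record: just what is needed for deduplication.
Read = namedtuple("Read", "name ref pos reverse umi")


def hamming(a, b):
    """Number of mismatching bases; UMIs of unequal length never match."""
    if len(a) != len(b):
        return max(len(a), len(b))
    return sum(x != y for x, y in zip(a, b))


def dedup(reads, max_mismatches=1):
    """Keep one read per (position, UMI cluster).

    Reads are grouped by reference, start position, and strand. Within a
    group, a read is kept only if its UMI differs by more than
    max_mismatches from every read already kept (greedy, first seen wins).
    """
    by_position = defaultdict(list)
    for read in reads:
        by_position[(read.ref, read.pos, read.reverse)].append(read)

    kept = []
    for key in sorted(by_position):
        representatives = []
        for read in by_position[key]:
            if all(hamming(read.umi, rep.umi) > max_mismatches
                   for rep in representatives):
                representatives.append(read)
        kept.extend(representatives)
    return kept
```

A real implementation would also need to handle mate coordinates and pick the representative by base quality rather than input order; this sketch only shows the grouping logic.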

This is all in support of tiled amplicon sequencing and iSNV analysis.


tomkinsc · 2023-09-21T16:02:14Z

UMI-tools may also be useful for some of the UMI-related processing tasks.

tomkinsc · 2023-11-16T16:39:52Z

for collapsing UMIs for unmapped reads:
https://github.com/abossers-uu/dedupUMI

tomkinsc · 2024-08-08T17:23:29Z

Here are a few related links to revisit:

https://github.com/vpc-ccg/calib?tab=readme-ov-file
https://github.com/jeffdaily/parasail-python (maybe, for insert comparison scoring within UMI clusters)
https://github.com/DormanLab/AmpliCI?tab=readme-ov-file (and https://github.com/DormanLab/AmpliCI/blob/master/daumi.md)
https://github.com/BiodataAnalysisGroup/UMIc
https://github.com/ksahlin/isONclust?tab=readme-ov-file
