amino acid support #23

jianshu93 · 2022-06-22T18:59:11Z

Hello COBS team,

Is there any possibility of also supporting amino acid sequences (an alphabet of 20 letters instead of 4 for DNA)? I have found this to be extremely useful for genome query/search.

Thanks,

Jianshu

iqbal-lab · 2022-06-22T20:03:14Z

It should already support amino acid sequences, in the sense that it indexes free text. Try passing in a .txt file of amino acids and see how you go.

jianshu93 · 2022-06-22T20:36:50Z

Thanks for the quick response. I saw that a FASTA file in the test folder, for example sample6.fasta, is an amino acid sequence. I will try it now.

Thanks

Jianshu

iqbal-lab · 2022-06-22T20:45:15Z

Note that by default it tries to canonicalize k-mers (taking each k-mer and its reverse complement and choosing the lexicographically smaller of the two). See this text from the README:

With the flag --no-canonicalize any letters or text can be indexed.
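To illustrate why canonicalization matters here, below is a minimal Python sketch of the idea described above. It is not COBS's actual implementation; the function names (`reverse_complement`, `canonical`, `kmers`) and the `canonicalize` parameter are hypothetical and chosen only for this example. The point is that reverse complementing is only defined for DNA letters, so for amino acid text the k-mers should be taken as-is.

```python
# Illustrative sketch only; names are hypothetical, not COBS's API.

# Translation table mapping each DNA base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(kmer: str) -> str:
    """Complement every base, then reverse the string."""
    return kmer.translate(_COMPLEMENT)[::-1]


def canonical(kmer: str) -> str:
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    return min(kmer, reverse_complement(kmer))


def kmers(seq: str, k: int, canonicalize: bool = True):
    """Yield every k-mer of seq, canonicalized only if requested."""
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        yield canonical(kmer) if canonicalize else kmer


# DNA: a k-mer and its reverse complement collapse to the same key.
print(canonical("TTTG"), canonical("CAAA"))

# Amino acids: reverse complement has no meaning, so keep k-mers unchanged.
print(list(kmers("MKVL", 3, canonicalize=False)))
```

With canonicalization turned off, protein k-mers are indexed exactly as they appear in the input, which is the behaviour the README describes for --no-canonicalize.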
