Skip to content

Conversation

andreasbuhr
Copy link

I've seen a runtime decrease of 3.4% on average over three runs in one example raising the optimization level from -O2 to -O3 in gcc.

bwa uses lots of bit shift operators. It seems -O3 makes a difference here, see this trivial example:
https://www.godbolt.org/z/jD_cC5

I aligned one million reads to a mm10 using the command line
bwa aln -t 4 -f reads.sai mm10.fa Andreas_BWA/SRR1519948.1_1000000.fastq
Time went down from 62.8s to 60.6s.

@junaruga
Copy link

+1. Any reason not to merge this PR?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants