Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor AdapterCounter #220

Merged
merged 3 commits into from
Dec 17, 2024
Merged

Refactor AdapterCounter #220

merged 3 commits into from
Dec 17, 2024

Conversation

rhpvorderman
Copy link
Owner

@rhpvorderman rhpvorderman commented Dec 16, 2024

Checklist

  • Pull request details were added to CHANGELOG.rst

  • Documentation was updated (if needed)

  • Simplify data structures to allow for easier vectorization.

  • Utilize dynamic dispatch to write a single AVX2 algorithm, which can handle cases of 2,3 or 4 matchers with equal efficiency.

  • No more reliance on #ifdef __SSE2__ throughout the code.

Fixes #211
Fixes #171 . The autovectorization is simply is not productive enough. A hand-written avx2 implementation is much faster than whaver auto-vectorization can produce. I tested this using GCC14. As such it is better to write a separate ARM-based implementation when the need arises.

@rhpvorderman rhpvorderman force-pushed the refactoradaptercounter branch from 65af499 to 1927740 Compare December 16, 2024 19:03
@rhpvorderman rhpvorderman merged commit c335a9b into develop Dec 17, 2024
17 checks passed
@rhpvorderman rhpvorderman deleted the refactoradaptercounter branch December 31, 2024 14:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
1 participant