Performance-portable, length-agnostic SIMD with runtime dispatch
Updated May 31, 2024 - C++
Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
A lightning-fast memory pattern scanner, capable of scanning gigabytes of data per second
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Bilinear image filtering implemented with SSE4, AVX2 and AVX512.
➗ SIMD-accelerated bitwise Hamming distance Python module for hexadecimal strings
A collection of high-speed non-cryptographic hashing algorithms
Minify JSON files fast! Supports comments. Uses D and C with AVX2 and SSE4.1 SIMD.
Agenium Scale vectorization library for CPUs and GPUs
Here, I mainly work on a console frontend known as SDECF (SDE Console Frontend) for Intel's instruction set emulator, SDE. The current version of SDE is available from Intel's developer site at https://software.intel.com/content/www/us/en/develop/articles/intel-software-development-emulator.html if the one in my release builds is out of date. Mor…
A GPU-based Graph500 implementation providing compressed data movements.
Fast CRC32C calculations in Swift, using Intel SSE 4.2 hardware acceleration when available.
Tiny, optimized, game-oriented math framework
What features do your CPU and OS support?