Bruce-Lee-LY / cuda_hook Star 105 Code Issues Pull requests Hooked CUDA-related dynamic libraries by using automated code generation tools. gpu cublas nvidia nvml cudnn auto-generate cufft cuda-driver cusolver code-generate curand cusparse nvrtc cublaslt nvtx nvjpeg cuda-hook cuda-hijack cudart nvblas Updated Dec 12, 2023 C
nghiapq77 / face-recognition-cpp-tensorrt Star 66 Code Issues Pull requests Face Recognition with RetinaFace and ArcFace. opencv cpp sqlite face-recognition face-detection crow tensorrt arcface jetson-nano retinaface cublaslt Updated May 4, 2022 C++
zhaocc1106 / cuxx-programing Star 0 Code Issues Pull requests cuda、cublas、cublaslt、cusparse... cuda cublas cusparse cublaslt Updated Apr 24, 2024 Cuda
vadimkantorov / fastmlp Star 0 Code Issues Pull requests [WIP] PyTorch bindings for cublasLt with an example of quantized i8f16 MLP pytorch mlp quantized-neural-networks cublaslt Updated Aug 23, 2023