@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories

  1. gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python, 2.1k stars, 165 forks

  2. marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

    Python, 783 stars, 63 forks

  3. sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python, 778 stars, 102 forks

  4. PanzaMail Public

    Python, 285 stars, 18 forks

  5. qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python, 272 stars, 22 forks

  6. QUIK Public

    Repository for the QUIK project, enabling the use of 4-bit kernels for generative inference (EMNLP 2024).

    C++, 178 stars, 14 forks

Repositories

Showing 10 of 57 repositories
  • HALO-anon Public
    0 stars, 0 forks, 0 issues, 0 pull requests, updated Apr 1, 2025
  • EvoPress Public
    Python, 20 stars, 2 forks, 0 issues, 0 pull requests, updated Mar 29, 2025
  • PanzaMail Public
    Python, 285 stars, Apache-2.0 license, 18 forks, 4 issues, 5 pull requests, updated Mar 28, 2025
  • torch_cgx Public

    PyTorch distributed backend extension with compression support

    C++, 16 stars, AGPL-3.0 license, 0 forks, 4 issues, 0 pull requests, updated Mar 24, 2025
  • QuEST Public

    Work in progress.

    Jupyter Notebook, 51 stars, MIT license, 4 forks, 2 issues, 0 pull requests, updated Mar 17, 2025
  • gemm-int8 Public

    High Performance Int8 GEMM Kernels for SM80 and later GPUs.

    Python, 6 stars, MIT license, 0 forks, 0 issues, 0 pull requests, updated Mar 11, 2025
  • DarwinLM Public

    Official PyTorch implementation of the paper "DarwinLM: Evolutionary Structured Pruning of Large Language Models"

    Python, 9 stars, 2 forks, 0 issues, 0 pull requests, updated Feb 21, 2025
  • ISTA-DASLab-Optimizers
    Python, 8 stars, Apache-2.0 license, 0 forks, 0 issues, 0 pull requests, updated Feb 19, 2025
  • ScalableMNN Public

    Official Repository for "Scalable Mechanistic Neural Networks" (ICLR 2025)

    Python, 1 star, MIT license, 0 forks, 0 issues, 0 pull requests, updated Feb 19, 2025
  • SPADE Public

    Code for "SPADE: Sparsity Guided Debugging for Deep Neural Networks"

    Jupyter Notebook, 1 star, 3 forks, 1 issue, 0 pull requests, updated Feb 18, 2025