Skip to content
Change the repository type filter

All

    Repositories list

    • 昇腾(Ascend)DRA驱动,专为华为昇腾AI处理器设计的Kubernetes动态资源分配驱动实现。欢迎社区开发者使用、贡献和改进,共同打造更高效的AI加速卡资源调度框架。支持从单卡到多卡集群的灵活资源管理,适合AI训练和推理场景部署。
      Go
      Apache License 2.0
      44030Updated Mar 20, 2025Mar 20, 2025
    • Integration testing of different accelerators with PyTorch
      Python
      BSD 3-Clause "New" or "Revised" License
      1084Updated Mar 19, 2025Mar 19, 2025
    • splitter

      Public
      Tool for splitting software packages into smaller components.
      Apache License 2.0
      1000Updated Mar 6, 2025Mar 6, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      6.4k100Updated Feb 12, 2025Feb 12, 2025
    • vllm-ascend

      Public archive
      Python
      Apache License 2.0
      61200Updated Feb 5, 2025Feb 5, 2025
    • Dockerfile
      Apache License 2.0
      1511Updated Jan 13, 2025Jan 13, 2025
    • Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      24k001Updated Dec 9, 2024Dec 9, 2024
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      Apache License 2.0
      4.3k000Updated Nov 4, 2024Nov 4, 2024
    • torch_backend

      Public archive
      C++
      Other
      0380Updated Oct 21, 2024Oct 21, 2024
    • A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
      Python
      BSD 3-Clause "New" or "Revised" License
      9.6k011Updated Sep 20, 2024Sep 20, 2024
    • op-plugin

      Public
      C++
      Other
      0010Updated Jul 27, 2024Jul 27, 2024
    • FastChat

      Public
      An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
      Python
      Apache License 2.0
      4.7k000Updated Apr 23, 2024Apr 23, 2024
    • HTML
      21144Updated Apr 2, 2024Apr 2, 2024
    • .github

      Public
      0000Updated Jul 20, 2023Jul 20, 2023
    • Daily Ceph build and test on openEuler
      Apache License 2.0
      1000Updated Jul 20, 2023Jul 20, 2023
    • ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
      4100Updated Jul 20, 2023Jul 20, 2023