Skip to content
Change the repository type filter

All

    Repositories list

    • 昇腾(Ascend)DRA驱动,专为华为昇腾AI处理器设计的Kubernetes动态资源分配驱动实现。欢迎社区开发者使用、贡献和改进,共同打造更高效的AI加速卡资源调度框架。支持从单卡到多卡集群的灵活资源管理,适合AI训练和推理场景部署。
      Go
      74212Updated Oct 14, 2025Oct 14, 2025
    • Integration testing of different accelerators with PyTorch
      Python
      10107Updated Aug 12, 2025Aug 12, 2025
    • A cli tool to interaction with elasticsearch
      Python
      0000Updated Jul 1, 2025Jul 1, 2025
    • Shell
      0040Updated May 26, 2025May 26, 2025
    • llama.cpp

      Public
      Shell
      1052Updated May 9, 2025May 9, 2025
    • run vllm benchmarks
      Python
      0010Updated Feb 28, 2025Feb 28, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      11k100Updated Feb 12, 2025Feb 12, 2025
    • vllm-ascend

      Public archive
      Python
      41100Updated Feb 5, 2025Feb 5, 2025
    • Dockerfile
      1511Updated Jan 13, 2025Jan 13, 2025
    • Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      26k001Updated Dec 9, 2024Dec 9, 2024
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      4.6k000Updated Nov 4, 2024Nov 4, 2024
    • torch_backend

      Public archive
      C++
      0380Updated Oct 21, 2024Oct 21, 2024
    • A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
      Python
      9.8k011Updated Sep 20, 2024Sep 20, 2024
    • op-plugin

      Public
      C++
      0010Updated Jul 27, 2024Jul 27, 2024
    • FastChat

      Public
      An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
      Python
      4.8k000Updated Apr 23, 2024Apr 23, 2024
    • HTML
      21164Updated Apr 2, 2024Apr 2, 2024
    • .github

      Public
      0000Updated Jul 20, 2023Jul 20, 2023
    • Daily Ceph build and test on openEuler
      1000Updated Jul 20, 2023Jul 20, 2023
    • ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
      4100Updated Jul 20, 2023Jul 20, 2023