cosdt

All

19 repositories

ascend-dra-driver
Public
昇腾（Ascend）DRA驱动，专为华为昇腾AI处理器设计的Kubernetes动态资源分配驱动实现。欢迎社区开发者使用、贡献和改进，共同打造更高效的AI加速卡资源调度框架。支持从单卡到多卡集群的灵活资源管理，适合AI训练和推理场景部署。
Go
•
Apache License 2.0
•74•2•1•2•Updated Oct 14, 2025Oct 14, 2025
pytorch-integration-tests
Public
Integration testing of different accelerators with PyTorch
Python
•
BSD 3-Clause "New" or "Revised" License
•1•0•10•7•Updated Aug 12, 2025Aug 12, 2025
elastic-tool
Public
A cli tool to interaction with elasticsearch
Python
•
Apache License 2.0
•0•0•0•0•Updated Jul 1, 2025Jul 1, 2025
onnxruntime
Public
Shell
•0•0•4•0•Updated May 26, 2025May 26, 2025
llama.cpp
Public
Shell
•1•0•5•2•Updated May 9, 2025May 9, 2025
vllm-benchmarks
Public
run vllm benchmarks
Python
•
Apache License 2.0
•0•0•1•0•Updated Feb 28, 2025Feb 28, 2025
vllm
Public
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
•
Apache License 2.0
•11k•1•0•0•Updated Feb 12, 2025Feb 12, 2025
vllm-ascend
Public archive
See vLLM official support: https://github.com/vllm-project/vllm-ascend
Python
•
Apache License 2.0
•4•11•0•0•Updated Feb 5, 2025Feb 5, 2025
dockerfiles
Public
Dockerfile
•
Apache License 2.0
•1•5•1•1•Updated Jan 13, 2025Jan 13, 2025
pytorch-upstream
Public
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python
•
Other
•26k•0•0•1•Updated Dec 9, 2024Dec 9, 2024
DeepSpeed
Public
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python
•
Apache License 2.0
•4.6k•0•0•0•Updated Nov 4, 2024Nov 4, 2024
torch_backend
Public archive
C++
•
Other
•0•3•8•0•Updated Oct 21, 2024Oct 21, 2024
pytorch-examples
Public
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Python
•
BSD 3-Clause "New" or "Revised" License
•9.8k•0•1•1•Updated Sep 20, 2024Sep 20, 2024
op-plugin
Public
C++
•
Other
•0•0•1•0•Updated Jul 27, 2024Jul 27, 2024
FastChat
Public
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python
•
Apache License 2.0
•4.8k•0•0•0•Updated Apr 23, 2024Apr 23, 2024
cosdt.github.io
Public
https://cosdt.github.io
HTML
•2•1•16•4•Updated Apr 2, 2024Apr 2, 2024
.github
Public
0•0•0•0•Updated Jul 20, 2023Jul 20, 2023
ceph-openEuler-CI
Public
Daily Ceph build and test on openEuler
Apache License 2.0
•1•0•0•0•Updated Jul 20, 2023Jul 20, 2023
onnxruntime-ascend-CI
Public
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
4•1•0•0•Updated Jul 20, 2023Jul 20, 2023