Fudan University
220 Handan Road, Shanghai
Shanghai, China.
- Large Language Models
- Fault tolerance
- Multimodal Training system
Fudan University
220 Handan Road, Shanghai
Shanghai, China.
This repository based by Mellanox/gpu_direct_rdma_access. Some errors in the code have been modified, some methods have been optimized, and some features have been added
Forked from HRNPH/AIwaifu
Open-Waifu open-sourced finetunable customizable simpable AI waifu inspired by neuro-sama
Python 1
An upgraded version of MujicaChk, which supports fast checkpoints for multiple nodes
Python
This Description based by gpu_dircect_rdma_access. Hoping provide some tools to python gdr access. More information in MujicaChk