Lists (3)
Sort Name ascending (A-Z)
Stars
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Implementation of DPO and the paper: Self-Rewarding Language Model (Unofficial)
Retrieve dialogues from Reddit and generate an empathetic conversation dataset.
A quick guide (especially) for trending instruction finetuning datasets
Conversational RPA SDK for Chatbot Makers. Join our Discord: https://discord.gg/7q8NBZbQzt
TigerBot: A multi-language multi-task LLM
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈
Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Interpretable unified language safety checking with large language models
Datasets collection and preprocessings framework for NLP extreme multitask learning
State-of-the-Art Text Embeddings
A flexible Federated Learning Framework based on PyTorch, simplifying your Federated Learning research.
Markdown-formatted Creative Commons licenses
An open source implementation of CLIP.
PyTorch deep learning projects made easy.
OpenMMLab Detection Toolbox and Benchmark
Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)