dataset
Here are 10,215 public repositories matching this topic...
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Updated May 13, 2024 - Python
Updated May 13, 2024 - Jupyter Notebook
Automated Data Collection: COVID-19/SARS-COV-2 Cases in EU by Country, State/Province/Local Authorities, and Date
Updated May 13, 2024 - HTML
📥 Archive for data from mcbroken.com.
Updated May 13, 2024
TLDR 2 (TLD Records 2) is a continually updated DNS archive of zone transfer attempts against all existing TLD nameservers as well as the root servers.
Updated May 13, 2024 - Python
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Updated May 13, 2024 - TypeScript
Dataframes library for data analysis, providing an interface for easy data access, manipulation, and calculation. Data is stored in memory (native driver) or in external storage (SQL drivers or others), so the library can also be used seamlessly as a connector. Exports and imports in popular formats (CSV, spreadsheet, JSON...) are also supported.
Updated May 13, 2024 - PHP
Dataset helper program that automatically selects, rescales, and tags datasets (composed of images and text) for machine learning training.
Updated May 13, 2024 - C#
A library for managing your connections to different data sources. Still in alpha; please be patient.
Updated May 13, 2024 - C#
Application for managing your different data sources. Still in alpha; please be patient.
Updated May 13, 2024 - C#
Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for the evaluation of LLMs (for example ChatGPT, LLaMA, GLM, and Baichuan).
Updated May 13, 2024
Knowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Updated May 13, 2024 - Java
Colour Science for Python
Updated May 13, 2024 - Python
Official code and dataset repository of KoBBQ (TACL 2024)
Updated May 13, 2024 - Python
Updated May 13, 2024
Free and open source code of the https://tournesol.app platform. Meet the community on Discord https://discord.gg/WvcSG55Bf3
Updated May 13, 2024 - Python
The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://github.com/pandorgan/APT-36K
Updated May 13, 2024 - Python