A topic-centric list of HQ open datasets.
-
Updated
Apr 18, 2024
A topic-centric list of HQ open datasets.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
(CGCSTCD'2017) An easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations. CGCSTCD = China Graduate Contest on Smart-city Technology and Creative Design
Label Studio is a multi-type data labeling and annotation tool with standardized output format
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Open source annotation tool for machine learning practitioners.
FL Chart is a highly customizable Flutter chart library that supports Line Chart, Bar Chart, Pie Chart, Scatter Chart, and Radar Chart.
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Machine learning datasets used in tutorials on MachineLearningMastery.com
pix2code: Generating Code from a Graphical User Interface Screenshot
Techniques for deep learning with satellite & aerial imagery
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
搜索所有中文NLP数据集,附常用英文NLP数据集
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
A large collection of system log datasets for AI-driven log analytics [ISSRE'23]
🪐 End-to-end NLP workflows from prototype to production
✏️ Web-based image segmentation tool for object detection, localization, and keypoints
Benchmark datasets, data loaders, and evaluators for graph machine learning
Add a description, image, and links to the datasets topic page so that developers can more easily learn about it.
To associate your repository with the datasets topic, visit your repo's landing page and select "manage topics."