Releases: NVIDIA/NeMo-Curator
Releases · NVIDIA/NeMo-Curator
NVIDIA NeMo Curator 0.7.0rc2.dev0
Prerelease: NVIDIA NeMo Curator 0.7.0rc2.dev0 (2025-02-25)
NVIDIA NeMo Curator 0.7.0rc1.dev1
Prerelease: NVIDIA NeMo Curator 0.7.0rc1.dev1 (2025-02-19)
NVIDIA NeMo Curator 0.7.0rc0.dev1
Prerelease: NVIDIA NeMo Curator 0.7.0rc0.dev1 (2025-02-04)
NVIDIA NeMo Curator 0.6.0
What's changed
- Synthetic Data Generation for Text Retrieval
- LLM-based Filters
- Easiness
- Answerability
- Q&A Retrieval Generation Pipeline
- LLM-based Filters
- Parallel Dataset Curation for Machine Translation
- Load/Write Bitext Files
- Heuristic filtering (Histogram, Length Ratio)
- Classifier filtering (Comet, Cometoid)
NVIDIA NeMo Curator 0.6.0rc2.dev1
Prerelease: NVIDIA NeMo Curator 0.6.0rc2.dev1 (2025-01-03)
NVIDIA NeMo Curator 0.6.0rc1.dev1
Prerelease: NVIDIA NeMo Curator 0.6.0rc1.dev1 (2024-12-20)
v0.6.0rc0
v0.6.0rc0
v0.5.1
v0.5.0
Highlights
- Image Curation
- Image Embedding Creation
- Aesthetic Classifier
- NSFW Classifier
- Semantic Deduplication
- Text Curation
- Quality Classifier
- Aegis Classifier
- FineWeb-Edu Classifier
Full Changelog: https://github.com/NVIDIA/NeMo-Curator/commits/v0.5.0