-
Notifications
You must be signed in to change notification settings - Fork 105
Pull requests: NVIDIA/NeMo-Curator
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Hard negative mining for Retriever fine-tuning
#523
opened Feb 5, 2025 by
vinay-raman
Loading…
3 tasks done
Add support for Nemotron-CC quality classifiers
#518
opened Feb 4, 2025 by
sarahyurick
•
Draft
10 of 14 tasks
Add improved cleaning methods from Nemotron-CC
#517
opened Feb 4, 2025 by
ryantwolf
Loading…
3 tasks done
Update fuzzy deduplication section of tutorials to skip false positive check (where applicable)
#511
opened Feb 3, 2025 by
ayushdg
Loading…
1 of 3 tasks
Removal logic for fuzzy / exact (no class abstraction)
gpuci
Run GPU CI/CD on PR
#509
opened Jan 31, 2025 by
praateekmahajan
Loading…
3 tasks
Update model nomenclature
documentation
Improvements or additions to documentation
#497
opened Jan 24, 2025 by
sarahyurick
Loading…
Clean up Pandas, cuDF, Dask, and Dask-cuDF Run GPU CI/CD on PR
DocumentDataset
type logic
gpuci
#494
opened Jan 23, 2025 by
sarahyurick
Loading…
Add Pooling Strategy Option for embedding creation
gpuci
Run GPU CI/CD on PR
#491
opened Jan 20, 2025 by
VibhuJawa
Loading…
Standardize Run GPU CI/CD on PR
text_field
and id_field
terminology
gpuci
#485
opened Jan 17, 2025 by
sarahyurick
Loading…
Minor CrossFit improvements
gpuci
Run GPU CI/CD on PR
#483
opened Jan 16, 2025 by
sarahyurick
Loading…
Add Run GPU CI/CD on PR
nemo-toolkit
dependency to gpuCI
gpuci
#480
opened Jan 10, 2025 by
sarahyurick
Loading…
Enable ADD ID to work with CPU/GPU both
gpuci
Run GPU CI/CD on PR
#479
opened Jan 10, 2025 by
VibhuJawa
Loading…
Support
dask_expr
migration into dask.dataframe
#477
opened Jan 9, 2025 by
rjzamora
Loading…
3 tasks
Update
get_all_files_paths_under
examples to include keep_extensions
#450
opened Dec 20, 2024 by
sarahyurick
Loading…
[WIP] Add RAPIDS Nightly to GPU CI
gpuci
Run GPU CI/CD on PR
#436
opened Dec 17, 2024 by
praateekmahajan
•
Draft
3 tasks
Bump nltk from 3.8.1 to 3.9 in /tutorials/dapt-curation/code
dependencies
Pull requests that update a dependency file
#429
opened Dec 13, 2024 by
dependabot
bot
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.