Skip to content

Actions: huggingface/datatrove

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
864 workflow runs
864 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

fix: root condition for SENTINEL (#349)
Test & Check Code Quality #410: Commit d8066af pushed by guipenedo
March 4, 2025 17:49 2m 51s main
March 4, 2025 17:49 2m 51s
fix: root condition for SENTINEL (#349)
Secret Leaks #216: Commit d8066af pushed by guipenedo
March 4, 2025 17:49 18s main
March 4, 2025 17:49 18s
[draft] Add chunking option to DocumentTokenizer
Test & Check Code Quality #407: Pull request #344 synchronize by craffel
February 14, 2025 17:17 2m 32s craffel:ignore_doc_ends_chunk
February 14, 2025 17:17 2m 32s
[draft] Add chunking option to DocumentTokenizer
Test & Check Code Quality #406: Pull request #344 synchronize by craffel
February 12, 2025 21:46 2m 58s craffel:ignore_doc_ends_chunk
February 12, 2025 21:46 2m 58s
[draft] Add chunking option to DocumentTokenizer
Test & Check Code Quality #405: Pull request #344 synchronize by craffel
February 12, 2025 21:43 2m 49s craffel:ignore_doc_ends_chunk
February 12, 2025 21:43 2m 49s
[draft] Add chunking option to DocumentTokenizer
Test & Check Code Quality #404: Pull request #344 opened by craffel
February 12, 2025 21:40 2m 52s craffel:ignore_doc_ends_chunk
February 12, 2025 21:40 2m 52s
Revert "Add chunking option to DocumentTokenizer (#342)" (#343)
Test & Check Code Quality #403: Commit b94d771 pushed by guipenedo
February 12, 2025 21:39 2m 38s main
February 12, 2025 21:39 2m 38s
Revert "Add chunking option to DocumentTokenizer (#342)" (#343)
Secret Leaks #215: Commit b94d771 pushed by guipenedo
February 12, 2025 21:39 15s main
February 12, 2025 21:39 15s
Revert "[draft] Add chunking option to DocumentTokenizer"
Test & Check Code Quality #402: Pull request #343 opened by guipenedo
February 12, 2025 21:39 5m 8s revert-342-ignore_doc_ends_chunk
February 12, 2025 21:39 5m 8s
Add chunking option to DocumentTokenizer (#342)
Secret Leaks #213: Commit c9806d2 pushed by guipenedo
February 12, 2025 21:19 17s main
February 12, 2025 21:19 17s
Add chunking option to DocumentTokenizer (#342)
Test & Check Code Quality #401: Commit c9806d2 pushed by guipenedo
February 12, 2025 21:19 2m 48s main
February 12, 2025 21:19 2m 48s
[draft] Add chunking option to DocumentTokenizer
Test & Check Code Quality #400: Pull request #342 opened by craffel
February 12, 2025 21:15 3m 25s craffel:ignore_doc_ends_chunk
February 12, 2025 21:15 3m 25s
bugfix for IS_ALPHA
Secret Leaks #212: Commit b9fb72a pushed by guipenedo
January 30, 2025 14:49 16s main
January 30, 2025 14:49 16s
bugfix for IS_ALPHA
Test & Check Code Quality #399: Commit b9fb72a pushed by guipenedo
January 30, 2025 14:49 2m 41s main
January 30, 2025 14:49 2m 41s
adds revision in hf upload
Secret Leaks #211: Commit b3daef2 pushed by guipenedo
January 30, 2025 14:45 22s main
January 30, 2025 14:45 22s
adds revision in hf upload
Test & Check Code Quality #398: Commit b3daef2 pushed by guipenedo
January 30, 2025 14:45 2m 39s main
January 30, 2025 14:45 2m 39s
Allow custom parquet schema (#330)
Secret Leaks #210: Commit b105dcd pushed by guipenedo
January 30, 2025 10:10 17s main
January 30, 2025 10:10 17s
Allow custom parquet schema (#330)
Test & Check Code Quality #397: Commit b105dcd pushed by guipenedo
January 30, 2025 10:10 2m 59s main
January 30, 2025 10:10 2m 59s
fixes stopwors implementation... (#329)
Secret Leaks #209: Commit f9bbe09 pushed by guipenedo
January 30, 2025 08:00 18s main
January 30, 2025 08:00 18s
fixes stopwors implementation... (#329)
Test & Check Code Quality #396: Commit f9bbe09 pushed by guipenedo
January 30, 2025 08:00 3m 0s main
January 30, 2025 08:00 3m 0s
Allow custom parquet schema
Test & Check Code Quality #394: Pull request #330 synchronize by BramVanroy
January 28, 2025 18:40 2m 36s BramVanroy:main
January 28, 2025 18:40 2m 36s
Allow custom parquet schema
Test & Check Code Quality #391: Pull request #330 opened by BramVanroy
January 26, 2025 13:51 2m 35s BramVanroy:main
January 26, 2025 13:51 2m 35s
fixes stopwors implementation
Test & Check Code Quality #390: Pull request #329 opened by guipenedo
January 26, 2025 12:10 2m 58s stopwords_set
January 26, 2025 12:10 2m 58s
Add customization for fetching SLURM job id (#320)
Secret Leaks #208: Commit 0c3df50 pushed by guipenedo
January 24, 2025 13:06 21s main
January 24, 2025 13:06 21s