Skip to content

Actions: huggingface/datatrove

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
863 workflow runs
863 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

updated work_tokenizer assignments and added burmese
Secret Leaks #161: Commit a2ceb48 pushed by guipenedo
November 27, 2024 15:11 22s multilingual
November 27, 2024 15:11 22s
[fixbug]: Fixed the issue in MinhashBuildIndex where get_datafolder w…
Secret Leaks #160: Commit fe81883 pushed by guipenedo
November 27, 2024 14:55 24s main
November 27, 2024 14:55 24s
[fixbug]: Fixed the issue in MinhashBuildIndex where get_datafolder w…
Test & Check Code Quality #332: Commit fe81883 pushed by guipenedo
November 27, 2024 14:55 1m 55s main
November 27, 2024 14:55 1m 55s
[fixbug]: Fixed the issue in MinhashBuildIndex where get_datafolder w…
Test & Check Code Quality #331: Pull request #307 opened by Youggls
November 27, 2024 14:53 1m 54s Youggls:main
November 27, 2024 14:53 1m 54s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #330: Pull request #285 synchronize by guipenedo
November 27, 2024 09:30 2m 34s multilingual
November 27, 2024 09:30 2m 34s
network limiting
Secret Leaks #159: Commit cf4668a pushed by guipenedo
November 27, 2024 09:30 17s multilingual
November 27, 2024 09:30 17s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #329: Pull request #285 synchronize by guipenedo
November 27, 2024 09:28 2m 54s multilingual
November 27, 2024 09:28 2m 54s
network limiting
Secret Leaks #158: Commit ee313a4 pushed by guipenedo
November 27, 2024 09:28 17s multilingual
November 27, 2024 09:28 17s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #328: Pull request #285 synchronize by guipenedo
November 26, 2024 18:22 2m 47s multilingual
November 26, 2024 18:22 2m 47s
giving up. just printing now
Secret Leaks #157: Commit c1ba400 pushed by guipenedo
November 26, 2024 18:22 19s multilingual
November 26, 2024 18:22 19s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #327: Pull request #285 synchronize by guipenedo
November 26, 2024 18:18 2m 41s multilingual
November 26, 2024 18:18 2m 41s
giving up. just printing now
Secret Leaks #156: Commit 04ca4d5 pushed by guipenedo
November 26, 2024 18:18 18s multilingual
November 26, 2024 18:18 18s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #326: Pull request #285 synchronize by guipenedo
November 26, 2024 18:02 2m 55s multilingual
November 26, 2024 18:02 2m 55s
stupid logspath
Secret Leaks #155: Commit da5e004 pushed by guipenedo
November 26, 2024 18:02 17s multilingual
November 26, 2024 18:02 17s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #325: Pull request #285 synchronize by guipenedo
November 26, 2024 17:56 2m 40s multilingual
November 26, 2024 17:56 2m 40s
revert
Secret Leaks #154: Commit f4cd9ac pushed by guipenedo
November 26, 2024 17:55 18s multilingual
November 26, 2024 17:55 18s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #324: Pull request #285 synchronize by guipenedo
November 26, 2024 17:47 2m 48s multilingual
November 26, 2024 17:47 2m 48s
GIVE ME MY PROGRESS BARS GOD DAMN IT
Secret Leaks #153: Commit 3f8d060 pushed by guipenedo
November 26, 2024 17:47 19s multilingual
November 26, 2024 17:47 19s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #323: Pull request #285 synchronize by guipenedo
November 26, 2024 17:32 2m 43s multilingual
November 26, 2024 17:32 2m 43s
GIVE ME MY PROGRESS BARS GOD DAMN IT
Secret Leaks #152: Commit c3ddde6 pushed by guipenedo
November 26, 2024 17:32 21s multilingual
November 26, 2024 17:32 21s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #322: Pull request #285 synchronize by guipenedo
November 26, 2024 17:31 2m 52s multilingual
November 26, 2024 17:31 2m 52s
GIVE ME MY PROGRESS BARS GOD DAMN IT
Secret Leaks #151: Commit 339566b pushed by guipenedo
November 26, 2024 17:31 23s multilingual
November 26, 2024 17:31 23s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #321: Pull request #285 synchronize by guipenedo
November 26, 2024 17:09 2m 31s multilingual
November 26, 2024 17:09 2m 31s
1 sec
Secret Leaks #150: Commit 9250ccf pushed by guipenedo
November 26, 2024 17:09 17s multilingual
November 26, 2024 17:09 17s
FineWeb-2: multilingual, numpy 2.0, minhash improvements
Test & Check Code Quality #320: Pull request #285 synchronize by guipenedo
November 26, 2024 17:07 2m 54s multilingual
November 26, 2024 17:07 2m 54s