Actions: huggingface/lighteval

Tests

1,807 workflow runs

Add swiss legal evals as new community tasks
Tests #2075: Pull request #389 synchronize by JoelNiklaus
January 27, 2025 13:21 Action required JoelNiklaus:add_swiss_legal_evals
Add custom task (bac-fr) for evaluation of models in french
Tests #2074: Pull request #518 synchronize by mdiazmel
January 27, 2025 12:52 Action required mdiazmel:main
Pass@k
Tests #2073: Pull request #519 synchronize by clefourrier
January 27, 2025 09:02 38m 30s clem_pass_at_k
Pass@k
Tests #2072: Pull request #519 synchronize by clefourrier
January 27, 2025 08:55 2m 59s clem_pass_at_k
Pass@k
Tests #2071: Pull request #519 synchronize by clefourrier
January 27, 2025 08:50 3m 0s clem_pass_at_k
Pass@k
Tests #2070: Pull request #519 opened by clefourrier
January 27, 2025 08:44 3m 1s clem_pass_at_k
Add custom task (bac-fr) for evaluation of models in french
Tests #2069: Pull request #518 opened by mdiazmel
January 27, 2025 08:04 Action required mdiazmel:main
Fixing commonsense qa: generative metrics, -1 gen length (#517)
Tests #2068: Commit cb075a5 pushed by clefourrier
January 26, 2025 17:18 38m 42s main
Fixing commonsense qa: generative metrics, -1 gen length
Tests #2067: Pull request #517 opened by clefourrier
January 26, 2025 12:57 38m 15s clefourrier-patch-3
Fix Ukrainian indices and confirmation word (#516)
Tests #2066: Commit 499cc82 pushed by clefourrier
January 26, 2025 11:04 40m 22s main
Fix Ukrainian indices and confirmation word
Tests #2065: Pull request #516 opened by ayukh
January 25, 2025 18:30 38m 22s ayukh:main
Fixed bug of import url_to_fs from fsspec (#507) (#512)
Tests #2063: Commit 4f381b3 pushed by clefourrier
January 24, 2025 10:37 39m 57s main
Improve readability of the quick tour.
Tests #2059: Pull request #501 synchronize by vxw3t8fhjsdkghvbdifuk
January 23, 2025 17:59 44m 9s vxw3t8fhjsdkghvbdifuk:patch-2
Add community task specific for french
Tests #2058: Pull request #511 opened by mdiazmel
January 23, 2025 16:07 Action required mdiazmel:main
Improve readability of the quick tour.
Tests #2056: Pull request #501 synchronize by vxw3t8fhjsdkghvbdifuk
January 23, 2025 14:27 38m 13s vxw3t8fhjsdkghvbdifuk:patch-2
Bump up the latex2sympy2_extended version + more tests (#510)
Tests #2055: Commit 0ab63d0 pushed by hynky1999
January 23, 2025 13:02 43m 29s main
Bump up the latex2sympy2_extended version + more tests
Tests #2054: Pull request #510 opened by hynky1999
January 23, 2025 11:55 42m 25s math_extraction
Support custom results/details push to hub (#457)
Tests #2053: Commit c82143a pushed by clefourrier
January 23, 2025 10:48 38m 22s main
Add custom tasks for evaluation of french models (#505)
Tests #2052: Commit 7028af3 pushed by clefourrier
January 23, 2025 08:24 38m 48s main
Config fixes for VLLMModel
Tests #2051: Pull request #472 synchronize by clefourrier
January 23, 2025 08:04 39m 51s anton-l:vllm_quick_fixes