Skip to content

Actions: huggingface/lighteval

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,807 workflow runs
1,807 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

llm_as_a_judge_for_oallv2_arabic (#498)
Tests #2049: Commit 620873b pushed by clefourrier
January 23, 2025 07:24 38m 34s main
January 23, 2025 07:24 38m 34s
Relax upper bound on torch (#508)
Tests #2048: Commit 5f5bed5 pushed by clefourrier
January 23, 2025 07:20 42m 9s main
January 23, 2025 07:20 42m 9s
Add custom tasks for evaluation of french models
Tests #2047: Pull request #505 synchronize by mdiazmel
January 22, 2025 16:39 38m 46s mdiazmel:main
January 22, 2025 16:39 38m 46s
Relax upper bound on torch
Tests #2045: Pull request #508 opened by lewtun
January 22, 2025 10:58 43m 34s lewtun-patch-1-1
January 22, 2025 10:58 43m 34s
Add custom tasks for evaluation of french models
Tests #2044: Pull request #505 synchronize by mdiazmel
January 22, 2025 10:11 38m 54s mdiazmel:main
January 22, 2025 10:11 38m 54s
Add custom tasks for evaluation of french models
Tests #2043: Pull request #505 synchronize by mdiazmel
January 22, 2025 10:09 Action required mdiazmel:main
January 22, 2025 10:09 Action required
Translate task template to Catalan and Galician and fix typos (#506)
Tests #2042: Commit 1ae2fa2 pushed by clefourrier
January 22, 2025 10:01 38m 21s main
January 22, 2025 10:01 38m 21s
Add custom tasks for evaluation of french models
Tests #2039: Pull request #505 opened by mdiazmel
January 21, 2025 21:49 Action required mdiazmel:main
January 21, 2025 21:49 Action required
Improve readability of the quick tour.
Tests #2038: Pull request #501 synchronize by vxw3t8fhjsdkghvbdifuk
January 21, 2025 18:13 38m 45s vxw3t8fhjsdkghvbdifuk:patch-2
January 21, 2025 18:13 38m 45s
Made judge response processing more robust. (#491)
Tests #2032: Commit 0140578 pushed by clefourrier
January 20, 2025 14:41 38m 34s main
January 20, 2025 14:41 38m 34s
Add swiss legal evals as new community tasks
Tests #2031: Pull request #389 synchronize by rolshoven
January 20, 2025 12:39 Action required JoelNiklaus:add_swiss_legal_evals
January 20, 2025 12:39 Action required
January 20, 2025 09:03 38m 4s
Hotfix for litellm judge (#490)
Tests #2029: Commit fee2ec3 pushed by clefourrier
January 20, 2025 09:02 38m 1s main
January 20, 2025 09:02 38m 1s
Fixed issue with o1 in litellm. (#493)
Tests #2028: Commit 3b89734 pushed by clefourrier
January 20, 2025 09:02 37m 26s main
January 20, 2025 09:02 37m 26s
Fix math extraction (#503)
Tests #2027: Commit 90d44c1 pushed by clefourrier
January 18, 2025 17:57 39m 7s main
January 18, 2025 17:57 39m 7s
Fix math extraction
Tests #2026: Pull request #503 opened by hynky1999
January 17, 2025 19:22 40m 21s math_extraction
January 17, 2025 19:22 40m 21s
Add Doc Strings to Config Files
Tests #2025: Pull request #465 synchronize by ParagEkbote
January 17, 2025 15:55 Action required ParagEkbote:Document-Custom-Model-Files
January 17, 2025 15:55 Action required
Fix TGI (Text Generation Inference) Endpoint Inference and TGI JSON Grammar Generation
Tests #2024: Pull request #502 synchronize by cpcdoy
January 15, 2025 17:00 Action required cpcdoy:fix/tgi_inference
January 15, 2025 17:00 Action required
Extractive Match metric (#495)
Tests #2022: Commit 59624c8 pushed by hynky1999
January 15, 2025 10:19 41m 6s main
January 15, 2025 10:19 41m 6s
fix README link (#500)
Tests #2020: Commit a7aa6ed pushed by clefourrier
January 14, 2025 17:15 40m 55s main
January 14, 2025 17:15 40m 55s
fix link to wiki in README
Tests #2019: Pull request #500 opened by vxw3t8fhjsdkghvbdifuk
January 14, 2025 14:47 42m 39s vxw3t8fhjsdkghvbdifuk:patch-1
January 14, 2025 14:47 42m 39s