Actions: huggingface/lighteval

Tests

1,807 workflow runs

Add swiss legal evals as new community tasks
Tests #2016: Pull request #389 synchronize by JoelNiklaus
January 13, 2025 19:06 Action required JoelNiklaus:add_swiss_legal_evals
Extractive Match metric
Tests #2014: Pull request #495 synchronize by hynky1999
January 13, 2025 14:09 39m 15s math_extraction
Extractive Match metric
Tests #2013: Pull request #495 synchronize by hynky1999
January 13, 2025 14:08 39m 40s math_extraction
Extractive Match metric
Tests #2012: Pull request #495 synchronize by hynky1999
January 13, 2025 13:14 38m 7s math_extraction
llm_as_a_judge_for_oallv2_arabic
Tests #2011: Pull request #498 opened by Manel-Hik
January 13, 2025 11:30 38m 0s Manel-Hik:main
Add swiss legal evals as new community tasks
Tests #2010: Pull request #389 synchronize by JoelNiklaus
January 13, 2025 05:35 Action required JoelNiklaus:add_swiss_legal_evals
Initial proposal for model lazy loading
Tests #2009: Pull request #497 opened by JoelNiklaus
January 11, 2025 21:15 Action required JoelNiklaus:lazy-load-model-init
Extractive Match metric
Tests #2008: Pull request #495 opened by hynky1999
January 11, 2025 19:03 41m 6s math_extraction
Added custom model inference.
Tests #2007: Pull request #437 synchronize by JoelNiklaus
January 11, 2025 18:31 Action required JoelNiklaus:add-custom-model
Add Doc Strings to Config Files
Tests #2005: Pull request #465 synchronize by ParagEkbote
January 11, 2025 14:41 Action required ParagEkbote:Document-Custom-Model-Files
Add swiss legal evals as new community tasks
Tests #2000: Pull request #389 synchronize by JoelNiklaus
January 10, 2025 18:13 Action required JoelNiklaus:add_swiss_legal_evals
Add swiss legal evals as new community tasks
Tests #1999: Pull request #389 synchronize by JoelNiklaus
January 10, 2025 16:55 Action required JoelNiklaus:add_swiss_legal_evals
Fixed issue with o1 in litellm.
Tests #1998: Pull request #493 opened by JoelNiklaus
January 10, 2025 02:10 40m 58s JoelNiklaus:fix-o1-litellm
Add swiss legal evals as new community tasks
Tests #1995: Pull request #389 synchronize by JoelNiklaus
January 7, 2025 18:14 Action required JoelNiklaus:add_swiss_legal_evals
Tests
Tests #1993: by clefourrier
January 7, 2025 15:20 39m 24s main
Hotfix for litellm judge
Tests #1992: Pull request #490 synchronize by JoelNiklaus
January 7, 2025 15:17 37m 41s JoelNiklaus:fix-litellm-judge
Hotfix for litellm judge
Tests #1990: Pull request #490 synchronize by JoelNiklaus
January 7, 2025 14:57 42m 9s JoelNiklaus:fix-litellm-judge
Fix T_co import bug
Tests #1989: Pull request #484 synchronize by gucci-j
January 7, 2025 13:51 38m 30s gucci-j:fix-tco
feat: add JGLUE tasks
Tests #1987: Pull request #469 synchronize by ryan-minato
January 7, 2025 09:10 Action required ryan-minato:jglue
feat: add JGLUE tasks
Tests #1986: Pull request #469 synchronize by ryan-minato
January 7, 2025 09:08 Action required ryan-minato:jglue
Made litellm judge backend more robust. (#485)
Tests #1985: Commit fdb12f4 pushed by clefourrier
January 7, 2025 08:24 37m 57s main