Issues: open-compass/opencompass

Issues list

[Bug] F1-score file found error
#1989 opened Mar 31, 2025 by GenerallyCovetous
2 tasks done
[Bug] Multi-GPU test of llama-3-8b-vllm gives an accuracy of 0
#1979 opened Mar 27, 2025 by normlpl
2 tasks done
[Feature] Add the benbench evaluation set
#1975 opened Mar 25, 2025 by linbeyoung
1 task
[Bug] If-eval deepseek-r1 score may not be correct
#1949 opened Mar 17, 2025 by MeJerry215
2 tasks done
[Feature] LLM evaluation on aime2025
#1946 opened Mar 14, 2025 by GaoJieCN
1 task done
[Bug] Deepseek-R1 results are too low on humaneval
#1943 opened Mar 13, 2025 by Gannn12138
2 tasks done
[Bug] results are empty for commonsense_qa and strategyqa
#1941 opened Mar 13, 2025 by gxlover0625
2 tasks done
[Bug] Parameter mismatch during VLLM inference
#1935 opened Mar 12, 2025 by c-box
2 tasks done
[Bug] Regex extraction of mmlu_pro results fails
#1933 opened Mar 11, 2025 by wangzhaode
2 tasks done
[Bug] LiveCodeBench evaluation raises an error
#1930 opened Mar 10, 2025 by AllenShow
2 tasks done
[Bug] math_500_gen evaluation raises an error
#1929 opened Mar 10, 2025 by AllenShow
2 tasks done