Issue search results

Filter by

26 results

(52 ms)inOpenBMB/InfiniteBench (press backspace or delete to remove)

OpenBMB/InfiniteBench
Suprisingly low perf on code.Debug

Thanks for the work on this benchmark. I was wondering why the baseline accuracies on code.Debug are so low. de.Debug | 37.06% | 5% | 17.77% | 5% | 9.14% | 13.96% | 7.36% Since it s multiple choice ...

seyuboglu

Opened
on Jan 9

OpenBMB/InfiniteBench
How to deal with the generate response is None, which the model directly generate the <im_end> token.

I hope this message find you well. When I use the InfiniteBench to compute score, if the response of the model is None , it will raise an mistake,therefore, in our result table with the results of gpt4 ...

unicorneeee

Opened
on Dec 11, 2024

OpenBMB/InfiniteBench
[Question] 请问repo首页的examples size是最新的吗？

如题，请问repo首页的examples size是最新的吗？我从huggingface下载了infinitebench数据集，发现数据集有些文件的size和repo首页写的size对不上，比如longbook_sum_eng.jsonl里面有148个examples, 首页上写的是 En.Sum #examples 103. 是这期间数据集进行了更改吗，还是有其他我理解错了的对方？期待回复~ ...

lepangdan

Opened
on Dec 5, 2024

OpenBMB/InfiniteBench
样本输入截断

我看到eval_yarn_mistral.py中可以对输入长度进行截断，请问假如截断到64k，32k或更小，是否有将正确答案所在位置截去的可能（例如kv_retrieval中正确的键值对）。期待您的回复，谢谢！

Dori-Nilou

Opened
on Dec 1, 2024

OpenBMB/InfiniteBench
Support LLaMA Model

Thanks for your great work! Could you kindly advise on how to support the models in the LLaMA series?

ydyhello

Opened
on Nov 25, 2024

OpenBMB/InfiniteBench
found some annotation errors in 'longbook_qa_chn'

ID 41，42 的内容是斗破苍穹的内容，但是context里面并没有 “ 萧峰”这个人物出现。是entity的替换没有放到context里面还是问题本身写错了？

Zeyu1994

Opened
on Oct 27, 2024

OpenBMB/InfiniteBench
数据标注问题

请问咱们针对小说的问答数据集，问题和答案都是怎么获得的？纯人工标注的吗？（我看论文中没有明确提到这一点，麻烦指教🙏

ktlKTL

Opened
on Sep 24, 2024

OpenBMB/InfiniteBench
Mismatch for longbook_qa_eng

Are the GPT4 results evaluated on a different set of longbook_qa_eng? The ground_truth fields in results/gpt4/preds_longbook_qa_eng.jsonl don t seem match with ground_truth in results/chatglm3/preds_longbook_qa_eng.jsonl ...

xuandif-cmu

Opened
on Aug 20, 2024

OpenBMB/InfiniteBench
Error in loading from Huggingface

When I try to run the following code in colab: from datasets import load_dataset dataset = load_dataset( xinrongzhang2022/InfiniteBench ) I get the following error: DatasetGenerationCastError: An error ...

BenHamm

Opened
on Jul 25, 2024

OpenBMB/InfiniteBench
bug in computing scores for longdialogue_qa_eng

https://github.com/OpenBMB/InfiniteBench/blob/main/src/compute_scores.py#L238 1. only one reference label is used for comparison, better loop around each answer in label, e.g., label=[ ECKER , COMMANDER ...

Xianchao-Wu

Opened
on Jul 17, 2024

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues

ProTip!

Press the

key to activate the search input again and adjust your query.

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues

ProTip!

Press the

key to activate the search input again and adjust your query.

Languages

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Filter by

State

Advanced

OpenBMB/InfiniteBench
Suprisingly low perf on code.Debug

OpenBMB/InfiniteBench
How to deal with the generate response is None, which the model directly generate the <im_end> token.

OpenBMB/InfiniteBench
[Question] 请问repo首页的examples size是最新的吗？

OpenBMB/InfiniteBench
样本输入截断

OpenBMB/InfiniteBench
Support LLaMA Model

OpenBMB/InfiniteBench
found some annotation errors in 'longbook_qa_chn'

OpenBMB/InfiniteBench
数据标注问题

OpenBMB/InfiniteBench
Mismatch for longbook_qa_eng

OpenBMB/InfiniteBench
Error in loading from Huggingface

OpenBMB/InfiniteBench
bug in computing scores for longdialogue_qa_eng

Learn how you can use GitHub Issues to plan and track your work.

Learn how you can use GitHub Issues to plan and track your work.

issues Search Results · repo:OpenBMB/InfiniteBench language:Python

Filter by

State

Advanced

26 results

Learn how you can use GitHub Issues to plan and track your work.

Learn how you can use GitHub Issues to plan and track your work.