Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Language capabilities #247

Open
janEbert opened this issue Jan 8, 2025 · 0 comments
Open

Language capabilities #247

janEbert opened this issue Jan 8, 2025 · 0 comments

Comments

@janEbert
Copy link

janEbert commented Jan 8, 2025

Congrats on the amazing model release! I've been unable to find which languages you actually consider "supported" for the model, since the paper just mentions that you were "expanding multilingual coverage beyond English and Chinese" without further data. If you don't have a concrete answer to that question, maybe you can respond which languages the model has been instruction-tuned with. If you can open up the per-language sampling/mixture rate in the data distribution, that'd also be very helpful.

I also noticed that the paper says you used the non-English part from the MMMLU dataset. In the paper, the HF repo is referenced, which does not contain an English part (I'm assuming you mean that the original English MMLU wasn't included), but which does contain a Chinese part.
I'm assuming that you did not include English in this evaluation set in order to not skew the multilingual results, because English is one of the two main powerful languages of the model. Shouldn't Chinese have been filtered as well to achieve a similar reduction of result skewing?

Thank you and good luck continuing the great work! :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant