Feature request: select tasks by language #4

Open
avidale opened this issue Jul 20, 2023 · 2 comments
Labels
enhancement New feature or request

Comments

avidale commented Jul 20, 2023

Currently, the package doesn't allow selecting tasks by language.
I think many people who develop models for specific languages (or sets of languages) would like to access task data for a given language, so implementing this functionality would be a great help.

sileod (Owner) commented Jul 20, 2023

Hi, thanks for your suggestion!
Currently, you can use the dataframe and check whether a language appears in the task names. But that's not enough: some datasets store the language in a separate column that the preprocessing removes. So it's not great, I agree.
Proper language handling is on my roadmap.
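
As a rough illustration of the name-based workaround, here is a minimal sketch. The helper `mentions_language` and the task names are hypothetical and are not part of the package; the sketch only assumes that the language code shows up as a separate token in the task name.

```python
import re


def mentions_language(task_name: str, lang: str) -> bool:
    """Return True if `lang` appears as a separate token in `task_name`."""
    # Split on anything that is not a letter or digit,
    # so "xnli/fr" becomes ["xnli", "fr"].
    tokens = re.split(r"[^a-z0-9]+", task_name.lower())
    return lang.lower() in tokens


# Hypothetical task names, for illustration only.
names = ["xnli/fr", "xnli/de", "glue/mnli", "paws-x/fr"]
french = [n for n in names if mentions_language(n, "fr")]
```

With a pandas dataframe, the same predicate could be applied to the name column through `Series.map` (the column name would depend on the dataframe). Matching whole tokens avoids false hits such as "fr" inside "africa", but it still misses datasets whose language lives only in a data column.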

sileod added the enhancement label Jul 20, 2023

avidale (Author) commented Jul 20, 2023

Yes, adding the language IDs to the dataframe would be a great first step.
Another potential enhancement is to make the file recast.py localizable, so that the user could provide the prompt templates in the chosen language instead of the default (English).
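
One possible shape for such localization is sketched below, under the assumption that templates are plain format strings keyed by language code. The names `TEMPLATES`, `DEFAULT_LANG` and `render_prompt` are hypothetical, and the template texts are made up for illustration, not taken from recast.py.

```python
DEFAULT_LANG = "en"

# Hypothetical templates keyed by language code; not taken from recast.py.
TEMPLATES = {
    "en": "Premise: {premise}\nHypothesis: {hypothesis}",
    "fr": "Prémisse : {premise}\nHypothèse : {hypothesis}",
}


def render_prompt(lang: str, **fields: str) -> str:
    """Fill the template for `lang`, falling back to the English default."""
    template = TEMPLATES.get(lang, TEMPLATES[DEFAULT_LANG])
    return template.format(**fields)


prompt_fr = render_prompt("fr", premise="Il pleut.", hypothesis="Le sol est mouillé.")
```

Falling back to the default keeps existing behavior for languages without a user-provided template.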
