Arabizi/ Franco-Arabic text translated to English #5469

Open
omnipervius opened this issue Mar 31, 2024 · 1 comment

Comments

@omnipervius

What is your question?

I am trying to translate Arabizi to English. Arabizi is the romanized form of Arabic that is often used on social media. Can this model translate from Arabizi, or do you know of another model already trained for that purpose? I would like to avoid normalizing the text before passing it to the model, but if anybody can suggest a good normalization model (Arabizi to Arabic), that would also be helpful. :)

What have you tried?

We are classifying the input as Arabic or Moroccan Arabic, but the model is not able to translate a single word, because it expects text in standard Arabic script.
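To make the mismatch concrete, here is a minimal sketch (not part of any model's API) that measures how much of an input is actually written in Arabic script. The function name `arabic_letter_ratio` and the sample strings are hypothetical, chosen only for illustration. Arabizi input scores 0.0 even when it is tagged as Arabic, which is consistent with a model trained on Arabic-script text failing on it.

```python
import unicodedata


def arabic_letter_ratio(text: str) -> float:
    """Return the fraction of alphabetic characters whose Unicode name
    starts with "ARABIC" (hypothetical helper, for illustration only)."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    arabic = sum(
        1 for ch in letters
        # unicodedata.name returns the default "" for unnamed characters
        if unicodedata.name(ch, "").startswith("ARABIC")
    )
    return arabic / len(letters)


# Arabizi uses Latin letters and digits, so no letter is in Arabic script.
arabizi_ratio = arabic_letter_ratio("3amel eh")
# The same kind of text in Arabic script is fully Arabic.
script_ratio = arabic_letter_ratio("مرحبا")
```

A check like this could be used to route Latin-script input to a normalization step before translation, under the assumption that such a step is available.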

@hwang136

Hey, my friend, how do you know which special token represents the Arabic language? There are 21 special tokens ending with Arab, and I do not know which one I should use.
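One possible reading, not confirmed in this thread: if the tokens follow the FLORES-200 convention of `<ISO 639-3 language>_<ISO 15924 script>`, then the `Arab` suffix names the script rather than the language, and the three-letter prefix selects the language or variety. The sketch below groups codes by script under that assumption. The `codes` list is a small illustrative sample, not the model's full vocabulary, and `group_by_script` is a hypothetical helper.

```python
from collections import defaultdict

# Illustrative sample only; assumes FLORES-200-style "<language>_<script>" codes.
codes = ["arb_Arab", "ary_Arab", "arz_Arab", "pes_Arab", "urd_Arab", "eng_Latn"]


def group_by_script(lang_codes):
    """Map each script suffix to the language prefixes that use it."""
    groups = defaultdict(list)
    for code in lang_codes:
        language, script = code.split("_", 1)
        groups[script].append(language)
    return dict(groups)


groups = group_by_script(codes)
# Under the assumed convention, "arb" is Modern Standard Arabic and "ary" is
# Moroccan Arabic, while "pes" (Persian) and "urd" (Urdu) share the script
# but are not Arabic at all.
arabic_script_languages = groups["Arab"]
```

Under this reading, picking a token means choosing the prefix for the variety you need, not any token that happens to end in `Arab`.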
