Expected amount of data needed in hours to get good model on a new language❓ #31
Asked by ErenBalatkan in Q&A · Answered by snakers4
-
❓ Questions and Help
How much labeled data, in hours of speech, would you say we need to train a good model on a new language?
Answered by
snakers4
Nov 24, 2020
-
For minor (non-English) languages with limited dialects, about 200-300 hours per domain, across at least 4-5 domains.
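Multiplying those figures out gives a rough total. The snippet below is only a sketch of that arithmetic: the function name and its defaults are illustrative, taken from the numbers in this reply, and the reply itself states only the per-domain figures and the domain count, not a total.

```python
def total_hours_range(hours_per_domain=(200, 300), domains=(4, 5)):
    """Return (low, high) total labeled hours implied by the rule of thumb.

    Defaults are the figures quoted in the reply: 200-300 hours per domain
    and at least 4-5 domains. The upper bound is therefore not a hard cap.
    """
    low = hours_per_domain[0] * domains[0]    # fewest hours x fewest domains
    high = hours_per_domain[1] * domains[1]   # most hours x most domains
    return low, high


low, high = total_hours_range()
print(f"Roughly {low}-{high} hours of labeled speech in total")
# prints "Roughly 800-1500 hours of labeled speech in total"
```

So the rule of thumb works out to somewhere around 800-1500 labeled hours overall, and more if you cover more than 5 domains.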
Answer selected by
snakers4
-
Thanks, closing.