-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Segfault with "IOT instruction (core dumped)" #11
Comments
Could be a phrase length issue; try "this is a test message" |
Also the message can only include lowercase letters, spaces, and punctuation(though this isn't why it core dumped). |
Yes, it warns that it sees unknown tokens, but it shouldn't segfault on them anyways. I've retried with lower-cased version without the unknown tokens with the same result.
Maybe. This exact phrase works, phrase from the readme works. This works
and this doesn't
|
The second phrase works on a 1070ti for me.
Make sure your submodule is up to date; I released a patch that allows long phrases on GPU very recently. You might want to do a fresh recursive clone of the repository to make sure the submodule is actually updated. |
I'm running it on CPU, not on GPU. Nevertheless, I've tried with updated sources - and segfault is gone. Though the sound deteriorates further into the sentence - but I couldn't run the original model to compare to. |
yeah you might have to try a few times to get a good generation with multiple sentences; you're better off splitting your input text by "." or something to that effect and passing one sentence into tortoise.cpp at a time. Generation quality is consistent with the "fast" preset of tortoise-tts, with the caveat that the original version generates multiple wav candidates and uses CLVP to pick the best one, whereas tortoise.cpp doesn't implement CLVP. So, at the cost of additional computation, tortoise-tts has a bit of an edge over tortoise.cpp in terms of average final generation quality. but you can get just as good generations with tortoise.cpp with enough tries. Another caveat is that tortoise.cpp only re-implements the "fast" preset of tortoise-tts, when there are higher quality settings with more diffusion steps available in tortoise-tts. Please let me know if you have any further issues; feedback is super valuable at this stage of the project :^) |
Yeah, I thought the same. Single sentences sounds good, but I haven't yet tried to merge them - I was more interested in checking intonations/overall voice quality. |
It varies a lot by voice, I find that the premade voices provided with tortoise-tts with the name format train_<name> seem subjectively the most stable and humanlike. This includes mol and mouse, but I think there are others. |
The following invocation
results in
The text was updated successfully, but these errors were encountered: