-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when training VITS model for vctk dataset #5708
Labels
Bug
bug should be fixed
Comments
Thanks for the report. |
It seems that default speaker embedding is changed (I designed the config for kaldi-xvector). espnet/egs2/TEMPLATE/tts1/tts.sh Lines 69 to 73 in ca7716f
There are two method:
Maybe 1 is easier. Just change the following line from 512 to 192.
|
Hi, echo @kan-bayashi 's point
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I am trying to train the VITS TTS for multi-speaker setup using xvector using the vctk recipe. I am using the instructions provided in https://github.com/espnet/espnet/blob/master/egs2/TEMPLATE/tts1/README.md#vits-training. I get the following error while training
line 116, in forward
return F.linear(input, self.weight, self.bias)
RuntimeError: mat1 and mat2 shapes cannot be multiplied (39x192 and 512x256)
Basic environments:
Environments from
torch.utils.collect_env
:The text was updated successfully, but these errors were encountered: