Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How Can I use my own voice to training the AI #284

Open
CodeMagic6 opened this issue Jun 7, 2024 · 8 comments · May be fixed by #680
Open

How Can I use my own voice to training the AI #284

CodeMagic6 opened this issue Jun 7, 2024 · 8 comments · May be fixed by #680
Labels
algorithm Algorithm improvements & issues enhancement New feature or request help wanted Extra attention is needed

Comments

@CodeMagic6
Copy link

I want to use my own voice to training the AI, but I don't know the steps.Can anyone write a torturial about it?

我不知道怎么使用自己的语音训练AI生成自己的语音,有人能写出详细的步骤吗?

@cesinsingapore
Copy link

same question, i need the english speaker to speak more singlish/chinese accent

@kuang-kuang
Copy link

I also need clone my own voice

@MethanJess
Copy link

This fork is trying to add fine tuning: https://github.com/ain-soph/ChatTTS

@yoesak
Copy link

yoesak commented Jun 14, 2024

This fork is trying to add fine tuning: https://github.com/ain-soph/ChatTTS

Does it support to train a new language? instead of fine-tune for cloning voice?

@cesinsingapore
Copy link

This fork is trying to add fine tuning: https://github.com/ain-soph/ChatTTS

Does it support to train a new language? instead of fine-tune for cloning voice?

the fork is not clear how to do the fine tune

@xpdd123
Copy link

xpdd123 commented Jun 20, 2024

This fork is trying to add fine tuning: https://github.com/ain-soph/ChatTTS

Does it support to train a new language? instead of fine-tune for cloning voice?

the fork is not clear how to do the fine tune

first finetune encoder, then finetune gpt

@ain-soph
Copy link
Contributor

ain-soph commented Jul 5, 2024

I’m the one who create that fork.
Yes, first train the encoder and then GPT. However, the current encoder loss cannot be optimized to be smaller than 0.1, making training unfeasible.

The possible solution is to modify current VQ encoder architecture and explore better training hyper-parameters. Or find a better dataset for training.

But I’m busy working, so … Hope anyone other could do that

@github-actions github-actions bot added the stale The topic has been ignored for a long time label Aug 5, 2024
@fumiama fumiama added enhancement New feature or request help wanted Extra attention is needed algorithm Algorithm improvements & issues and removed stale The topic has been ignored for a long time labels Aug 14, 2024
@fumiama fumiama linked a pull request Aug 14, 2024 that will close this issue
@vivek-kumar-vkb
Copy link

I’m the one who create that fork. Yes, first train the encoder and then GPT. However, the current encoder loss cannot be optimized to be smaller than 0.1, making training unfeasible.

The possible solution is to modify current VQ encoder architecture and explore better training hyper-parameters. Or find a better dataset for training.

But I’m busy working, so … Hope anyone other could do that

@ain-soph Can you tell me how to fine-tune encoder and then GPT? Unfortunately I am not able to find script of finetuning anywhere. If you could guide me in this it would be great.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
algorithm Algorithm improvements & issues enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

Successfully merging a pull request may close this issue.

9 participants