Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to clone my own voice locally #185

Open
gitihobo opened this issue Aug 14, 2023 · 1 comment
Open

How to clone my own voice locally #185

gitihobo opened this issue Aug 14, 2023 · 1 comment

Comments

@gitihobo
Copy link

I want to create my own dataset, how do I do this? I cant seem to make sense of the dataset section

@bitplane
Copy link
Contributor

bitplane commented Sep 15, 2023

Dunno if it'll work but I did this:

  • Used Mozilla Common Voice to help build a dataset for the whole world
  • Downloaded my data containing a couple of thousand recordings through the Common Voice export page in my profile
  • Converted the mp3 files in the zip to wav files and put them in a folder called wavs
  • Then took the text from the export and put the name of the wav followed by the text, pipe separated: wav_file.wav|Text spoken, and put it in metadata.csv
  • replaced all the fancy quotes for " and the fancy apostrophes with '
  • zipped this up
  • Ran docker build . -t voice-cloning-app in the project dir
  • Then docker run --gpus all -p 5000:5000 -v$(pwd)/data:/app/data voice-cloning-app
  • Then went to http://localhost:5000/ and uploaded the zip file as an import
  • It then said it'd take a year to run, so I exported it and used the colab notebook. Had to make a couple of changes to get it working:

https://colab.research.google.com/gist/bitplane/60543855a8fc3df473e94e18d45b06a3/copy-of-voice-cloning-training.ipynb

You'll need this file in your Drive dir too, call it pretrained.pt:

https://drive.usercontent.google.com/download?id=1c5ZTuT7J08wLUoVZ2KkUs_VdZuJ86ZqA

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants