Skip to content

The best looking and most functional webui for RVC related tasks. See website for UI demo:

License

Notifications You must be signed in to change notification settings

SayanoAI/RVC-Studio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

⚠️ The main branch is bleeding edge: Expect frequent updates and many breaking changes after every commit

RVC Studio

This project provides a comprehensive platform for training RVC models and generating AI voice covers. Use the app to download the required files before using or manually download them here: https://huggingface.co/datasets/SayanoAI/RVC-Studio/tree/main

Features

  • Youtube music downloader: download any music video from Youtube as an mp3 file with just one click.
  • 1-click AI song covers: easily create AI song covers using RVC.
  • RVC Model fine-tuning: fine-tune an RVC model to mimic any voice you want using your own data.
  • 1-click TTS using RVC model: convert any text to speech using the fine-tuned VC model with just one click.
  • Built-in tensorboard: You can monitor the training progress and performance of your VC model using a built-in tensorboard dashboard.
  • LLM integration: chat with your RVC model in real time using popular LLMs.
  • Auto-Playlist: let your RVC model sing songs from your favourite playlist.

Planned Features

  • Demucs: Meta's vocals and instrumental music source separation.
  • Audio-postprocessing: You can enhance the quality of your generated songs by adding reverbs, echos, etc.
  • TTS using cloud API: use a cloud-based text-to-speech service to generate high-quality and natural-sounding speech from any text.
  • Real-time VC interface: convert your voice using your favourite RVC model.

Requirements

  • Python 3.6 or higher (developed and tested on v3.8.17)
  • Pip
  • Virtualenv or conda package manager

Easy Install

  1. Clone this repository or download the zip file and extract it.
  2. Double-click "conda-installer.bat" to install the latest version of conda package manager
  3. Double-click "conda-start.bat" (if you skipped step 2.)

Manual Installation

  1. Clone this repository or download the zip file.
  2. Navigate to the project directory and create a virtual environment with the command virtualenv venv.
  3. Activate the virtual environment with the command source venv/bin/activate on Linux/Mac or venv\Scripts\activate on Windows. Or use conda create -n RVC-Studio & conda activate RVC-Studio if you're using conda package manager.
  4. Install the required packages with the command pip install -r requirements.txt.
  5. Run the streamlit app with the command streamlit run Home.py.

Or run it in Google Colab

Instructions for Inference page

  1. Download all the required models on the webui page or here: https://huggingface.co/datasets/SayanoAI/RVC-Studio/tree/main
  2. Put your favourite songs in the ./songs folder
  3. Navigate to "RVC Server" page and start the server
  4. Navigate to "Inference" page and press "Refresh Data" button
  5. Select a song (only wav/flac/ogg/mp3 are supported for now)
  6. Select a voice model (put your RVC v2 models in ./models/RVC/ and index file in ./models/RVC/.index/)
  7. Choose a vocal extraction model (preprocessing model is optional)
  8. Click "Save Options" and "1-Click VC" to get started

Instructions for Chat page

Chat functionality has been migrated to RVC-Chat.

Feel free to use larger versions of these models if your computer can handle it. (you will have to build your own config)

Dockerize

Run docker compose up --build in the main project folder.

Known issue: Tensorboard doesn't work inside a docker container. Feel free to submit a PR if you know a solution. fixed in commit 8b720936b4dab347cba0e4a791330fb533bfdf1d

FAQs

Disclaimer

This project is for educational and research purposes only. The generated voice overs are not intended to infringe on any copyrights or trademarks of the original songs or text. The project does not endorse or promote any illegal or unethical use of the generative AI technology. The project is not responsible for any damages or liabilities arising from the use or misuse of the generated voice overs.

Credits

This project uses code and AI models from the following repositories:

We thank all the authors and contributors of these repositories for their amazing work and for making their code and models publicly available.