ComfyUI-KokoroTTS is a powerful text-to-speech node for ComfyUI. It leverages the Kokoro TTS framework to convert text into natural-sounding speech, supporting multiple languages and voices. The node is easy to integrate and customize, making it perfect for various applications.
(2025/02/24) V1.1.0 Added audio saving functionality update.md
- Multiple languages and voices support
- Adjustable speech rate and volume
- High-quality voice synthesis
- All voices are setting in Languages.json
Search for ComfyUI-KokoroTTS
in ComfyUI-Manager and install.
Then install dependencies:
./ComfyUI/python_embeded/python -m pip install -r requirements.txt
cd ComfyUI/custom_nodes
git clone https://github.com/1038lab/ComfyUI-KokoroTTS.git
cd ComfyUI-KokoroTTS
./ComfyUI/python_embeded/python -m pip install -r requirements.txt
https://huggingface.co/1038lab/KokoroTTS/tree/main
download and save save all files to TTS/KokoroTTS
By default, only English is supported. To add more languages, edit Languages.json
to add more languages support. remove #
language name in the json file. restart ComfyUI.
from KokoroTTS import KokoroTTS
tts = KokoroTTS()
audio_output = tts.tts("Hello, world!", voice="default_voice")
- Add Kokoro TTS node to workflow
- Input text and select voice
- Adjust speed and volume (optional)
- Connect to Save Audio node for export
Available voices can be loaded using the load_voices
method. See config.json for detailed voice list.
- Python packages (see requirements.txt)
- CUDA compatible GPU (optional, for faster processing)
GNU GPLv3
- Kokoro TTS Framework