Skip to content

ComfyUI-KokoroTTS: A text-to-speech model that utilizes the Kokoro TTS framework to convert text into natural-sounding speech. It supports multiple voices and languages

License

Notifications You must be signed in to change notification settings

1038lab/ComfyUI-KokoroTTS

Repository files navigation

ComfyUI-KokoroTTS

ComfyUI-KokoroTTS is a powerful text-to-speech node for ComfyUI. It leverages the Kokoro TTS framework to convert text into natural-sounding speech, supporting multiple languages and voices. The node is easy to integrate and customize, making it perfect for various applications.

KokoroTTS

Updates

(2025/02/24) V1.1.0 Added audio saving functionality update.md

image

Features

Kokoro TTS Node

  • Multiple languages and voices support
  • Adjustable speech rate and volume
  • High-quality voice synthesis
  • All voices are setting in Languages.json

Installation

Method 1: Install via ComfyUI-Manager

Search for ComfyUI-KokoroTTS in ComfyUI-Manager and install. Then install dependencies:

./ComfyUI/python_embeded/python -m pip install -r requirements.txt

Method 2: Manual Clone Installation

cd ComfyUI/custom_nodes
git clone https://github.com/1038lab/ComfyUI-KokoroTTS.git

cd ComfyUI-KokoroTTS
./ComfyUI/python_embeded/python -m pip install -r requirements.txt

manually download models

https://huggingface.co/1038lab/KokoroTTS/tree/main

download and save save all files to TTS/KokoroTTS

more languages support:

By default, only English is supported. To add more languages, edit Languages.json to add more languages support. remove # language name in the json file. restart ComfyUI.

Usage Examples

Basic Usage

from KokoroTTS import KokoroTTS

tts = KokoroTTS()
audio_output = tts.tts("Hello, world!", voice="default_voice")

Using in ComfyUI Workflow

  1. Add Kokoro TTS node to workflow
  2. Input text and select voice
  3. Adjust speed and volume (optional)
  4. Connect to Save Audio node for export

Available Voices

Available voices can be loaded using the load_voices method. See config.json for detailed voice list.

Requirements

  • Python packages (see requirements.txt)
  • CUDA compatible GPU (optional, for faster processing)

License

GNU GPLv3

Credits

  • Kokoro TTS Framework

About

ComfyUI-KokoroTTS: A text-to-speech model that utilizes the Kokoro TTS framework to convert text into natural-sounding speech. It supports multiple voices and languages

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages