Text2Vox is a user-friendly text-to-speech (TTS) web application that utilizes Hugging Face's state-of-the-art models to convert text into natural-sounding audio. With its simple interface and support for multiple voice models, it offers a seamless way to transform written content into speech.
- Multiple Voice Models: Choose between top-tier TTS models:
- ESPNET - VITS: Clear and natural English voice
- Facebook - MMS TTS: High-quality English speech
- Custom Text Input: Enter text in a responsive textarea and convert it to audio with a single click.
- Audio Playback and Download: Listen to the generated audio directly or download it for offline use.
- Add support for more languages and models.
- Improve background animations to react dynamically to audio generation.
- Implement user authentication for personalized settings.
- Provide an option to adjust speech parameters (e.g., pitch, speed).
- Hugging Face for their amazing TTS models.
- Framer Motion for animation support.
- Tailwind CSS for the design framework.
Feel free to explore Text2Vox! 🚀