Text2Vox

Text2Vox is a text-to-speech (TTS) web application that uses models hosted on Hugging Face to convert text into natural-sounding audio. It pairs a simple interface with a choice of voice models, so written content can be turned into speech in a single step.

Features

  • Multiple Voice Models: Choose between two TTS models:
    • ESPnet - VITS: Clear and natural English voice
    • Facebook - MMS TTS: High-quality English speech
  • Custom Text Input: Enter text in a responsive textarea and convert it to audio with a single click.
  • Audio Playback and Download: Listen to the generated audio directly or download it for offline use.
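
As a rough illustration of the text-to-audio step, the sketch below builds a request to a hosted Hugging Face model and returns the raw audio bytes. It is not taken from the Text2Vox source. The model ids (`espnet/kan-bayashi_ljspeech_vits`, `facebook/mms-tts-eng`), the endpoint URL, and the function names `build_tts_request` and `synthesize` are all assumptions made for this example; check the Hugging Face documentation for the endpoint that currently applies to your account.

```python
import json
import urllib.request

# Assumption: these model ids stand in for the two models named in the
# feature list. The README does not state the exact ids the app uses.
MODELS = {
    "ESPnet - VITS": "espnet/kan-bayashi_ljspeech_vits",
    "Facebook - MMS TTS": "facebook/mms-tts-eng",
}

# Assumption: hosted inference endpoint; the actual app may use another URL.
API_BASE = "https://api-inference.huggingface.co/models/"


def build_tts_request(label: str, text: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for the chosen voice model."""
    if label not in MODELS:
        raise ValueError(f"unknown voice model: {label!r}")
    if not text.strip():
        raise ValueError("text must not be empty")
    body = json.dumps({"inputs": text}).encode("utf-8")
    return urllib.request.Request(
        API_BASE + MODELS[label],
        data=body,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(label: str, text: str, token: str) -> bytes:
    """Send the request and return the response body (audio bytes)."""
    with urllib.request.urlopen(build_tts_request(label, text, token)) as resp:
        return resp.read()
```

A caller could write the result of `synthesize("Facebook - MMS TTS", "Hello there", token)` to a file to cover the download feature. The audio container returned depends on the model and endpoint, so the file extension should be chosen from the response's content type and not hard-coded.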

Screenshots

image

Future Enhancements

  • Add support for more languages and models.
  • Improve background animations to react dynamically to audio generation.
  • Implement user authentication for personalized settings.
  • Provide an option to adjust speech parameters (e.g., pitch, speed).

Acknowledgments


Feel free to explore Text2Vox! 🚀