Skip to content
#

hifi-gan

Here are 24 public repositories matching this topic...

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

  • Updated May 10, 2024
  • Python

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

  • Updated Sep 24, 2022
  • Python

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

  • Updated Jun 6, 2022
  • Python

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

  • Updated Jul 31, 2023
  • Python

Improve this page

Add a description, image, and links to the hifi-gan topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hifi-gan topic, visit your repo's landing page and select "manage topics."

Learn more