#

hifi-gan

Here are 24 public repositories matching this topic...

Amphion

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

text-to-speech audit speech-synthesis audio-synthesis music-generation voice-conversion text-to-audio fastspeech2 vits hifi-gan audio-generation singing-voice-conversion vall-e audioldm naturalspeech2

Updated May 10, 2024
Python

jik876 / hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

text-to-speech deep-learning pytorch tts speech-synthesis gan vocoder hifi-gan

Updated Jul 23, 2023
Python

keonlee9420 / PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

text-to-speech deep-neural-networks pytorch tts speech-synthesis generative-model vae normalizing-flows high-quality neural-tts non-autoregressive fastspeech hifi-gan non-ar mel-gan portable-tts

Updated Feb 17, 2022
Python

keonlee9420 / Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Updated Sep 24, 2022
Python

keonlee9420 / DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

text-to-speech deep-neural-networks pytorch tts speech-synthesis gan generative-model diffusion diffusion-models neural-tts non-autoregressive fastspeech multi-speaker-tts hifi-gan ddpm non-ar diffspeech diffgan-tts single-speaker-tts

Updated Feb 21, 2022
Python

NTT123 / vietTTS

Vietnamese Text to Speech library

text-to-speech deep-learning vietnamese tts-engines vietnam vocoder tacotron hifi-gan

Updated Aug 20, 2023
Python

keonlee9420 / Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

text-to-speech deep-learning unsupervised end-to-end pytorch tts speech-synthesis jets multi-speaker sota single-speaker neural-tts non-autoregressive fastspeech2 hifi-gan non-ar ultimate-tts text-to-wav

Updated Jun 6, 2022
Python

rishikksh20 / Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

text-to-speech pytorch tts speech-synthesis generative-adversarial-network gan vocoder hifi-gan avocodo

Updated Jul 14, 2022
Python

tts-arabic-pytorch

nipponjo / tts-arabic-pytorch

TTS models for Arabic (Tacotron2, FastPitch)

python text-to-speech deep-learning speech pytorch tts speech-synthesis arabic torchaudio tacotron2-pytorch tacotron2 hifi-gan fastpitch arabic-tts

Updated May 9, 2024
Jupyter Notebook

Voice-Privacy-Challenge / Voice-Privacy-Challenge-2022

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Updated Mar 14, 2024
Python

keonlee9420 / Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

text-to-speech deep-learning efficiency pytorch tts speech-synthesis autoregressive multi-speaker robustness comprehensive tacotron single-speaker neural-tts tacotron2 reduction-factor hifi-gan mel-gan diagonal-guided-attention

Updated Jul 31, 2023
Python

hwRG / End-to-End-TTS-Fine-Tune

Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.

end-to-end tts fine-tune fastspeech2 hifi-gan

Updated Jul 30, 2023
Python

NTT123 / hifigan-tpu

Train HiFi-GAN on TPU

text-to-speech tts gan pax vocoder jax hifi-gan

Updated Apr 3, 2022
Python

manhph2211 / ViTTS

In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system 😄 In general, I used Portaspeech as an acoustic model and iSTFTNet as vocoder...

text-to-speech speech-synthesis mfa vocoder deepspeech normalizing-flow hifi-gan multispeaker-speech-synthesis mosnet portaspeech realtime-tts istftnet vietnamese-tts vietnamese-text-to-speech

Updated Nov 24, 2023
Python

jik876 / hifi-gan-demo

Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"

text-to-speech deep-learning tts speech-synthesis gan hifi-gan

Updated Oct 28, 2020
HTML

ssmlkl / MnTTS2

This is the experimental description of MnTTS2.

tts mongolian multi-speaker-tts fastspeech2 hifi-gan

Updated Apr 11, 2024
Jupyter Notebook

nipponjo / tts-german-pytorch

TTS (FastPitch) for German

python text-to-speech deep-learning german speech pytorch tts speech-synthesis german-language torchaudio emotional-speech hifi-gan fastpitch

Updated Apr 29, 2023
Python

34j / neural-source-filter

Python package for NSF and NSF-HiFi-GAN (unofficial)

python pytorch tts nsf voice-conversion mypy vocoder hifi-gan neural-source-filter

Updated May 6, 2024
Python

watchstep / glow-tts-jejueo

제주어 음성 합성 (보완 중)

tts korean jejueo glow-tts hifi-gan

Updated Dec 21, 2022
Jupyter Notebook

mehdihosseinimoghadam / Catalan-Text-to-Speech

Catalan Text to Speech

speech pytorch speech-synthesis speech-to-text catalan speech-processing tacotron wavernn fastspeech tacotron2-pytorch tacotron2 melgan catalan-language hifi-gan catalan-text-to-speech

Updated Dec 12, 2022
Python

Improve this page

Add a description, image, and links to the hifi-gan topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hifi-gan topic, visit your repo's landing page and select "manage topics."