Tacotron2 Pytorch

A PyTorch implementation of Tacotron2, described in Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions, an end-to-end text-to-speech(TTS) neural network architecture, which directly converts character text sequence to speech.

https://github.com/kaituoxu/Tacotron2 is refereced with pytorch_sound
Differences

Use log mel spectrogram and Waveglow Vocoder to synthesize audios

Change dimension of tensors from (N, T, C) to (N, C, T)

N : batch size, C : channels, T : time steps

Add stop status on inference time.

Add thiner pre-net to get more accurate attention.

And little bit different text encoder.

Environment

Ubuntu 16.04
Python 3.6
PyTorch 1.2.0
2 GPUs

Install

Install above external repos

You should see first README.md of pytorch_sound, to prepare dataset.

$ pip install git+https://github.com/Appleholic/pytorch_sound

Install package

$ pip install -e .

Usage

Train

$ python tacotron2_pytorch/train.py [YOUR_META_DIR] [SAVE_DIR] [SAVE_PREFIX] [[OTHER OPTIONS...]]

Synthesize (one sample)
- It writes audio, wave plot, attention and mel spectrogram image.

$ python tacotron2_pytorch/synthesize.py [TEXT] [PRETRAINED_PATH] [MODEL_NAME] [SAVE_DIRECTORY]

Known Issues

When inference time, spectrogram got several stripes. It might be occurred by hard drop out. (Not appear on training time)
Stop token is not working well on inference time.
Error case and resolve them: Torchhub waveglow
- Automatically downloaded checkpoint file is crashed with using hubconf.py on Nvidia DeepLearningExample
- Download directly from nvidia waveglow checkpoint 32fp code, and copy that into '$HOME/.cache/torch/checkpoints'

Results

Total Validation Loss
- Sum of 2 MSE Losses (linear, linear + post) and stop BCE Loss
- red : pre net 64 dim, blue : pre net 256 dim
- 100,000 steps

Attention, Mel Spectrogram Sample

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
assets		assets
tacotron2_pytorch		tacotron2_pytorch
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

assets

assets

tacotron2_pytorch

tacotron2_pytorch

.gitignore

.gitignore

README.md

README.md

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

Tacotron2 Pytorch

Environment

Install

Usage

Known Issues

Results

About

Releases

Packages

Languages

AppleHolic/tacotron2-pytorch

Folders and files

Latest commit

History

Repository files navigation

Tacotron2 Pytorch

Environment

Install

Usage

Known Issues

Results

About

Topics

Resources

Stars

Watchers

Forks

Languages