MLSA Neural Vocoder

この記事のMLSAニューラルボコーダーの学習コードです。

使い方

Python 3.10以上が必要です。あらかじめ環境にあったPyTorch 2を導入してください。

pip install -r requirements.txt

configディレクトリにサンプルの設定ファイルがあります。適宜data_pathやpreprocessed_path、log_dirなどのパラメータを変更することで前処理・学習に使用できます。

python preprocessor.py <config file>

長い音声(歌声データなど)を使用する場合は-sもしくは--splitオプションを使ってください。

python preprocessor.py <config file> -s

python train.py <config file>

サンプル音声がTensorboard上に出力されます。

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
config		config
models		models
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.py		config.py
dataset.py		dataset.py
preprocessor.py		preprocessor.py
requirements.txt		requirements.txt
tensorboard.sh		tensorboard.sh
train.py		train.py
yaml_to_dataclass.py		yaml_to_dataclass.py