-
Notifications
You must be signed in to change notification settings - Fork 84
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
运行过程中,不断打印“Load pinyin_en_mix_dict failed” #68
Comments
(/media/lab-hp/B23AB5DD3AB59F33/condaenv/maas) lab-hp@labhp-HP:~/桌面/KAN-TTS$ python ./kantts/bin/text_to_wav.py --txt test.txt --output_dir res --res_zip speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/resource.zip --am_ckpt speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/sambert/ckpt/checkpoint_980000.pth --voc_ckpt speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/hifigan/ckpt/checkpoint_2000000.pth --speaker xiaoyu 你遇到过这个问题吗【Could not load library libcudnn_cnn_infer.so.8. Error: /media/lab-hp/B23AB5DD3AB59F33/condaenv/maas/bin/../lib/libnvrtc.so: undefined symbol: nvrtcGetCUBIN |
没遇到过。。。 |
我遇到了,也不知道怎么回事,但是也不出结果 |
ttsfrd 版本问题,重装个低版本 0.0.4,就好了。 |
安装0.0.4后和tts-autolabel需要的版本又冲突了,怎么破?大佬 |
|
感谢大佬,明了!只是即便是有这个错误Load pinyin_en_mix_dict failed,也能出来结果,不知道对结果是否影响,我先试试,感谢! |
运行过程中,不断打印“Load pinyin_en_mix_dict failed”。虽然能正常输出音频,但不知道这条log是否表明运行有问题?是不是我配置还缺了啥?
直接用的是SambertHifigan语音合成-中文-多人预训练-16k模型。
主角本如下:
#!/bin/bash
SambertHifigan语音合成-中文-多人预训练-16k
git clone -b pretrain http://www.modelscope.cn/speech_tts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k.git
speaker_list: 'F7,F74,FBYN,FRXL,M7,xiaoyu'} all except M7 are female
res_zip=../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/resource.zip
am_ckpt=../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/sambert/ckpt/checkpoint_980000.pth
voc_ckpt=../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/hifigan/ckpt/checkpoint_2000000.pth
spk=xiaoyu
outdir=out_$spk
[ -d $outdir ] && rm -rf $outdir; mkdir -p $outdir
python ./kantts/bin/text_to_wav.py
--txt ./test_data/txt
--output_dir $outdir
--res_zip $res_zip
--am_ckpt $am_ckpt
--voc_ckpt $voc_ckpt
--speaker $spk
运行过程中打印的log如下:
Converting text to symbols...
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
text.cc: festival_Text_init
AM is infering...
Loading checkpoint: ../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/sambert/ckpt/checkpoint_980000.pth
Inference sentence: 0_0
x_band_width:7, h_band_width: 7
Inference sentence: 1_0
x_band_width:6, h_band_width: 6
Inference sentence: 2_0
x_band_width:8, h_band_width: 8
Inference sentence: 3_0
x_band_width:7, h_band_width: 7
Vocoder is infering...
Loss = {'discriminator_adv_loss': {'enable': True, 'params': {'average_by_discriminators': False}, 'weights': 1.0}, 'feat_match_loss': {'enable': True, 'params': {'average_by_discriminators': False, 'average_by_layers': False}, 'weights': 2.0}, 'generator_adv_loss': {'enable': True, 'params': {'average_by_discriminators': False}, 'weights': 1.0}, 'mel_loss': {'enable': True, 'params': {'fft_size': 2048, 'fmax': 8000, 'fmin': 0, 'fs': 16000, 'hop_size': 200, 'log_base': None, 'num_mels': 80, 'win_length': 1000, 'window': 'hann'}, 'weights': 45.0}, 'stft_loss': {'enable': False}, 'subband_stft_loss': {'enable': False, 'params': {'fft_sizes': [384, 683, 171], 'hop_sizes': [35, 75, 15], 'win_lengths': [150, 300, 60], 'window': 'hann_window'}}}
Model = {'Generator': {'optimizer': {'params': {'betas': [0.5, 0.9], 'lr': 0.0002, 'weight_decay': 0.0}, 'type': 'Adam'}, 'params': {'bias': True, 'channels': 256, 'in_channels': 80, 'kernel_size': 7, 'nonlinear_activation': 'LeakyReLU', 'nonlinear_activation_params': {'negative_slope': 0.1}, 'out_channels': 1, 'resblock_dilations': [[1, 3, 5, 7], [1, 3, 5, 7], [1, 3, 5, 7]], 'resblock_kernel_sizes': [3, 7, 11], 'upsample_kernal_sizes': [20, 10, 4, 4], 'upsample_scales': [10, 5, 2, 2], 'use_weight_norm': True}, 'scheduler': {'params': {'gamma': 0.5, 'milestones': [200000, 400000, 600000, 800000]}, 'type': 'MultiStepLR'}}, 'MultiPeriodDiscriminator': {'optimizer': {'params': {'betas': [0.5, 0.9], 'lr': 0.0002, 'weight_decay': 0.0}, 'type': 'Adam'}, 'params': {'discriminator_params': {'bias': True, 'channels': 32, 'downsample_scales': [3, 3, 3, 3, 1], 'in_channels': 1, 'kernel_sizes': [5, 3], 'max_downsample_channels': 1024, 'nonlinear_activation': 'LeakyReLU', 'nonlinear_activation_params': {'negative_slope': 0.1}, 'out_channels': 1, 'use_spectral_norm': False}, 'periods': [2, 3, 5, 7, 11]}, 'scheduler': {'params': {'gamma': 0.5, 'milestones': [200000, 400000, 600000, 800000]}, 'type': 'MultiStepLR'}}, 'MultiScaleDiscriminator': {'optimizer': {'params': {'betas': [0.5, 0.9], 'lr': 0.0002, 'weight_decay': 0.0}, 'type': 'Adam'}, 'params': {'discriminator_params': {'bias': True, 'channels': 128, 'downsample_scales': [4, 4, 4, 4, 1], 'in_channels': 1, 'kernel_sizes': [15, 41, 5, 3], 'max_downsample_channels': 1024, 'max_groups': 16, 'nonlinear_activation': 'LeakyReLU', 'nonlinear_activation_params': {'negative_slope': 0.1}, 'out_channels': 1}, 'downsample_pooling': 'DWT', 'downsample_pooling_params': {'kernel_size': 4, 'padding': 2, 'stride': 2}, 'follow_official_norm': True, 'scales': 3}, 'scheduler': {'params': {'gamma': 0.5, 'milestones': [200000, 400000, 600000, 800000]}, 'type': 'MultiStepLR'}}}
allow_cache = True
audio_config = {'fmax': 8000.0, 'fmin': 0.0, 'hop_length': 200, 'max_norm': 1.0, 'min_level_db': -100.0, 'n_fft': 2048, 'n_mels': 80, 'norm_type': 'mean_std', 'num_workers': 16, 'phone_level_feature': True, 'preemphasize': False, 'ref_level_db': 20, 'sampling_rate': 16000, 'symmetric': False, 'trim_silence': True, 'trim_silence_threshold_db': 60, 'wav_normalize': True, 'win_length': 1000}
batch_max_steps = 9600
batch_size = 16
create_time = 2022-09-18 14:11:30
discriminator_grad_norm = -1
discriminator_train_start_steps = 0
eval_interval_steps = 10000
generator_grad_norm = -1
generator_train_start_steps = 1
git_revision_hash = 22ae438
log_interval_steps = 1000
model_type = hifigan
num_save_intermediate_results = 4
num_workers = 2
pin_memory = True
remove_short_samples = False
save_interval_steps = 20000
train_max_steps = 2500000
Loaded model parameters from ../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/hifigan/ckpt/checkpoint_2000000.pth.
Removing weight norm...
Finished generation of 4 utterances (RTF = 0.310).
['out_xiaoyu/0_0_mel_gen.wav', 'out_xiaoyu/1_0_mel_gen.wav', 'out_xiaoyu/2_0_mel_gen.wav', 'out_xiaoyu/3_0_mel_gen.wav']
Text to wav finished!
Conda list:
Name Version Build Channel
_libgcc_mutex 0.1 main https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
_openmp_mutex 5.1 1_gnu https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
absl-py 1.4.0 pypi_0 pypi
addict 2.4.0 pypi_0 pypi
aiohttp 3.8.4 pypi_0 pypi
aiosignal 1.3.1 pypi_0 pypi
aliyun-python-sdk-core 2.13.36 pypi_0 pypi
aliyun-python-sdk-kms 2.16.1 pypi_0 pypi
aniso8601 9.0.1 pypi_0 pypi
async-timeout 4.0.2 pypi_0 pypi
attrs 23.1.0 pypi_0 pypi
audioread 3.0.0 pypi_0 pypi
autopep8 2.0.2 pypi_0 pypi
bitstring 4.0.2 pypi_0 pypi
ca-certificates 2023.05.30 h06a4308_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
certifi 2023.5.7 pypi_0 pypi
cffi 1.15.1 pypi_0 pypi
cfgv 3.3.1 pypi_0 pypi
charset-normalizer 3.1.0 pypi_0 pypi
click 8.0.4 pypi_0 pypi
cmake 3.26.4 pypi_0 pypi
coloredlogs 14.0 pypi_0 pypi
contourpy 1.1.0 pypi_0 pypi
crcmod 1.7 pypi_0 pypi
cryptography 41.0.1 pypi_0 pypi
cycler 0.11.0 pypi_0 pypi
cython 0.29.35 pypi_0 pypi
datasets 2.8.0 pypi_0 pypi
decorator 5.1.1 pypi_0 pypi
dill 0.3.6 pypi_0 pypi
distance 0.1.3 pypi_0 pypi
distlib 0.3.6 pypi_0 pypi
dnspython 2.3.0 pypi_0 pypi
easyasr 0.0.7 pypi_0 pypi
edit-distance 1.0.6 pypi_0 pypi
editdistance 0.5.2 pypi_0 pypi
einops 0.6.1 pypi_0 pypi
espnet-tts-frontend 0.0.3 pypi_0 pypi
et-xmlfile 1.1.0 pypi_0 pypi
eventlet 0.33.3 pypi_0 pypi
filelock 3.12.2 pypi_0 pypi
flask 2.1.3 pypi_0 pypi
flask-cors 3.0.10 pypi_0 pypi
flask-restful 0.3.10 pypi_0 pypi
flask-socketio 4.3.2 pypi_0 pypi
flask-talisman 1.0.0 pypi_0 pypi
fonttools 4.40.0 pypi_0 pypi
frozenlist 1.3.3 pypi_0 pypi
fsspec 2023.6.0 pypi_0 pypi
funasr 0.6.1 pypi_0 pypi
future 0.18.3 pypi_0 pypi
g2p 1.1.20230511 pypi_0 pypi
g2p-en 2.1.0 pypi_0 pypi
gast 0.5.4 pypi_0 pypi
greenlet 2.0.2 pypi_0 pypi
grpcio 1.54.2 pypi_0 pypi
h5py 3.8.0 pypi_0 pypi
huggingface-hub 0.15.1 pypi_0 pypi
humanfriendly 10.0 pypi_0 pypi
hyperpyyaml 1.2.1 pypi_0 pypi
identify 2.5.24 pypi_0 pypi
idna 3.4 pypi_0 pypi
importlib-metadata 6.6.0 pypi_0 pypi
importlib-resources 5.12.0 pypi_0 pypi
inflect 6.0.4 pypi_0 pypi
itsdangerous 2.1.2 pypi_0 pypi
jaconv 0.3.4 pypi_0 pypi
jamo 0.4.1 pypi_0 pypi
jedi 0.18.2 pypi_0 pypi
jinja2 3.1.2 pypi_0 pypi
jmespath 0.10.0 pypi_0 pypi
joblib 1.2.0 pypi_0 pypi
kaldiio 2.18.0 pypi_0 pypi
kantts 0.0.1 pypi_0 pypi
kiwisolver 1.4.4 pypi_0 pypi
kwsbp 0.0.6 pypi_0 pypi
ld_impl_linux-64 2.38 h1181459_1 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
libffi 3.4.4 h6a678d5_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
libgcc-ng 11.2.0 h1234567_1 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
libgomp 11.2.0 h1234567_1 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
librosa 0.9.2 pypi_0 pypi
libstdcxx-ng 11.2.0 h1234567_1 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
lit 16.0.5.post0 pypi_0 pypi
llvmlite 0.40.1rc1 pypi_0 pypi
lxml 4.9.2 pypi_0 pypi
markdown 3.4.3 pypi_0 pypi
markupsafe 2.1.3 pypi_0 pypi
matplotlib 3.7.1 pypi_0 pypi
mindaec 0.0.2 pypi_0 pypi
mir-eval 0.7 pypi_0 pypi
modelscope 1.6.1 pypi_0 pypi
mpmath 1.3.0 pypi_0 pypi
msgpack 1.0.5 pypi_0 pypi
multidict 6.0.4 pypi_0 pypi
multiprocess 0.70.14 pypi_0 pypi
munkres 1.1.4 pypi_0 pypi
ncurses 6.4 h6a678d5_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
networkx 2.8.4 pypi_0 pypi
nltk 3.8.1 pypi_0 pypi
nodeenv 1.8.0 pypi_0 pypi
numba 0.57.0 pypi_0 pypi
numpy 1.22.0 pypi_0 pypi
nvidia-cublas-cu11 11.10.3.66 pypi_0 pypi
nvidia-cuda-cupti-cu11 11.7.101 pypi_0 pypi
nvidia-cuda-nvrtc-cu11 11.7.99 pypi_0 pypi
nvidia-cuda-runtime-cu11 11.7.99 pypi_0 pypi
nvidia-cudnn-cu11 8.5.0.96 pypi_0 pypi
nvidia-cufft-cu11 10.9.0.58 pypi_0 pypi
nvidia-curand-cu11 10.2.10.91 pypi_0 pypi
nvidia-cusolver-cu11 11.4.0.1 pypi_0 pypi
nvidia-cusparse-cu11 11.7.4.91 pypi_0 pypi
nvidia-nccl-cu11 2.14.3 pypi_0 pypi
nvidia-nvtx-cu11 11.7.91 pypi_0 pypi
openpyxl 3.1.2 pypi_0 pypi
openssl 3.0.8 h7f8727e_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
oss2 2.18.0 pypi_0 pypi
packaging 23.1 pypi_0 pypi
pandas 1.5.3 pypi_0 pypi
panphon 0.20.0 pypi_0 pypi
parso 0.8.3 pypi_0 pypi
pexpect 4.8.0 pypi_0 pypi
pickleshare 0.7.5 pypi_0 pypi
pillow 9.5.0 pypi_0 pypi
pip 23.1.2 py38h06a4308_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
platformdirs 3.5.3 pypi_0 pypi
pooch 1.7.0 pypi_0 pypi
pre-commit 3.3.3 pypi_0 pypi
prompt-toolkit 3.0.38 pypi_0 pypi
protobuf 3.20.0 pypi_0 pypi
ptflops 0.7 pypi_0 pypi
ptyprocess 0.7.0 pypi_0 pypi
py-sound-connect 0.2.1 pypi_0 pypi
pyarrow 12.0.1 pypi_0 pypi
pycodestyle 2.10.0 pypi_0 pypi
pycparser 2.21 pypi_0 pypi
pycryptodome 3.18.0 pypi_0 pypi
pydantic 1.10.9 pypi_0 pypi
pygments 2.15.1 pypi_0 pypi
pyparsing 3.0.9 pypi_0 pypi
pypinyin 0.44.0 pypi_0 pypi
pysptk 0.1.21 pypi_0 pypi
python 3.8.16 h955ad1f_4 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
python-dateutil 2.8.2 pypi_0 pypi
python-engineio 3.14.2 pypi_0 pypi
python-socketio 4.6.1 pypi_0 pypi
pytorch-wavelets 1.3.0 pypi_0 pypi
pytorch-wpe 0.0.1 pypi_0 pypi
pytz 2023.3 pypi_0 pypi
pywavelets 1.4.1 pypi_0 pypi
pyyaml 6.0 pypi_0 pypi
readline 8.2 h5eee18b_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
regex 2023.6.3 pypi_0 pypi
requests 2.31.0 pypi_0 pypi
resampy 0.4.2 pypi_0 pypi
responses 0.18.0 pypi_0 pypi
rotary-embedding-torch 0.2.3 pypi_0 pypi
ruamel-yaml 0.17.28 pypi_0 pypi
ruamel-yaml-clib 0.2.7 pypi_0 pypi
scikit-learn 1.2.2 pypi_0 pypi
scipy 1.10.1 pypi_0 pypi
sentencepiece 0.1.99 pypi_0 pypi
setuptools 67.8.0 py38h06a4308_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
simplejson 3.19.1 pypi_0 pypi
six 1.16.0 pypi_0 pypi
sortedcontainers 2.4.0 pypi_0 pypi
soundfile 0.12.1 pypi_0 pypi
sox 1.4.1 pypi_0 pypi
speechbrain 0.5.14 pypi_0 pypi
sqlite 3.41.2 h5eee18b_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
sympy 1.12 pypi_0 pypi
tensorboard 1.15.0 pypi_0 pypi
tensorboardx 2.6 pypi_0 pypi
text-unidecode 1.3 pypi_0 pypi
textgrid 1.5 pypi_0 pypi
threadpoolctl 3.1.0 pypi_0 pypi
tk 8.6.12 h1ccaba5_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
tomli 2.0.1 pypi_0 pypi
torch 2.0.1 pypi_0 pypi
torch-complex 0.4.3 pypi_0 pypi
torchaudio 2.0.2 pypi_0 pypi
tqdm 4.65.0 pypi_0 pypi
traitlets 5.9.0 pypi_0 pypi
triton 2.0.0 pypi_0 pypi
ttsfrd 0.2.1 pypi_0 pypi
typeguard 2.13.3 pypi_0 pypi
typing-extensions 4.6.3 pypi_0 pypi
unicodecsv 0.14.1 pypi_0 pypi
unidecode 1.3.6 pypi_0 pypi
urllib3 2.0.3 pypi_0 pypi
virtualenv 20.23.0 pypi_0 pypi
wcwidth 0.2.6 pypi_0 pypi
werkzeug 2.0.3 pypi_0 pypi
wheel 0.38.4 py38h06a4308_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
xxhash 3.2.0 pypi_0 pypi
xz 5.4.2 h5eee18b_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
yapf 0.40.0 pypi_0 pypi
yarl 1.9.2 pypi_0 pypi
zipp 3.15.0 pypi_0 pypi
zlib 1.2.13 h5eee18b_0 https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
git rev-parse HEAD:
8caf892
The text was updated successfully, but these errors were encountered: