运行过程中，不断打印“Load pinyin_en_mix_dict failed” #68

airstillblue · 2023-06-29T12:31:18Z

运行过程中，不断打印“Load pinyin_en_mix_dict failed”。虽然能正常输出音频，但不知道这条log是否表明运行有问题？是不是我配置还缺了啥？

直接用的是SambertHifigan语音合成-中文-多人预训练-16k模型。
主角本如下：
#!/bin/bash

SambertHifigan语音合成-中文-多人预训练-16k

git clone -b pretrain http://www.modelscope.cn/speech_tts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k.git

speaker_list: 'F7,F74,FBYN,FRXL,M7,xiaoyu'} all except M7 are female

res_zip=../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/resource.zip
am_ckpt=../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/sambert/ckpt/checkpoint_980000.pth
voc_ckpt=../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/hifigan/ckpt/checkpoint_2000000.pth
spk=xiaoyu

outdir=out_$spk
[ -d $outdir ] && rm -rf $outdir; mkdir -p $outdir

python ./kantts/bin/text_to_wav.py
--txt ./test_data/txt
--output_dir $outdir
--res_zip $res_zip
--am_ckpt $am_ckpt
--voc_ckpt $voc_ckpt
--speaker $spk

运行过程中打印的log如下：
Converting text to symbols...
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
text.cc: festival_Text_init
AM is infering...
Loading checkpoint: ../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/sambert/ckpt/checkpoint_980000.pth
Inference sentence: 0_0
x_band_width:7, h_band_width: 7
Inference sentence: 1_0
x_band_width:6, h_band_width: 6
Inference sentence: 2_0
x_band_width:8, h_band_width: 8
Inference sentence: 3_0
x_band_width:7, h_band_width: 7
Vocoder is infering...
Loss = {'discriminator_adv_loss': {'enable': True, 'params': {'average_by_discriminators': False}, 'weights': 1.0}, 'feat_match_loss': {'enable': True, 'params': {'average_by_discriminators': False, 'average_by_layers': False}, 'weights': 2.0}, 'generator_adv_loss': {'enable': True, 'params': {'average_by_discriminators': False}, 'weights': 1.0}, 'mel_loss': {'enable': True, 'params': {'fft_size': 2048, 'fmax': 8000, 'fmin': 0, 'fs': 16000, 'hop_size': 200, 'log_base': None, 'num_mels': 80, 'win_length': 1000, 'window': 'hann'}, 'weights': 45.0}, 'stft_loss': {'enable': False}, 'subband_stft_loss': {'enable': False, 'params': {'fft_sizes': [384, 683, 171], 'hop_sizes': [35, 75, 15], 'win_lengths': [150, 300, 60], 'window': 'hann_window'}}}
Model = {'Generator': {'optimizer': {'params': {'betas': [0.5, 0.9], 'lr': 0.0002, 'weight_decay': 0.0}, 'type': 'Adam'}, 'params': {'bias': True, 'channels': 256, 'in_channels': 80, 'kernel_size': 7, 'nonlinear_activation': 'LeakyReLU', 'nonlinear_activation_params': {'negative_slope': 0.1}, 'out_channels': 1, 'resblock_dilations': [[1, 3, 5, 7], [1, 3, 5, 7], [1, 3, 5, 7]], 'resblock_kernel_sizes': [3, 7, 11], 'upsample_kernal_sizes': [20, 10, 4, 4], 'upsample_scales': [10, 5, 2, 2], 'use_weight_norm': True}, 'scheduler': {'params': {'gamma': 0.5, 'milestones': [200000, 400000, 600000, 800000]}, 'type': 'MultiStepLR'}}, 'MultiPeriodDiscriminator': {'optimizer': {'params': {'betas': [0.5, 0.9], 'lr': 0.0002, 'weight_decay': 0.0}, 'type': 'Adam'}, 'params': {'discriminator_params': {'bias': True, 'channels': 32, 'downsample_scales': [3, 3, 3, 3, 1], 'in_channels': 1, 'kernel_sizes': [5, 3], 'max_downsample_channels': 1024, 'nonlinear_activation': 'LeakyReLU', 'nonlinear_activation_params': {'negative_slope': 0.1}, 'out_channels': 1, 'use_spectral_norm': False}, 'periods': [2, 3, 5, 7, 11]}, 'scheduler': {'params': {'gamma': 0.5, 'milestones': [200000, 400000, 600000, 800000]}, 'type': 'MultiStepLR'}}, 'MultiScaleDiscriminator': {'optimizer': {'params': {'betas': [0.5, 0.9], 'lr': 0.0002, 'weight_decay': 0.0}, 'type': 'Adam'}, 'params': {'discriminator_params': {'bias': True, 'channels': 128, 'downsample_scales': [4, 4, 4, 4, 1], 'in_channels': 1, 'kernel_sizes': [15, 41, 5, 3], 'max_downsample_channels': 1024, 'max_groups': 16, 'nonlinear_activation': 'LeakyReLU', 'nonlinear_activation_params': {'negative_slope': 0.1}, 'out_channels': 1}, 'downsample_pooling': 'DWT', 'downsample_pooling_params': {'kernel_size': 4, 'padding': 2, 'stride': 2}, 'follow_official_norm': True, 'scales': 3}, 'scheduler': {'params': {'gamma': 0.5, 'milestones': [200000, 400000, 600000, 800000]}, 'type': 'MultiStepLR'}}}
allow_cache = True
audio_config = {'fmax': 8000.0, 'fmin': 0.0, 'hop_length': 200, 'max_norm': 1.0, 'min_level_db': -100.0, 'n_fft': 2048, 'n_mels': 80, 'norm_type': 'mean_std', 'num_workers': 16, 'phone_level_feature': True, 'preemphasize': False, 'ref_level_db': 20, 'sampling_rate': 16000, 'symmetric': False, 'trim_silence': True, 'trim_silence_threshold_db': 60, 'wav_normalize': True, 'win_length': 1000}
batch_max_steps = 9600
batch_size = 16
create_time = 2022-09-18 14:11:30
discriminator_grad_norm = -1
discriminator_train_start_steps = 0
eval_interval_steps = 10000
generator_grad_norm = -1
generator_train_start_steps = 1
git_revision_hash = 22ae438
log_interval_steps = 1000
model_type = hifigan
num_save_intermediate_results = 4
num_workers = 2
pin_memory = True
remove_short_samples = False
save_interval_steps = 20000
train_max_steps = 2500000
Loaded model parameters from ../funtts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/hifigan/ckpt/checkpoint_2000000.pth.
Removing weight norm...
Finished generation of 4 utterances (RTF = 0.310).
['out_xiaoyu/0_0_mel_gen.wav', 'out_xiaoyu/1_0_mel_gen.wav', 'out_xiaoyu/2_0_mel_gen.wav', 'out_xiaoyu/3_0_mel_gen.wav']
Text to wav finished!

Conda list:

git rev-parse HEAD:
8caf892

The text was updated successfully, but these errors were encountered:

XuWink · 2023-07-24T14:28:51Z

(/media/lab-hp/B23AB5DD3AB59F33/condaenv/maas) lab-hp@labhp-HP:~/桌面/KAN-TTS$ python ./kantts/bin/text_to_wav.py --txt test.txt --output_dir res --res_zip speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/resource.zip --am_ckpt speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/sambert/ckpt/checkpoint_980000.pth --voc_ckpt speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/hifigan/ckpt/checkpoint_2000000.pth --speaker xiaoyu
2023-07-24:22:10:22, INFO [text_to_wav.py:97] Converting text to symbols...
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
Load pinyin_en_mix_dict failed
text.cc: festival_Text_init
2023-07-24:22:10:26, INFO [text_to_wav.py:109] AM is infering...
2023-07-24:22:10:29, INFO [infer_sambert.py:198] Loading checkpoint: speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/basemodel_16k/sambert/ckpt/checkpoint_980000.pth
2023-07-24:22:10:29, INFO [infer_sambert.py:210] Inference sentence: 0_0
Could not load library libcudnn_cnn_infer.so.8. Error: /media/lab-hp/B23AB5DD3AB59F33/condaenv/maas/bin/../lib/libnvrtc.so: undefined symbol: nvrtcGetCUBIN
已放弃 (核心已转储)

你遇到过这个问题吗【Could not load library libcudnn_cnn_infer.so.8. Error: /media/lab-hp/B23AB5DD3AB59F33/condaenv/maas/bin/../lib/libnvrtc.so: undefined symbol: nvrtcGetCUBIN
已放弃 (核心已转储)】

airstillblue · 2023-07-25T06:53:38Z

没遇到过。。。

wangheqi1105 · 2023-09-15T07:44:01Z

我遇到了，也不知道怎么回事，但是也不出结果

SaltedSlark · 2023-09-22T03:19:03Z

ttsfrd 版本问题，重装个低版本 0.0.4，就好了。

stevin-dong · 2023-11-14T15:53:52Z

ttsfrd 版本问题，重装个低版本 0.0.4，就好了。

安装0.0.4后和tts-autolabel需要的版本又冲突了，怎么破？大佬

SaltedSlark · 2023-11-15T03:18:21Z

ttsfrd 版本问题，重装个低版本 0.0.4，就好了。

安装0.0.4后和tts-autolabel需要的版本又冲突了，怎么破？大佬
建两个conda环境可破之（推理的时候是用不到autolabel的，而预处理时是用不到ttsfrd的，你应该懂了）

stevin-dong · 2023-11-15T14:34:24Z

ttsfrd 版本问题，重装个低版本 0.0.4，就好了。

安装0.0.4后和tts-autolabel需要的版本又冲突了，怎么破？大佬
建两个conda环境可破之（推理的时候是用不到autolabel的，而预处理时是用不到ttsfrd的，你应该懂了）

感谢大佬，明了！只是即便是有这个错误Load pinyin_en_mix_dict failed，也能出来结果，不知道对结果是否影响，我先试试，感谢！

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

运行过程中，不断打印“Load pinyin_en_mix_dict failed” #68

运行过程中，不断打印“Load pinyin_en_mix_dict failed” #68

airstillblue commented Jun 29, 2023

XuWink commented Jul 24, 2023

airstillblue commented Jul 25, 2023

wangheqi1105 commented Sep 15, 2023

SaltedSlark commented Sep 22, 2023

stevin-dong commented Nov 14, 2023

SaltedSlark commented Nov 15, 2023

stevin-dong commented Nov 15, 2023

运行过程中，不断打印“Load pinyin_en_mix_dict failed” #68

运行过程中，不断打印“Load pinyin_en_mix_dict failed” #68

Comments

airstillblue commented Jun 29, 2023

SambertHifigan语音合成-中文-多人预训练-16k

git clone -b pretrain http://www.modelscope.cn/speech_tts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k.git

speaker_list: 'F7,F74,FBYN,FRXL,M7,xiaoyu'} all except M7 are female

Name Version Build Channel

XuWink commented Jul 24, 2023

airstillblue commented Jul 25, 2023

wangheqi1105 commented Sep 15, 2023

SaltedSlark commented Sep 22, 2023

stevin-dong commented Nov 14, 2023

SaltedSlark commented Nov 15, 2023

stevin-dong commented Nov 15, 2023