🐛 Bug: Abnormal audio when calling f5-tts through the API #1062

Open
irrikrlla opened this issue Nov 22, 2024 · 3 comments
Labels
bug Something isn't working

Comments

@irrikrlla

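Operating system

Windows

Python version

3.11.9

AI Vtuber version

v0.29 20240418 #aba008d.... (main, updated last week)

Describe the problem

When f5-tts is called through AI Vtuber, it behaves strangely: the output contains many abnormal sounds that are not in the text. Operating f5-tts directly does not have this problem.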
音频对比.zip (audio comparison)

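Steps to reproduce

As described in the title.

Expected result

f5-tts should synthesize audio that matches the supplied text, without large amounts of extraneous noise.

Screenshots or logs

The backend logs do not show where the problem is.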

@irrikrlla irrikrlla added the bug Something isn't working label Nov 22, 2024
@Ikaros-521
Owner

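I don't see a specific screenshot, so my blind guess is that it's an f5 version issue. Calling through gradio and operating the webui should, in theory, behave the same.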

@irrikrlla
Author

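Screenshot 2024-12-08 145225
I ran some more tests today. Arrow 1 is the output straight from f5-tts, and arrow 2 is the result of calling it through luna. I found that the luna side splits the text at punctuation marks, synthesizes each piece separately, and then stitches the pieces back together. Perhaps a "leak" at one of those steps is what breaks the final audio. Thank you very much for taking the time to reply.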
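
To make the split, synthesize, and stitch flow described above concrete, here is a rough Python sketch. It is not AI-Vtuber's actual code: the punctuation set, the function names, and the `tts` callable (which stands in for one f5-tts request and returns a list of audio samples) are all assumptions for illustration.

```python
import re
from typing import Callable, List

# Hypothetical punctuation set; the real splitter may use a different one.
_PUNCT = "，。！？；,.!?;"
# Zero-width match right after any punctuation mark, so the mark stays
# attached to the piece it ends.
_SPLIT_RE = re.compile(f"(?<=[{re.escape(_PUNCT)}])")


def split_by_punctuation(text: str) -> List[str]:
    """Split text after each punctuation mark, dropping empty pieces."""
    return [p.strip() for p in _SPLIT_RE.split(text) if p.strip()]


def synthesize_in_pieces(text: str, tts: Callable[[str], List[float]]) -> List[float]:
    """Call the TTS backend once per piece and concatenate the samples."""
    samples: List[float] = []
    for piece in split_by_punctuation(text):
        # Each piece is a separate request, so the backend never sees
        # the full sentence as context.
        samples.extend(tts(piece))
    return samples
```

With this shape, a sentence with three punctuation marks turns into three independent synthesis requests, so any oddity in how a single short piece is synthesized ends up in the stitched result.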

@Ikaros-521
Owner

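The program has an audio text-splitting feature that, by default, splits sentences at punctuation marks. If you don't need it, you can turn the text-splitting feature off.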
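
As a rough illustration of what switching that feature off changes, the sketch below plans the TTS requests for one sentence with splitting on and off. The config key `split_enabled` and the function `plan_requests` are hypothetical names, not the project's real settings or code.

```python
import re
from typing import Dict, List


def plan_requests(text: str, config: Dict[str, bool]) -> List[str]:
    """Return the list of strings that would each be sent to the TTS backend."""
    # "split_enabled" is a made-up key standing in for the real option.
    if not config.get("split_enabled", True):
        return [text]  # splitting off: the whole text goes out in one request
    pieces = re.split(r"(?<=[，。！？,.!?])", text)
    return [p.strip() for p in pieces if p.strip()]


text = "今天天气不错，我们出去走走吧。"
print(len(plan_requests(text, {"split_enabled": True})))   # one request per piece
print(len(plan_requests(text, {"split_enabled": False})))  # a single request
```

With splitting off, the request matches what the f5-tts webui receives when the same text is entered directly, which makes the two outputs easier to compare.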
