The dataset sounds strange #42

Open
JohnFengNeumann opened this issue Dec 27, 2024 · 1 comment

Comments

@JohnFengNeumann

The dataset at the link provided under speech separation sounds odd. Can it be used for normal training? Is there a link to a proper WSJ0 dataset?
https://www.kaggle.com/datasets/sonishmaharjan555/wsj0-2mix

@alibabasglab
Collaborator

alibabasglab commented Jan 8, 2025

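Sorry about that. We did find that the speaking rate in this data is wrong. WSJ0 is copyrighted and has to be purchased online before it can be used.
If you do not have the WSJ0 dataset, we suggest using the small pre-mixed MiniLibriMix dataset: https://zenodo.org/records/3871592
Alternatively, you can generate LibriMix from LibriSpeech. For the detailed steps, please see our latest instructions: https://github.com/modelscope/ClearerVoice-Studio/blob/main/train/speech_separation/README.md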
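
For anyone who wants to sanity-check a downloaded set for a speed problem like the one described above, one quick check is to read each file's WAV header and compare the declared sampling rate and duration with what you expect. This is only a minimal sketch, assuming uncompressed PCM WAV files; `describe_wav` is a hypothetical helper written for illustration and is not part of ClearerVoice-Studio.

```python
import wave


def describe_wav(path):
    """Return (sample_rate_hz, channels, duration_seconds) read from a PCM WAV header.

    Hypothetical helper for illustration only. It inspects the header and does
    not decode or listen to the audio, so it can only reveal a mismatch between
    the declared sampling rate and the rate you expected.
    """
    with wave.open(path, "rb") as wf:
        rate = wf.getframerate()      # declared samples per second
        channels = wf.getnchannels()  # 1 = mono, 2 = stereo
        frames = wf.getnframes()      # number of sample frames in the file
    return rate, channels, frames / rate
```

If the reported rate differs from the sampling rate your training configuration expects, or the durations look implausibly short or long for the utterances, the audio may have been resampled or relabeled incorrectly, which is one possible cause of speech that sounds too fast or too slow.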
