Downloading AudioCaps data #36
Comments
You need to use something like youtube-dl to download the audio files from YouTube. Alternatively, you can download the WavCaps dataset and extract the audio files from its zip files; WavCaps includes AudioCaps among several other datasets. WavCaps already provides a ChatGPT-generated caption for each audio file, but you can probably map the audio files back to the original AudioCaps captions using the filenames.
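For the youtube-dl route, here is a rough sketch of what a download script could look like. It is not an official Tango or AudioCaps script. It assumes the AudioCaps CSV files have `youtube_id` and `start_time` columns and that each clip is 10 seconds long, and it assumes `youtube-dl` and `ffmpeg` are installed and on your PATH. The function names, the `audiocaps_wav` output folder, and the `train.csv` path are placeholders of mine.

```python
import csv
import os
import subprocess

CLIP_SECONDS = 10  # assumed clip length; check against the dataset description


def read_rows(lines):
    """Yield (youtube_id, start_time) pairs from AudioCaps-style CSV lines.

    Assumes the header contains `youtube_id` and `start_time` columns.
    """
    for row in csv.DictReader(lines):
        yield row["youtube_id"], int(row["start_time"])


def build_commands(youtube_id, start_time, out_dir="audiocaps_wav"):
    """Return the (download, trim) command lists for one clip, without running them."""
    full_path = f"{out_dir}/{youtube_id}_full.wav"
    clip_path = f"{out_dir}/{youtube_id}_{start_time}.wav"
    download = [
        "youtube-dl",
        "-x",                      # extract audio only
        "--audio-format", "wav",   # convert the extracted audio to wav
        "-o", f"{out_dir}/{youtube_id}_full.%(ext)s",
        f"https://www.youtube.com/watch?v={youtube_id}",
    ]
    trim = [
        "ffmpeg", "-y",
        "-ss", str(start_time),    # seek to the clip start (seconds)
        "-t", str(CLIP_SECONDS),   # keep only the clip duration
        "-i", full_path,
        clip_path,
    ]
    return download, trim


def download_clip(youtube_id, start_time, out_dir="audiocaps_wav"):
    """Download and trim one clip; return True on success, False otherwise."""
    os.makedirs(out_dir, exist_ok=True)
    download, trim = build_commands(youtube_id, start_time, out_dir)
    try:
        subprocess.run(download, check=True)
        subprocess.run(trim, check=True)
    except subprocess.CalledProcessError:
        return False  # e.g. the video was removed or is unavailable
    os.remove(f"{out_dir}/{youtube_id}_full.wav")  # drop the untrimmed file
    return True


if __name__ == "__main__":
    # "train.csv" is a placeholder path to one of the AudioCaps CSV splits.
    with open("train.csv", newline="") as f:
        for yt_id, start in read_rows(f):
            if not download_clip(yt_id, start):
                print(f"skipped {yt_id}")
```

Some videos will likely no longer be available, so the script skips failures instead of stopping. You would still need to convert the result into whatever format the Tango training code expects.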
Hi,
I'm trying to download the AudioCaps data in order to train the Tango model. However, I'm not seeing any instructions in the AudioCaps repository on how to download it. Can you share any scripts or instructions on how to download and format the audio to train Tango?
Thanks!