Downloading AudioCaps data #36

fyell · 2023-10-12T03:06:40Z

Hi,

I'm trying to download the AudioCaps data in order to train the Tango model. However, I'm not seeing any instructions in the AudioCaps repository on how to download it. Can you share any scripts or instructions on how to download and format the audio to train Tango?

Thanks!

deepanwayx · 2023-10-21T02:46:06Z

You need to use something like youtube-dl to download the audio files from youtube.

Otherwise, you may want to download the WavCaps dataset and extract the audios from the zip files. This will include AudioCaps among several other datasets.

The dataset already provides a ChatGPT-generated caption for each audio file, but you can probably map the audio files to the original AudioCaps captions using the filenames.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Downloading AudioCaps data #36

Downloading AudioCaps data #36

fyell commented Oct 12, 2023

deepanwayx commented Oct 21, 2023

Downloading AudioCaps data #36

Downloading AudioCaps data #36

Comments

fyell commented Oct 12, 2023

deepanwayx commented Oct 21, 2023