MPD TRACK DATASET 🎵

This dataset contains every unique track extracted from the Million Playlist Dataset, with audio features added to each track using the Spotify API.

Total Unique Songs: 2,261,644

Usage of the dataset

The dataset is provided as an SQL table. You can find the data on Kaggle here. Use it in accordance with Spotify's policies. We are not responsible for anything you do with this dataset.
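As a rough sketch of how the table could be queried once loaded, the example below uses Python's built-in `sqlite3` module. It assumes a SQLite database; the table name `tracks`, the column names, and the inserted row are hypothetical placeholders, not the dataset's actual schema or data. Check the Kaggle files for the real names before adapting it.

```python
import sqlite3

# Assumption: the data is loaded into SQLite. An in-memory database with a
# hypothetical schema stands in for the real file here.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE tracks (track_uri TEXT PRIMARY KEY, danceability REAL, energy REAL)"
)

# Placeholder row for illustration only; not real data from the dataset.
conn.execute(
    "INSERT INTO tracks VALUES (?, ?, ?)",
    ("spotify:track:EXAMPLE", 0.5, 0.7),
)


def features_for(connection: sqlite3.Connection, track_uri: str):
    """Return (danceability, energy) for one track, or None if it is absent."""
    cursor = connection.execute(
        "SELECT danceability, energy FROM tracks WHERE track_uri = ?",
        (track_uri,),
    )
    return cursor.fetchone()


row = features_for(conn, "spotify:track:EXAMPLE")
missing = features_for(conn, "spotify:track:UNKNOWN")
```

With the real database file, `sqlite3.connect(":memory:")` would be replaced by a connection to that file, and the `CREATE TABLE` and `INSERT` statements would be dropped.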

Replicate the extraction

  1. Download the Million Playlist Dataset from here.
  2. Copy the data folder into the root project folder.
  3. Run the scripts in this order: per_json.py -> sql_data.py -> easy_extraction.py.
  4. The last step will take weeks because of Spotify's API rate limits.
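To illustrate why the final step is slow, here is a minimal sketch of rate-limit-aware batching. It is not the code in easy_extraction.py. It assumes that audio features are requested in batches of up to 100 track IDs and that a rate-limited request returns HTTP 429 with a wait time in seconds. The `fetch` callable and its `(status, retry_after, payload)` return shape are hypothetical, chosen so the sketch runs without network access.

```python
import time
from typing import Callable, Iterator, List, Sequence, Tuple

BATCH_SIZE = 100  # assumed maximum number of track IDs per request

# Hypothetical shape: (HTTP status, seconds to wait if rate limited, payload)
FetchResult = Tuple[int, float, list]


def batched(ids: Sequence[str], size: int = BATCH_SIZE) -> Iterator[List[str]]:
    """Yield consecutive slices of at most `size` IDs."""
    for start in range(0, len(ids), size):
        yield list(ids[start:start + size])


def fetch_with_retry(
    fetch: Callable[[List[str]], FetchResult],
    batch: List[str],
    max_attempts: int = 5,
    sleep: Callable[[float], None] = time.sleep,
) -> list:
    """Call `fetch` for one batch, waiting and retrying when rate limited."""
    for _ in range(max_attempts):
        status, retry_after, payload = fetch(batch)
        if status == 200:
            return payload
        if status == 429:
            # Rate limited: wait for the advertised interval, then try again.
            sleep(retry_after)
            continue
        raise RuntimeError(f"unexpected status {status}")
    raise RuntimeError("still rate limited after max_attempts tries")
```

At 100 tracks per request, 2,261,644 tracks need about 22,617 requests, so any enforced waiting between requests adds up quickly.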

Future plans

The Million Playlist Dataset only includes songs up to 2018. We plan to look for more songs from 2018 to 2024.