Add fma keys dataset #625

stellaywong · 2024-03-07T02:45:27Z

This adds support for the FMA Keys dataset, which is a new dataset for the evaluation of key detection containing 340 hours (5489 songs) of song-level key and mode annotations, spread across 17 genres.

genisplaja

Hello @stellaywong, sorry for the late late late reply... we were very busy with soundata and its corresponding JOSS submission. Thank you for your contribution to mirdata :) We are targeting a release for next month, and we would like this loader to be included, glad to help on that! The PR is really good, it only needs a bit of refinement, the module docstring, and the documentation (find here the instructions for that!). Let us know if you need any help!

genisplaja · 2024-10-12T07:50:29Z

mirdata/datasets/fma_keys.py

+    # librosa has problems reading FMA mp3s without clamping down to the second.
+    duration = librosa.get_duration(path=fhandle)
+    return librosa.load(fhandle, sr=None, mono=True, duration=floor(duration))


wow is that so? Can we track this error and see why it happens? I'll take a look. Nice workaround for now :)

Yes, I'll dig more into it. We didn't see this with other mp3s, just with ones coming from the FMA dataset.

Thanks! Let me know if you want us to also take a look at that issue :)
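For anyone following the thread, here is a minimal standalone sketch of the workaround in the diff above. The function names `clamp_to_whole_seconds` and `load_audio_clamped` are hypothetical, not part of mirdata, and the sketch assumes a librosa version where `get_duration` accepts the `path` keyword, as the diff does. It only illustrates the clamping idea and does not explain why the FMA mp3s fail to load without it.

```python
from math import floor


def clamp_to_whole_seconds(duration_seconds):
    """Round a duration down to a whole number of seconds."""
    return floor(duration_seconds)


def load_audio_clamped(audio_path):
    """Load an audio file, reading only up to the last whole second.

    Hypothetical helper mirroring the workaround in the diff. librosa is
    imported lazily so the clamping helper can be used without it.
    """
    import librosa

    # Ask librosa for the file's duration, then request only the
    # whole-second part of it when decoding.
    duration = librosa.get_duration(path=audio_path)
    return librosa.load(
        audio_path,
        sr=None,  # keep the file's native sample rate
        mono=True,
        duration=clamp_to_whole_seconds(duration),
    )
```

One thing to keep in mind with this approach: a clip shorter than one second is clamped to a duration of zero, so it may be worth checking whether the dataset contains any such files.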

tests/datasets/test_fma_keys.py

magdalenafuentes · 2024-10-15T16:51:54Z

@stellaywong thanks for contributing to mirdata! Just FYI, we're about to make a release for ISMIR in a couple of weeks. Do you have time to address these changes before then? It would be great to include this loader in the release! If you don't have the time, don't worry, but please let us know.

stellaywong · 2024-10-26T02:59:33Z

Thank you for the reviews! I've made the changes and pushed the updates.

It looks like the PR build is stuck waiting for status to be reported? Do you know how we can kick start the build?

genisplaja · 2024-10-31T15:31:55Z

Hey @stellaywong, thanks for the update :) Sorry that I was afk for a few days. OK, that looks quite good, you are missing a few things that I am gonna list in my next review, but the loader looks good! Hope that you will have some time to do the final updates... thanks for the patience!

genisplaja

Alright, getting way closer! Thanks again for your patience and collaboration :) I just left a few notes on the things your PR is missing. Please let me know if you have any questions. Thanks @stellaywong!

genisplaja · 2024-10-31T15:42:29Z

mirdata/datasets/fma_keys.py

+        return jams_utils.jams_converter(
+            metadata=self._track_metadata,
+        )


Your jams_converter call is missing audio_path, so it cannot compute the needed attributes of the corresponding audio. See this example:

    return jams_utils.jams_converter(
        audio_path=self.audio_path,
        metadata={
            "instrument": self.instrument,
            "genre": self.genre,
            "drum": self.drum,
            "train": self.train,
        },
    )

That will be needed for the tests to pass!
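Applied to this loader, the change could look roughly like the sketch below. The helper names `jams_kwargs` and `to_jams` are hypothetical and written as free functions for illustration only; in the PR this would live in the Track class and use `self.audio_path` and `self._track_metadata`. It assumes a mirdata version that still ships `jams_utils.jams_converter` with `audio_path` and `metadata` keyword arguments, as in the example above.

```python
def jams_kwargs(audio_path, track_metadata):
    """Collect the keyword arguments to pass to jams_converter.

    Hypothetical helper: it simply pairs the audio path with a copy of the
    track metadata so the converter can read the audio's attributes.
    """
    return {"audio_path": audio_path, "metadata": dict(track_metadata)}


def to_jams(audio_path, track_metadata):
    """Build a JAMS object for one track (sketch, not the PR's code)."""
    # Imported lazily so the helper above works without mirdata installed.
    from mirdata import jams_utils

    return jams_utils.jams_converter(**jams_kwargs(audio_path, track_metadata))
```

The only substantive difference from the diff under review is that `audio_path` is passed alongside `metadata`.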

genisplaja · 2024-10-31T15:46:12Z

mirdata/datasets/fma_keys.py

+"""
+FMA Keys Dataset Loader
+
+.. admonition:: Dataset Info
+    :class: dropdown


Alright, the docstring now looks quite good. However, you need to update some files in the documentation source to make the dataset visible to users.

docs/source/mirdata.rst: here, you need to add your dataset as it is done for the rest!

docs/source/table.rst: here, you need to add your dataset as it is done for the rest as well. Add the number of tracks, available annotations, data availability, licensing, etc.

docs/source/quick_reference.rst: if your dataset has an annotation type that is not yet listed in the quick reference, please add it there, so that users can click the annotation in the table and read more about it!

genisplaja · 2024-10-31T15:46:52Z

mirdata/datasets/fma_keys.py

+        spotify_uri (str): Spotify URI if available
+        key (str): path to the track's audio file
+        mode (str): path to the track's audio file
+        key_number (int): path to the track's audio file
+        mode_number (int): path to the track's audio file
+        audio (str): path to the track's audio file


Most of them seem to have "path to the track's audio file" as the description. Please update! Also, please change audio to audio_path.
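Copy-pasted descriptions like these are easy to miss by eye. As a rough aid, the sketch below scans a docstring for attribute lines of the form `name (type): description` and reports any description shared by more than one attribute. The function name `duplicated_descriptions` and the regular expression are assumptions made for this illustration; nothing like it is part of mirdata.

```python
import re
from collections import defaultdict

# Matches attribute lines such as "key (str): some description".
ATTR_LINE = re.compile(r"^\s*(\w+)\s*\(([^)]*)\):\s*(.+?)\s*$")


def duplicated_descriptions(docstring):
    """Map each repeated description to the attribute names that share it."""
    names_by_description = defaultdict(list)
    for line in docstring.splitlines():
        match = ATTR_LINE.match(line)
        if match:
            name, _type, description = match.groups()
            names_by_description[description].append(name)
    # Keep only descriptions used by two or more attributes.
    return {
        description: names
        for description, names in names_by_description.items()
        if len(names) > 1
    }


# The attribute lines quoted in the diff above.
DOCSTRING = """
        spotify_uri (str): Spotify URI if available
        key (str): path to the track's audio file
        mode (str): path to the track's audio file
        key_number (int): path to the track's audio file
        mode_number (int): path to the track's audio file
        audio (str): path to the track's audio file
"""

for description, names in duplicated_descriptions(DOCSTRING).items():
    print(f"{description!r} is shared by: {', '.join(names)}")
```

Run on the lines from the diff, this flags the five attributes that share the audio-path description; only one of them should keep it.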

genisplaja · 2024-10-31T15:47:20Z

mirdata/datasets/fma_keys.py

+    # librosa has problems reading FMA mp3s without clamping down to the second.
+    duration = librosa.get_duration(path=fhandle)
+    return librosa.load(fhandle, sr=None, mono=True, duration=floor(duration))


Thanks! Let me know if you want us to also take a look at that issue :)

genisplaja requested changes Oct 12, 2024

stellaywong force-pushed the master branch from 2947a82 to c73ccc0 Compare October 25, 2024 03:17

Add fma keys dataset

c347049

stellaywong force-pushed the master branch from c73ccc0 to c347049 Compare October 25, 2024 03:33

Merge branch 'master' into master

8ea338b

genisplaja requested changes Oct 31, 2024

genisplaja and others added 2 commits November 2, 2024 18:33

Merge branch 'master' into master

971c939

Merge branch 'master' into master

361723f
