Add dataset_id to tracks #36

TamaraNaboulsi · 2025-05-01T16:13:57Z

Description

A dataset_id field was added to the track model to hold the UUID of the dataset that a track datafile belongs to. The DB migration has been applied to dev with a default value for dataset_id. It still needs to be applied to staging and prod.

Review App URL(s)

http://add-dataset-id.review.ensembl.org

Knowledge Base

To apply the DB migration, the appropriate DB parameters must be defined, either set manually as environment variables or exported from a file. An example of these parameters can be seen in the .env file. The following commands are then run to apply the migration:

python manage.py makemigrations
python manage.py migrate
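
The "exported using a file" option can be sketched in Python as below. This is only an illustration, not the project's tooling: the helper name `load_env_file` and the assumption that the file contains plain `KEY=VALUE` lines are not from this PR, and the actual parameter names are whatever the repository's .env file defines.

```python
import os
from pathlib import Path


def load_env_file(path):
    """Read KEY=VALUE lines from a file and export them into os.environ.

    Hypothetical helper: blank lines, lines starting with '#', and lines
    without '=' are skipped; quotes around a value are removed.
    """
    loaded = {}
    for raw_line in Path(path).read_text().splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        value = value.strip().strip("'\"")
        os.environ[key] = value
        loaded[key] = value
    return loaded
```

Variables set this way are visible only to the current process and its children, so the migration commands would have to be launched from that same process (for example with `subprocess.run`) rather than from a separate shell.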

Checklist

Black formatting
Tests

veidenberg

Looks good to me.
Notes:

DB migrate commands need to be applied to staging and prod after the PR is merged.
Track API loading script needs an update to include dataset IDs in track submission payloads.
Current tracks have placeholder dataset IDs, which can be updated later.
Ensembl-client needs to be updated to add dataset IDs to Track API requests.

Mehrnaz-Charkhchi

How is TrackAPI going to handle the tracks that GB needs to use for the latest release?
For example, Release 6 introduces new datasets for regulation tracks, while older versions of these regulation tracks (with previous datasets) already exist. After loading the data into TrackAPI, we'll have tracks of the same type but with different dataset UUIDs.

How will TrackAPI determine which track version (dataset) should be used for the latest release?
Is this logic handled within TrackAPI, or will the FE manage the selection?

Mehrnaz-Charkhchi

One more question: do we have tracks that won't have dataset UUIDs? One example I can think of is GC tracks. Do we need to allow null for dataset UUIDs?

veidenberg · 2025-05-08T13:42:53Z

@Mehrnaz-Charkhchi
Tracks are requested from Track API by Ensembl client, which should pass a dataset ID together with the genome ID. That means the track_categories endpoint needs to be updated to accept and filter by a dataset ID. Note: if no dataset ID is provided and a genome ID has tracks with multiple dataset IDs, it currently returns all versions of the track. Not sure how to work around that.
Making dataset IDs nullable sounds better than the placeholder UUIDs (for tracks without a dataset ID).
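
The filtering behaviour described above could look roughly like the sketch below. It uses plain Python objects rather than the real Track API models; the `Track` dataclass, its field names other than dataset_id, and the `filter_tracks` function are hypothetical stand-ins for illustration only.

```python
from dataclasses import dataclass
from typing import Optional
from uuid import UUID


@dataclass
class Track:
    # Hypothetical stand-in for the real track model.
    label: str
    genome_id: UUID
    dataset_id: Optional[UUID] = None


def filter_tracks(tracks, genome_id, dataset_id=None):
    """Return tracks for a genome, optionally narrowed to one dataset.

    Without a dataset_id, every version of a track for the genome is
    returned, which mirrors the current behaviour noted above.
    """
    matches = [t for t in tracks if t.genome_id == genome_id]
    if dataset_id is not None:
        matches = [t for t in matches if t.dataset_id == dataset_id]
    return matches
```

In this sketch a track with no dataset ID is dropped whenever a dataset ID filter is supplied. Whether such tracks (GC tracks, for instance) should still be returned in that case is a design decision this sketch does not settle.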

veidenberg

Some updates from the feedback:

Add dataset_id parameter to track_categories endpoint
Make dataset_id field nullable
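
With a nullable dataset_id, incoming values need to distinguish "no dataset" from a malformed ID. A minimal sketch of that normalisation, assuming the value arrives as a string or null in a request payload (the function name `parse_dataset_id` is hypothetical and not part of this PR):

```python
from typing import Optional
from uuid import UUID


def parse_dataset_id(value) -> Optional[UUID]:
    """Convert a payload value to a UUID, treating None or '' as no dataset."""
    if value is None or value == "":
        return None
    # uuid.UUID raises ValueError when the string is not a well-formed UUID.
    return UUID(str(value))
```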

TamaraNaboulsi and others added 2 commits May 1, 2025 16:02

Added dataset_id to tracks

2552e0a

Add migration

11a444d

TamaraNaboulsi requested review from veidenberg and azangru May 1, 2025 16:14

veidenberg requested a review from Mehrnaz-Charkhchi May 8, 2025 11:33

veidenberg approved these changes May 8, 2025

Mehrnaz-Charkhchi reviewed May 8, 2025

veidenberg requested changes May 8, 2025

Mehrnaz-Charkhchi requested review from dpopleton and vinay-ebi May 14, 2025 08:50

Add is_current and release to tracks

721e96c
