Mismatch in load_csv in four batches in cpg0036-EU-OS-bioactives #139

Open
migunasekera opened this issue Nov 21, 2024 · 0 comments

Dataset: cpg0036-EU-OS-bioactives
Thank you very much for sharing this dataset publicly. I would like to report a mismatch in the paths provided in the load_data folder: for four batches from the IMTM site, the file names listed in the load_data.csv files do not match the file names in the S3 bucket:

Site  Metadata_Batch           instances
IMTM  2022_03_29_Batch4_HepG2  17272
IMTM  2022_03_30_Batch5_HepG2  17275
IMTM  2022_03_31_Batch6_HepG2  17272
IMTM  2022_05_20_Batch7_HepG2  3455

As an example, if you aggregate the load_data.csv files for all sites into a single DataFrame and select the following:

# final_dataset_df: the load_data.csv files from all sites, concatenated
batch_filt = final_dataset_df['Metadata_Batch'] == '2022_03_30_Batch5_HepG2'
well_filt = final_dataset_df['Metadata_Well'] == 'A01'
fov_filt = final_dataset_df['Metadata_Site'] == 7  # field of view
plate_filt = final_dataset_df['Metadata_Plate'] == 'B1001_R3'
# first matching DNA-channel file name
final_dataset_df.loc[batch_filt & well_filt & fov_filt & plate_filt, 'FileName_OrigDNA'].iloc[0]

You would find the following PathName and FileName:

  • /home/ubuntu/bucket/cpg0036-EU-OS-bioactives/IMTM/images/2022_03_30_Batch5_HepG2/images/Bioactives_1_plate_3
  • Bioactives_1_plate_3_A01_T0001F007L01A01Z01C01.tif

However, the file at this location in the S3 bucket is named:

  • Bioactives_1_plate_3_A01_T0001F007L01A04Z01C01.tif

The two names differ only in the A01 / A04 token in the middle. For now, would you suggest matching on the leading A01_T0001F007 and the trailing C01, and ignoring the parts in between?
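
To make the proposed workaround concrete, here is a minimal sketch of the matching I have in mind. The filename layout in the regular expression is only inferred from the two names quoted above, and `match_key` and `build_lookup` are hypothetical helper names, not part of any existing tooling for this dataset.

```python
import re

# Assumed layout, inferred from the two example names in this issue:
#   <plate prefix>_<well>_T####F###L##A##Z##C##.tif
# The L/A/Z tokens in the middle are deliberately not captured.
PATTERN = re.compile(
    r"^(?P<prefix>.+)_(?P<well>[A-Z]\d{2})_"
    r"T(?P<time>\d{4})F(?P<field>\d{3})"
    r"L\d{2}A\d{2}Z\d{2}"
    r"C(?P<channel>\d{2})\.tif$"
)


def match_key(filename):
    """Return a key that ignores the middle tokens, or None if the name does not fit."""
    m = PATTERN.match(filename)
    if m is None:
        return None
    return (m["prefix"], m["well"], m["time"], m["field"], m["channel"])


def build_lookup(filenames):
    """Map key -> filename, refusing to guess when two files share a key."""
    lookup = {}
    for name in filenames:
        key = match_key(name)
        if key is None:
            continue  # skip names that do not follow the assumed layout
        if key in lookup:
            raise ValueError(f"ambiguous key {key}: {lookup[key]} vs {name}")
        lookup[key] = name
    return lookup


csv_name = "Bioactives_1_plate_3_A01_T0001F007L01A01Z01C01.tif"
s3_name = "Bioactives_1_plate_3_A01_T0001F007L01A04Z01C01.tif"

s3_lookup = build_lookup([s3_name])
resolved = s3_lookup.get(match_key(csv_name))
print(resolved)
```

This only works if each well, field and channel has a single file in the bucket; if several L/A/Z combinations exist for the same key, `build_lookup` raises rather than picking one silently.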
