Mismatch in load_csv in four batches in cpg0036-EU-OS-bioactives #139

migunasekera · 2024-11-21T00:58:42Z

Dataset: cpg0036-EU-OS-bioactives
Thank you very much for sharing this dataset in the public. I wanted to report a mismatch in the paths provided in the load_data folder. There appears to be an issue with associating the load_data.csv files for four batches in the IMTM site with the S3 bucket filenames:

Site	Metadata_Batch	instances
IMTM	2022_03_29_Batch4_HepG2	17272
	2022_03_30_Batch5_HepG2	17275
	2022_03_31_Batch6_HepG2	17272
	2022_05_20_Batch7_HepG2	3455

As an example, if you were to aggregate the load_csv files for all the sites, and select the following:

## aggregate final_dataset_df from the load_csv platemaps
batch_filt = (final_dataset_df['Metadata_Batch']=='2022_03_30_Batch5_HepG2')
well_filt = final_dataset_df['Metadata_Well'] == 'A01'
fov_filt = final_dataset_df['Metadata_Site'] == 7
plate_filt = final_dataset_df['Metadata_Plate']=='B1001_R3'
final_dataset_df.loc[batch_filt & well_filt & fov_filt & plate_filt,'FileName_OrigDNA'].iloc[0]

You would find the following PathName and FileName:

/home/ubuntu/bucket/cpg0036-EU-OS-bioactives/IMTM/images/2022_03_30_Batch5_HepG2/images/Bioactives_1_plate_3
Bioactives_1_plate_3_A01_T0001F007L01A01Z01C01.tif

However, on the S3 bucket, at this location, this is the FileName:

Bioactives_1_plate_3_A01_T0001F007L01A04Z01C01.tif

Would you suggest for now, to match on the following A01_T0001F007 and C01 at the end, and ignore the parts in the middle?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mismatch in load_csv in four batches in cpg0036-EU-OS-bioactives #139

Mismatch in load_csv in four batches in cpg0036-EU-OS-bioactives #139

migunasekera commented Nov 21, 2024 •

edited

Loading

Mismatch in load_csv in four batches in cpg0036-EU-OS-bioactives #139

Mismatch in load_csv in four batches in cpg0036-EU-OS-bioactives #139

Comments

migunasekera commented Nov 21, 2024 • edited Loading

migunasekera commented Nov 21, 2024 •

edited

Loading