Clarify why some compounds have multiple replicates #85
Hi @ChenyuWang-Monica, my answers are below
We have had some issues matching InChIKeys between what we previously released in the JUMP-Target repo and what we released in this repo. But I can confirm that JCP2022_025848 is dexamethasone. The mapping between JCP2022 IDs and compound names is below.
Thanks for bringing this to our attention. I believe this is a metadata issue. Most of these wells come from a single source (source_9), and all the wells are in columns 1, 24, 25, or 48. @shntnu, you noticed the number of replicates in #30 (comment), but I don't know whether we flagged this as a metadata error.
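For anyone who wants to check the source and well-column pattern themselves, here is a minimal sketch. It assumes the well-level metadata has been loaded into a pandas DataFrame with columns named `Metadata_Source`, `Metadata_Well`, and `Metadata_JCP2022`; those column names, the helper `summarize_well_positions`, and the toy rows are assumptions for illustration, not taken from this thread.

```python
import pandas as pd


def summarize_well_positions(wells, jcp_ids):
    """Count wells per (source, plate column) for the given compound IDs.

    Assumes well names end in a numeric column index, e.g. "A01" or "AF48".
    """
    subset = wells[wells["Metadata_JCP2022"].isin(jcp_ids)].copy()
    # Pull the trailing digits out of the well name to get the plate column.
    subset["well_column"] = (
        subset["Metadata_Well"].str.extract(r"(\d+)$", expand=False).astype(int)
    )
    return (
        subset.groupby(["Metadata_Source", "well_column"])
        .size()
        .rename("n_wells")
        .reset_index()
    )


# Toy data only: made-up IDs and wells, not real JUMP metadata.
toy_wells = pd.DataFrame(
    {
        "Metadata_Source": ["source_9", "source_9", "source_9", "source_9", "source_1", "source_1"],
        "Metadata_Well": ["A01", "B24", "C25", "D48", "E05", "E06"],
        "Metadata_JCP2022": ["toy_A", "toy_A", "toy_A", "toy_A", "toy_A", "toy_B"],
    }
)
position_summary = summarize_well_positions(toy_wells, ["toy_A"])
print(position_summary)
```

If the extra replicates really are a plate-layout artifact, the summary should be dominated by one source and a handful of edge columns.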
In general, most compounds should have five replicates, but there are some exceptions and I have listed some of them below.
Indeed, not sure why this was the case. I'll follow up in that internal issue and loop back here.
While counting the replicates of each compound in the COMPOUND plates, I came across a few questions:
The top ten compounds have >6000 replicates. Among them are DMSO, the empty well (JCP2022_999999), and 8 positive controls. However, when I compare the InChIKeys of the 8 positive controls with those given in https://github.com/jump-cellpainting/JUMP-Target/tree/master#positive-control-compounds, one of them disagrees: JCP2022_025848 (GJFCONYVAUNLKB-UHFFFAOYSA-N) has 8127 replicates but is not listed as a positive control, while dexamethasone (UREBDLICKHMUKA-CXSFZGCWSA-N), which is listed as a positive control, doesn't appear in the metadata file compound.csv.gz.
The 11th-ranked compound, JCP2022_033954, has 1594 replicates. Is it also a positive control? If not, what is its purpose?
There are many compounds with multiple replicates (for example, more than 10 but fewer than 60). Why do they have many more replicates than the common case mentioned in the paper (i.e. about 5)?
Thanks!
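For reference, the replicate counting described above can be sketched roughly as follows. This is only an illustration under assumptions: it supposes the plate-level and well-level metadata are already loaded as pandas DataFrames with columns `Metadata_Source`, `Metadata_Plate`, `Metadata_PlateType`, and `Metadata_JCP2022`. Those column names, the helper `count_replicates`, and the toy rows are hypothetical and not confirmed in this thread.

```python
import pandas as pd


def count_replicates(wells, plates, plate_type="COMPOUND"):
    """Count wells per compound ID, restricted to plates of one plate type."""
    keys = ["Metadata_Source", "Metadata_Plate"]
    # Keep only the plates of the requested type.
    selected = plates.loc[plates["Metadata_PlateType"] == plate_type, keys]
    # An inner join drops wells that sit on any other plate type.
    wells_on_type = wells.merge(selected, on=keys, how="inner")
    # value_counts sorts from most to fewest replicates.
    return wells_on_type["Metadata_JCP2022"].value_counts()


# Toy data only: made-up plates and IDs, not real JUMP metadata.
toy_plates = pd.DataFrame(
    {
        "Metadata_Source": ["source_1", "source_1", "source_1"],
        "Metadata_Plate": ["P1", "P2", "P3"],
        "Metadata_PlateType": ["COMPOUND", "COMPOUND", "TARGET2"],
    }
)
toy_wells = pd.DataFrame(
    {
        "Metadata_Source": ["source_1"] * 4,
        "Metadata_Plate": ["P1", "P1", "P2", "P3"],
        "Metadata_Well": ["A01", "A02", "A01", "A01"],
        "Metadata_JCP2022": ["toy_A", "toy_B", "toy_A", "toy_A"],
    }
)
replicate_counts = count_replicates(toy_wells, toy_plates)
print(replicate_counts)
```

Joining on both source and plate name is a deliberate choice here, on the assumption that plate names might not be unique across sources.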