-
Notifications
You must be signed in to change notification settings - Fork 117
(feat): match pbmc3k tutorial to seurat's #171
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
776f7f5
to
7629596
Compare
@flying-sheep Still very rough, but looking for some feedback given the above "outstanding" issues, especially on framing the reproducibility aspect |
Blast from the past: I also use an ARPACKy PCA in If nothing changed in the space, that might be their way forward as well, but of course I don’t know if RSpectra’s PCA is 100% identical to ARPACK. |
View / edit / reply to this conversation on ReviewNB flying-sheep commented on 2025-05-15T15:26:26Z there seems to be no output from |
View / edit / reply to this conversation on ReviewNB flying-sheep commented on 2025-05-15T15:26:27Z I don’t really get what “up to ties” means |
View / edit / reply to this conversation on ReviewNB flying-sheep commented on 2025-05-15T15:26:28Z Line #2. adata_subset_hvg = adata[:, adata.var["highly_variable"]].copy()
hmm, maybe explain that you’re using that subset for a while until you go back to the non-subset one?
I think it’s maybe a bit confusing that there are two ilan-gold commented on 2025-05-27T11:45:30Z A couple of things about this:
1. If you don't do the subset, the marker genes found in 0 vs. rest are ribosomal proteins, which is not immediately clear from the Seurat equivalent but becomes immediately clear if you score the genes. Hence my mention to Rahul of this issue about providing some (transparent) way to rank. And I don't think ribosomal proteins are particularly helpful, just a guess. Th2 output:
2. Plotting the 3. The "How can I remove unwanted sources of variation" part of the seurat tutorial uses HVG for scaling but the part above it does not i.e., scaling without regressing out. But we can't regress out on a subset of the feature space. ilan-gold commented on 2025-05-27T11:46:23Z Genuinely unsure how to proceed here, we could stop doing the regressing out (and just do scaling), and then report the ribosomal protein genes. But I'd be curious to hear what Rahul has to say. |
View / edit / reply to this conversation on ReviewNB flying-sheep commented on 2025-05-15T15:26:28Z We should just switch to
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
see above
A couple of things about this:
1. If you don't do the subset, the marker genes found in 0 vs. rest are ribosomal proteins, which is not immediately clear from the Seurat equivalent but becomes immediately clear if you score the genes. Hence my mention to Rahul of this issue about providing some (transparent) way to rank. And I don't think ribosomal proteins are particularly helpful, just a guess. Th2 output:
2. Plotting the 3. The "How can I remove unwanted sources of variation" part of the seurat tutorial uses HVG for scaling but the part above it does not i.e., scaling without regressing out. But we can't regress out on a subset of the feature space. View entire conversation on ReviewNB |
Genuinely unsure how to proceed here, we could stop doing the regressing out (and just do scaling), and then report the ribosomal protein genes. But I'd be curious to hear what Rahul has to say. View entire conversation on ReviewNB |
TODO:
arpack
since an R implementation exists: https://search.r-project.org/CRAN/refmans/igraph/html/arpack.html andigraph
is available in R: https://igraph.org/r/doc/cluster_leiden.htmlrendered