(feat): match pbmc3k tutorial to seurat's #171

ilan-gold · 2025-05-06T13:42:03Z

TODO:

Figure out best way for ranking genes so that we recover meaningful results. At the moment, running the method on the full gene list yields a lot of RP genes for the CD4 cluster ,which I would guess is basically noise. But it seems that seurat uses the full list. Separately I can't seem to figure out how to get the scores from seurat - they are claimed to be present but I don't seem them. I don't think they are just lfc.
Marker gene documentation CD8A and CD8B are not present in the ranked genes either here or in seurat but are noted as marker genes. So I think we should just change that table and note that some genes are not present the ranked genes (maybe explain why? talk to Rahul again?)
PCA Rahul's PCA is quite similar but not exact. It would be nice maybe for them to have arpack since an R implementation exists: https://search.r-project.org/CRAN/refmans/igraph/html/arpack.html and
Clustering Same as above, especially since igraph is available in R: https://igraph.org/r/doc/cluster_leiden.html

rendered

review-notebook-app · 2025-05-06T13:42:08Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

ilan-gold · 2025-05-07T12:51:05Z

@flying-sheep Still very rough, but looking for some feedback given the above "outstanding" issues, especially on framing the reproducibility aspect

for more information, see https://pre-commit.ci

flying-sheep · 2025-05-08T11:17:45Z

Blast from the past: I also use an ARPACKy PCA in destiny, using RSpectra:

If nothing changed in the space, that might be their way forward as well, but of course I don’t know if RSpectra’s PCA is 100% identical to ARPACK.

review-notebook-app · 2025-05-15T15:26:27Z

View / edit / reply to this conversation on ReviewNB

flying-sheep commented on 2025-05-15T15:26:26Z
----------------------------------------------------------------

there seems to be no output from print_header

review-notebook-app · 2025-05-15T15:26:27Z

View / edit / reply to this conversation on ReviewNB

flying-sheep commented on 2025-05-15T15:26:27Z
----------------------------------------------------------------

I don’t really get what “up to ties” means

review-notebook-app · 2025-05-15T15:26:28Z

View / edit / reply to this conversation on ReviewNB

flying-sheep commented on 2025-05-15T15:26:28Z
----------------------------------------------------------------

Line #2.    adata_subset_hvg = adata[:, adata.var["highly_variable"]].copy()

hmm, maybe explain that you’re using that subset for a while until you go back to the non-subset one?

I think it’s maybe a bit confusing that there are two adata objects being used interspersedly. I think modifying a notebook like that can easily result in copy-pasting the wrong name.

ilan-gold commented on 2025-05-27T11:45:30Z
----------------------------------------------------------------

A couple of things about this:

1. If you don't do the subset, the marker genes found in 0 vs. rest are ribosomal proteins, which is not immediately clear from the Seurat equivalent but becomes immediately clear if you score the genes. Hence my mention to Rahul of this issue about providing some (transparent) way to rank. And I don't think ribosomal proteins are particularly helpful, just a guess. Th2 output:

2. Plotting the marker_genes at the end includes non-HVG genes.

3. The "How can I remove unwanted sources of variation" part of the seurat tutorial uses HVG for scaling but the part above it does not i.e., scaling without regressing out. But we can't regress out on a subset of the feature space.

ilan-gold commented on 2025-05-27T11:46:23Z
----------------------------------------------------------------

Genuinely unsure how to proceed here, we could stop doing the regressing out (and just do scaling), and then report the ribosomal protein genes. But I'd be curious to hear what Rahul has to say.

review-notebook-app · 2025-05-15T15:26:29Z

View / edit / reply to this conversation on ReviewNB

flying-sheep commented on 2025-05-15T15:26:28Z
----------------------------------------------------------------

We should just switch to sc.tl.marker_gene_overlap instead of changing these around everytime this file is touched.

flying-sheep

see above

ilan-gold · 2025-05-27T11:45:31Z

A couple of things about this:

1. If you don't do the subset, the marker genes found in 0 vs. rest are ribosomal proteins, which is not immediately clear from the Seurat equivalent but becomes immediately clear if you score the genes. Hence my mention to Rahul of this issue about providing some (transparent) way to rank. And I don't think ribosomal proteins are particularly helpful, just a guess. Th2 output:

2. Plotting the marker_genes at the end includes non-HVG genes.

3. The "How can I remove unwanted sources of variation" part of the seurat tutorial uses HVG for scaling but the part above it does not i.e., scaling without regressing out. But we can't regress out on a subset of the feature space.

View entire conversation on ReviewNB

ilan-gold · 2025-05-27T11:46:25Z

Genuinely unsure how to proceed here, we could stop doing the regressing out (and just do scaling), and then report the ribosomal protein genes. But I'd be curious to hear what Rahul has to say.

View entire conversation on ReviewNB

ilan-gold added 2 commits April 28, 2025 17:55

(chore): up to PCA done

fb9ae9b

(feat): make notebook match seurat as close as possible

8eab607

(chore): clean up some parts

7629596

ilan-gold force-pushed the ig/seruat_compat branch from 776f7f5 to 7629596 Compare May 7, 2025 12:50

ilan-gold requested a review from flying-sheep May 7, 2025 12:50

pre-commit-ci bot and others added 2 commits May 7, 2025 12:51

[pre-commit.ci] auto fixes from pre-commit.com hooks

bbfd7f3

for more information, see https://pre-commit.ci

Merge branch 'main' into ig/seruat_compat

8675ac7

flying-sheep linked an issue May 15, 2025 that may be closed by this pull request

(bug): pbmc3k tutorial is not reproducible #82

Open

flying-sheep reviewed May 27, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

(feat): match pbmc3k tutorial to seurat's #171

(feat): match pbmc3k tutorial to seurat's #171

Uh oh!

ilan-gold commented May 6, 2025 •

edited by flying-sheep

Loading

Uh oh!

review-notebook-app bot commented May 6, 2025

Uh oh!

ilan-gold commented May 7, 2025 •

edited

Loading

Uh oh!

flying-sheep commented May 8, 2025 •

edited

Loading

Uh oh!

review-notebook-app bot commented May 15, 2025 •

edited

Loading

Uh oh!

review-notebook-app bot commented May 15, 2025 •

edited

Loading

Uh oh!

review-notebook-app bot commented May 15, 2025 •

edited

Loading

Uh oh!

review-notebook-app bot commented May 15, 2025 •

edited

Loading

Uh oh!

flying-sheep left a comment

Uh oh!

ilan-gold commented May 27, 2025

Uh oh!

ilan-gold commented May 27, 2025

Uh oh!

Uh oh!

(feat): match pbmc3k tutorial to seurat's #171

Are you sure you want to change the base?

(feat): match pbmc3k tutorial to seurat's #171

Uh oh!

Conversation

ilan-gold commented May 6, 2025 • edited by flying-sheep Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

rendered

Uh oh!

review-notebook-app bot commented May 6, 2025

Uh oh!

ilan-gold commented May 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

flying-sheep commented May 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

flying-sheep left a comment

Choose a reason for hiding this comment

Uh oh!

ilan-gold commented May 27, 2025

Uh oh!

ilan-gold commented May 27, 2025

Uh oh!

Uh oh!

ilan-gold commented May 6, 2025 •

edited by flying-sheep

Loading

ilan-gold commented May 7, 2025 •

edited

Loading

flying-sheep commented May 8, 2025 •

edited

Loading

review-notebook-app bot commented May 15, 2025 •

edited

Loading

review-notebook-app bot commented May 15, 2025 •

edited

Loading

review-notebook-app bot commented May 15, 2025 •

edited

Loading

review-notebook-app bot commented May 15, 2025 •

edited

Loading