Resolve sample batch #2

Open
millersan opened this issue Apr 29, 2022 · 1 comment

Comments

@millersan

Dear authors, please accept my sincere thanks for providing such a useful tool.
How should I handle sample batches in the input counts, and can I use normalized data as input?
Best,
Miller

@chanwkimlab
Owner

chanwkimlab commented May 4, 2022

Hi Miller,

Thank you so much for using our software. Because MarcoPolo internally uses a Poisson distribution, it cannot take normalized data as input. Instead, MarcoPolo can handle sample batches by modeling them directly as covariates, denoted as β in the paper.

To use this feature, pass the batch covariate matrix to the Covar parameter of the save_QQscore function: https://github.com/chanwkimlab/MarcoPolo/blob/master/MarcoPolo/QQscore.py#L113. As you can see in the code, when the Covar parameter is not set, the model includes only an intercept, which means that the same baseline expression is assumed for all cells. For sample batches, set the Covar parameter to a matrix in which each cell's batch is one-hot encoded, so that a different baseline expression is used for each group of cells.
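
As a rough illustration of the one-hot encoding described above, here is a minimal sketch using only NumPy. It is not taken from the MarcoPolo code base: the helper name `one_hot_batches` and the example batch labels are hypothetical, and the cells-by-batches orientation and float dtype are assumptions that should be checked against what save_QQscore actually expects for Covar.

```python
import numpy as np


def one_hot_batches(batch_labels):
    """Build a (cells x batches) one-hot matrix from per-cell batch labels.

    Hypothetical helper for illustration; not part of MarcoPolo.
    Returns the matrix and the sorted batch names (one per column).
    """
    labels = np.asarray(batch_labels)
    # categories: sorted unique batch names; codes: column index for each cell
    categories, codes = np.unique(labels, return_inverse=True)
    matrix = np.zeros((labels.shape[0], categories.shape[0]), dtype=np.float32)
    # Set exactly one entry per row: the column of that cell's batch
    matrix[np.arange(labels.shape[0]), codes] = 1.0
    return matrix, categories


# Example labels, one per cell, in the same order as the cells in the count matrix
batch_labels = ["batch1", "batch1", "batch2", "batch3", "batch2"]
covar, categories = one_hot_batches(batch_labels)
```

Each row of `covar` contains a single 1 in the column of that cell's batch, so every batch gets its own baseline term. The resulting matrix is what would be passed as the Covar argument; any conversion it may need first (for example to a different array type) is not shown here.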

Please let me know if there are any other issues.

Best,
Chanwoo
