-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Assessing UMAP embedding quality and sweeping across n_neighbours parameter #1
Comments
Hi Oskar! If you want run the code using UMAP, it will ignore the |
Thanks! I was a bit confused by this:
in the EMBEDR class. |
Oh, of course! The perplexity parameter does double duty in that it is involved with how the embedding quality is assessed as well as in running t-SNE. That is, currently, the quality of an embedding is calculated as the similarity of two data-affinity matrices, one from the original data space and one from the embedded space. The high-dimensional affinity matrix depends on a perplexity parameter, If you use the same value for Alternately, you can change However, if you're concerned after you've run your sweep that you've chosen the wrong
I'm going to leave this whole thing open as something to prioritize in the next version because this should be easier! Also, this really underscores how these parameters should be separated semantically in the code. In my reply, I invented TLDR: You can probably leave |
Hi,
I am very exicted to try to assess the quality of my embeddings using EMBEDR. I am unsure though how to set the
perplexity
value, while doing then_neighbours
parameter sweep for UMAP. Should I set the EMBEDRperplexity
to always equaln_neighbours
?Best
Oskar
The text was updated successfully, but these errors were encountered: