Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use lineages for build targets #34

Open
trvrb opened this issue Oct 24, 2024 · 1 comment
Open

Use lineages for build targets #34

trvrb opened this issue Oct 24, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@trvrb
Copy link
Member

trvrb commented Oct 24, 2024

We have a number of pathogens where the primary split is in terms of geography. For example: ncov/gisaid/global/ vs ncov/gisaid/north-america/. However, for WNV it looks like there are really very distinct lineages (that often have geographic correspondence). Here's a quick phylogeny from https://www.mdpi.com/1999-4915/15/1/35:

Screenshot 2024-10-24 at 12 53 46 PM

I think you have a dengue like situation where a single phylogenetic analysis across all WNV lineages (or dengue serotypes) may be challenging to have very accurate molecular clock.

Rather than cuts between a global analysis and a north-america analysis, you might be better served by cutting by lineage instead. This would give URLs of wnv/all-lineages, wnv/lineage-1, wnv/lineage-2, etc... I wouldn't worry about trying to preserve that historical WNV/NA endpoint.

NCBI doesn't seem to have lineage well annotated, however if you source data from https://pathoplexus.org/west-nile/ you should get lineage for every sequence.

@trvrb trvrb added the enhancement New feature or request label Oct 24, 2024
@trvrb trvrb changed the title Auspice endpoints Use lineages for build targets Oct 24, 2024
@trvrb
Copy link
Member Author

trvrb commented Dec 20, 2024

Nice to see the lineage calls at https://nextstrain.org/staging/WNV/genome?c=lineage, I'd suggest to make the structure of the phylogenetic outputs follow:

  • nextstrain.org/wnv/all-lineages (basically what's currently at WNV/genome)
  • nextstrain.org/wnv/lineage-1A
  • nextstrain.org/wnv/lineage-2

If a lineage is too rare / too small it's okay for it not to be listed out as a specific build endpoint (so starting with just 1A and 2). These should still be viewable on the full phylogeny.

Get this working well for the genome view and later on we could update this to

  • nextstrain.org/wnv/all-lineages/genome
  • nextstrain.org/wnv/all-lineages/env (or whatever)

But just get things all the way working with genome as implicit before worrying about splitting it out.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant