Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include quick clade count after every ingest in logs #251

Open
corneliusroemer opened this issue Dec 10, 2021 · 0 comments
Open

Include quick clade count after every ingest in logs #251

corneliusroemer opened this issue Dec 10, 2021 · 0 comments
Labels
enhancement New feature or request

Comments

@corneliusroemer
Copy link
Member

It'd be good as sanity check to have clade counts output after every ingest run.

This command would be enough:

zcat metadata.tsv.gz | tsv-summarize -H --group-by Nextstrain_clade --count

Yielding something like:

Nextstrain_clade        count
20I (Alpha, V1) 1160560
21A (Delta)     109534
20G     125069
21F (Iota)      43376
21I (Delta)     256077
20A     310770
21J (Delta)     2966243
19B     16740
21D (Eta)       7553
20C     123191
20B     261388
20H (Beta, V2)  42294
20D     13044
21E (Theta)     638
21B (Kappa)     8114
20J (Gamma, V3) 116987
20E (EU1)       175137
21G (Lambda)    9245
21C (Epsilon)   64377
19A     24359
21H (Mu)        14275
20F     13766
21K (Omicron)   1450
21L     7
        2190
@corneliusroemer corneliusroemer added the enhancement New feature or request label Dec 10, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant