Skip to content

sunbeam-labs/sbx_sga

Repository files navigation

sbx_sga

Tests Condabot DockerHub

Introduction

sbx_sga (Single Genome Assembly) is a sunbeam extension for isolate QC, assembly, and classification. This pipeline uses Mash for quality control, Shovill for bacterial isolate assembly, CheckM2 and QUAST for assembly QC, MLST for typing, Bakta for annotation, abriTAMR for AMR profiling, and Sylph for taxonomic classification.

Config

  • mash_ref: the reference file for running Mash (should be a file ending in .msh)
  • checkm_ref: the diamond database for running CheckM2 (should be a file ending in .dmnd)
  • bakta_ref: the bakta reference database (should be a directory similar to .../bakta_db/db/)
  • genomad_ref: the genomad reference database (should be a directory containing many files including version.txt (this is what our pipeline checks for to verify it exists))
  • sylph_ref: the sylph reference database (should be a .syldb file)
  • snippy_ref: the snippy reference genome (should be a fasta with at least a decent quality genome)

Database installations

conda create -n sga_dbs -c conda-forge -c bioconda mash bakta checkm2 genomad diamond prodigal
conda activate sga_dbs

For making smaller test databases, see .tests/data/README.md.

Genomad

genomad download-database /path/to/db_storage/

Docs

More docs.

About

A Sunbeam extension for Single Genome Assembly

Resources

Stars

Watchers

Forks

Packages

No packages published

Contributors 6

Languages