Skip to content
Francisco Zorrilla edited this page Apr 15, 2021 · 42 revisions

What is metaGEM?

metaGEM is a Snakemake-based workflow designed to generate high quality metagenome assembled genomes from short read paired-end data, reconstruct genome scale metabolic models, and perform community metabolic interaction simulations on high performance computing clusters. In the metaGEM manuscript we apply the workflow to lab culture, human gut, plant associated, bulk soil, and TARA oceans metagenomic samples.

Quickstart

The fastest way for you to start submitting jobs with metaGEM is to follow this quickstart guide. If you want to have more control over the software installed you can alternatively follow this manual setup guide.

Tutorial

A detailed tutorial of the metaGEM core workflow is demonstrated here.

Frequently asked questions

You can find answers to FAQs here.

Workflow

The image below shows a modified version of the workflow diagram, highlighting the most important parts of the core workflow on the left-hand column and auxiliary or supplementary tools on the right-hand column.

metagem_repo_fig 001

In the following pages you can find documentation regarding the implementation of each tool within the Snakefile. For a more hands on demonstration of how to use metaGEM please see the tutorial above.

Core

  1. Quality filter reads with fastp
  2. Assembly with megahit
  3. Draft bin sets with CONCOCT, MaxBin2, and MetaBAT2
  4. Refine & reassemble bins with metaWRAP
  5. Taxonomic assignment with GTDB-tk
  6. Relative abundances with bwa
  7. Reconstruct & evaluate genome-scale metabolic models with CarveMe and memote
  8. Species metabolic coupling analysis with SMETANA

Bonus

  1. Growth rate estimation
  2. Pangenome analysis based on MAGs
  3. Eukaryotic draft bins with EukRep and EukCC

Binning performance

image

Please cite

While the metaGEM publication is currently in the second round of revision please cite the pre-print if you use our pipeline in your research:

metaGEM: reconstruction of genome scale metabolic models directly from metagenomes
Francisco Zorrilla, Kiran R. Patil, Aleksej Zelezniak
bioRxiv 2020.12.31.424982; doi: https://doi.org/10.1101/2020.12.31.424982 

Clone this wiki locally