    Repositories list

    • adam

      Public
      ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
      Scala
      3151k355 Updated Jul 12, 2025
    • convert

      Public
      Conversions to and from Big Data Genomics Avro Formats. Apache 2 licensed.
      Java
      5021 Updated Jul 12, 2025
    • utils

      Public
      General utility code used across BDG products. Apache 2 licensed.
      Scala
      261821 Updated May 6, 2025
    • cannoli

      Public
      Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.
      Scala
      174110 Updated Mar 2, 2025
    • Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.
      Shell
      354011 Updated Feb 12, 2025
    • Web Site for the Big Data Genomics Group
      HTML
      71000 Updated Sep 3, 2023
    • mango

      Public
      A scalable genome browser. Apache 2 licensed.
      Scala
      32125577 Updated Dec 2, 2022
    • workflows

      Public
      Toil workflows for bigdatagenomics tools. Apache 2 licensed.
      Python
      5581 Updated Apr 22, 2021
    • Dockerfile
      2101 Updated Aug 23, 2020
    • deca

      Public
      Distributed exome CNV analyzer. Apache 2 licensed.
      Scala
      4391 Updated Oct 15, 2019
    • Awesome list of applications that extend Big Data Genomics ADAM. CC0 licensed.
      41100 Updated Jul 11, 2019
    • avocado

      Public
      A Variant Caller, Distributed. Apache 2 licensed.
      Scala
      4271196 Updated Mar 11, 2019
    • gnocchi

      Public
      Scala
      106101 Updated Apr 24, 2018
    • lime

      Public
      Distributed Set Theory for Genomics
      Scala
      3572 Updated Mar 27, 2018
    • rice

      Public
      An RNA pipeline built on top of ADAM. Apache 2 licensed.
      Scala
      171972 Updated Jan 19, 2018
    • quinine

      Public
      A refreshing treatment for all quality control ailments. Apache 2 licensed.
      Scala
      6252 Updated Oct 13, 2016
    • Exemplar API that mediates Toil with a WDL front-end and workflow tracking.
      Java
      1101 Updated Aug 1, 2016
    • eggo

      Public
      Ready-to-go Parquet-formatted public 'omics datasets
      Python
      830213 Updated Nov 2, 2015
    • recipes

      Public
      Recipes using BDG projects. Apache 2 licensed.
      Shell
      3410 Updated Mar 25, 2015
    • PacMin

      Public
      Assembler for PacBio reads. Apache 2 licensed.
      Scala
      3340 Updated Mar 14, 2015
    • corretto

      Public
      Read error correction utilities.
      2020 Updated Mar 1, 2015
    • Notebook tools for Big Data Genomics. Apache 2 licensed.
      JavaScript
      653300 Updated Mar 1, 2015
    • Utility classes for wrapping services or other interfaces around a Spark/ADAM cluster. Apache 2 licensed.
      Java
      8520 Updated Nov 17, 2014