https://github.com/bigdatagenomics/

https://twitter.com/bigdatagenomics/

Thanks to advances in both the cost and speed of sequencing technology, the amount of genomic data available for processing is growing exponentially. As a project, our goal is to build scalable pipelines for processing genomic data on top of high performance distributed computing frameworks.

Projects

At the moment, we are working on three projects:

ADAM: A scalable API & CLI for genome processing
bdg-formats: Schemas for genomic data
avocado: A Variant Caller, Distributed

The source for these projects is available at Github.

Licensing

All of our development is available under the Apache 2 open source software (OSS) license. This OSS license is non-viral, and places no restrictions on users who would like to use or modify the software.

Comments

About us...

This project is supported in part by NIH BD2K Award 1-U54HG007990-01 and NIH Cancer Cloud Pilot Award HHSN261201400006C with collaborators from the AMPLab at UC Berkeley, Genome Informatics Lab at UC Santa Cruz, Icahn School of Medicine at Mount Sinai, Microsoft Research, Cloudera, and the Broad Institute.

Chat with us...

If you're interested in contributing, take a look at the open "pick me up!" issues.

Recent Posts

ADAM 0.25.0 and Cannoli 0.3.0 Released ADAM 0.24.0 and Cannoli 0.2.0 Released ADAM 0.23.0 Released (+ Avocado and DECA Releases) ADAM 0.22.0 Released ADAM 0.21.0 Released

YourKit is supporting the Big Data Genomics open source project with its full-featured Java Profiler. YourKit, LLC is the creator of innovative and intelligent tools for profiling Java and .NET applications. Take a look at YourKit's leading software products: YourKit Java Profiler and YourKit .NET Profiler.