Big Data Genomics: ADAM
ADAM: Data Alignment/Map
ADAM provides both an application programming interface (API) and a command line interface (CLI) for manipulating genomic data on a computing cluster. ADAM operates on data stored in Apache Parquet using the bdg-formats schemas, runs on Apache Spark, and provides scalable performance on clusters larger than 100 machines.
ADAM is hosted on GitHub. Quick start guides are available for running ADAM on EC2 and for building ADAM against specific CDH releases.
Releases
The latest available release of ADAM is 0.6.1. ADAM is available for projects using Maven or SBT through the Sonatype OSS repository.
Support
For support using ADAM, please contact the ADAM developer mailing list. We also track issues and feature requests through our GitHub issue tracker.
Citing
ADAM has been described in a UC Berkeley EECS technical report. Please cite this report when referencing ADAM.
Licensing
ADAM is available under the Apache License, Version 2.0, an open source software (OSS) license. This license is non-viral: users may use, modify, and redistribute the software, including in proprietary products, provided they retain the license and attribution notices.