ADAM 0.19.0 Released - Big Data Genomics

ADAM version 0.19.0 has been released, built for both Scala 2.10 and Scala 2.11.

The 0.19.0 release contains various concordance fixes and performance improvements for accessing read metadata. Schema changes, including a bump to version 0.7.0 of the Big Data Genomics Avro data formats, were made to support the read metadata performance improvements. Additionally, exporting a single BAM file is now faster and is guaranteed to be correct for sorted data.

ADAM now targets Apache Spark 1.5.2 and Apache Hadoop 2.6.0 as the default build environment. ADAM and applications built on ADAM should run on a wide range of Apache Spark (1.3.1 up to and including the most recent, 1.6.0) and Apache Hadoop (currently 2.3.0 and 2.6.0) versions. A compatibility matrix of Spark, Hadoop, and Scala version builds in our continuous integration system verifies this. Please note, as of this release, support for Apache Spark 1.2.x and Apache Hadoop 1.0.x has been dropped.
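
To make the shape of that compatibility matrix concrete, here is a minimal Python sketch that enumerates version combinations. It is illustrative only: the lists contain just the versions named in this post, the actual continuous integration matrix may cover additional intermediate Spark releases, and the sketch does not claim that every combination is built. The names `build_matrix` and `is_default` are hypothetical helpers, not part of ADAM.

```python
from itertools import product

# Versions named in this post only. Assumption: the real CI matrix may
# include further Spark releases between 1.3.1 and 1.6.0.
SPARK_VERSIONS = ["1.3.1", "1.5.2", "1.6.0"]
HADOOP_VERSIONS = ["2.3.0", "2.6.0"]
SCALA_VERSIONS = ["2.10", "2.11"]


def build_matrix():
    """Return every (scala, spark, hadoop) combination as a tuple."""
    return list(product(SCALA_VERSIONS, SPARK_VERSIONS, HADOOP_VERSIONS))


def is_default(scala, spark, hadoop):
    """True for the default build environment: Spark 1.5.2 with Hadoop 2.6.0."""
    return spark == "1.5.2" and hadoop == "2.6.0"


if __name__ == "__main__":
    for scala, spark, hadoop in build_matrix():
        marker = " (default)" if is_default(scala, spark, hadoop) else ""
        print(f"Scala {scala}, Spark {spark}, Hadoop {hadoop}{marker}")
```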

The full list of changes since version 0.18.2 is below.

Closed issues:

Update bdg-utils dependency version to 0.2.4 #960
Drop support for Spark version 1.2.1, Hadoop version 1.0.x #958
Exception occurs when running tests on master #956
Flagstat results still don’t match samtools flagstat #946
readInFragment value is not properly read from parquet file into RDD[AlignmentRecord] #942
adam2vcf -sort_on_save flag broken #940
Transform -limit_projection requires .sam.seqdict file #937
MarkDuplicates fails if library name is not set #934
fastqtobam or sam #928
Vcf2Adam uses SB field instead of FS field for fisher exact test for strand bias #923
Add back limit_projection on Transform #920
BAM header is not getting set on partition 0 with headerless BAM output format #916
Add numParts apply method to GenomicRegionPartitioner #914
Add Spark version 1.6.x to Jenkins build matrix #913
Target Spark 1.5.2 as default Spark version #911
Move to bdg-formats 0.7.0 #905
secondOfPair and firstOfPair flag is missing in the newest 0.18 adam transformed results from BAM #903
Future pull request #900
error in vcf2adam #899
Importing directory of VCFs seems to fail #898
How to filter genotypeRDD on sample names? org.apache.spark.SparkException: Task not serializable? #891
Add Spark version 1.5.x to Jenkins build matrix #889
Transform DAG causes stages to recompute #883
adam-submit buildinfo is confused #880
move_to_scala_2.11 and maven-javadoc-plugin #863
NativeCodeLoader: Unable to load native-hadoop library for your platform… using builtin-java classes where applicable #837
Fix record oriented shuffle #599
Avro.GenericData error with ADAM 0.12.0 on reading from ADAM file #290

Merged and closed pull requests:

[ADAM-960] Updating bdg-utils dependency version to 0.2.4 #961 (heuermh)
[ADAM-946] Fixes to FlagStat for Samtools concordance issue #954 (jpdna)
Fix for travis build, replace reads2ref with reads2fragments #950 (heuermh)
[ADAM-940] Fix adam2vcf -sort_on_save flag #949 (massie)
Remove BuildInformation and extraneous git-commit-id-plugin configuration #948 (heuermh)
Update readme for spark 1.5.2 and hadoop 2.6.0 #944 (heuermh)
[ADAM-942] Replace first/secondInRead with readInFragment #943 (heuermh)
[ADAM-937] Adding check for aligned read predicate or limit projection flags and non-parquet input path #938 (heuermh)
[ADAM-934] Properly handle unset library name during duplicate marking #935 (fnothaft)
[ADAM-911] Move to Spark 1.5.2 and Hadoop 2.6.0 as default versions. #932 (fnothaft)
added start and end values to Interval Trait. Used for IntervalRDD #931 (akmorrow13)
Removing buildinfo command #929 (heuermh)
Removing symbolic test resource links, read from test classpath instead #927 (heuermh)
Changed fisher strand bias field for VCF2Adam from SB to FS #924 (andrewmchen)
[ADAM-920] Limit tag/orig qual flags in Transform. #921 (fnothaft)
Change the README to use adam-shell -i instead of pasting #919 (andrewmchen)
[ADAM-916] New strategy for writing header. #917 (fnothaft)
[ADAM-914] Create a GenomicRegionPartitioner given a partition count. #915 (fnothaft)
Squashed #907 and ran format-sources #908 (fnothaft)
Various small fixes #907 (huitseeker)
ADAM-599, 905: Move to bdg-formats:0.7.0 and migrate metadata #906 (fnothaft)
Rewrote the getType method to handle all ploidy levels #904 (NeillGibson)
Single file save from #733, rebased #901 (fnothaft)
Added is* genotype methods from HTS-JDK Genotype to RichGenotype #895 (NeillGibson)
[ADAM-891] Mark SparkContext as @transient. #894 (fnothaft)
Update README URLs based on HTTP redirects #892 (ReadmeCritic)
adding --version command line option #888 (heuermh)
Add exception in move_to_scala_2.11.sh for maven-javadoc-plugin #887 (heuermh)
Fix tightlist bug in Pandoc #885 (massie)
[ADAM-883] Add caching to Transform pipeline. #884 (fnothaft)
