ADAM 0.19.0 Released
ADAM version 0.19.0 has been released, built for both Scala 2.10 and Scala 2.11.
The 0.19.0 release contains various concordance fixes and performance improvements for accessing read metadata. Schema changes, including a bump to version 0.7.0 of the Big Data Genomics Avro data formats, were made to support the read metadata performance improvements. Additionally, the performance of exporting a single BAM file was improved, and this was made to be guaranteed correct for sorted data.
ADAM now targets Apache Spark 1.5.2 and Apache Hadoop 2.6.0 as the default build environment. ADAM and applications built on ADAM should run on a wide range of Apache Spark (1.3.1 up to and including the most recent, 1.6.0) and Apache Hadoop (currently 2.3.0 and 2.6.0) versions. A compatibility matrix of Spark, Hadoop, and Scala version builds in our continuous integration system verifies this. Please note, as of this release, support for Apache Spark 1.2.x and Apache Hadoop 1.0.x has been dropped.
The full list of changes since version 0.18.2 is below.
Closed issues:
- Update bdg-utils dependency version to 0.2.4 #960
- Drop support for Spark version 1.2.1, Hadoop version 1.0.x #958
- Exception occurs when running tests on master #956
- Flagstat results still don’t match samtools flagstat #946
- readInFragment value is not properly read from parquet file into RDD[AlignmentRecord] #942
- adam2vcf -sort_on_save flag broken #940
- Transform -limit_projection requires .sam.seqdict file #937
- MarkDuplicates fails if library name is not set #934
- fastqtobam or sam #928
- Vcf2Adam uses SB field instead of FS field for fisher exact test for strand bias #923
- Add back limit_projection on Transform #920
- BAM header is not getting set on partition 0 with headerless BAM output format #916
- Add numParts apply method to GenomicRegionPartitioner #914
- Add Spark version 1.6.x to Jenkins build matrix #913
- Target Spark 1.5.2 as default Spark version #911
- Move to bdg-formats 0.7.0 #905
- secondOfPair and firstOfPair flag is missing in the newest 0.18 adam transformed results from BAM #903
- Future pull request #900
- error in vcf2adam #899
- Importing directory of VCFs seems to fail #898
- How to filter genotypeRDD on sample names? org.apache.spark.SparkException: Task not serializable? #891
- Add Spark version 1.5.x to Jenkins build matrix #889
- Transform DAG causes stages to recompute #883
- adam-submit buildinfo is confused #880
- move_to_scala_2.11 and maven-javadoc-plugin #863
- NativeCodeLoader: Unable to load native-hadoop library for your platform… using builtin-java classes where applicable #837
- Fix record oriented shuffle #599
- Avro.GenericData error with ADAM 0.12.0 on reading from ADAM file #290
Merged and closed pull requests:
- [ADAM-960] Updating bdg-utils dependency version to 0.2.4 #961 (heuermh)
- [ADAM-946] Fixes to FlagStat for Samtools concordance issue #954 (jpdna)
- Fix for travis build, replace reads2ref with reads2fragments #950 (heuermh)
- [ADAM-940] Fix adam2vcf -sort_on_save flag #949 (massie)
- Remove BuildInformation and extraneous git-commit-id-plugin configuration #948 (heuermh)
- Update readme for spark 1.5.2 and hadoop 2.6.0 #944 (heuermh)
- [ADAM-942] Replace first/secondInRead with readInFragment #943 (heuermh)
- [ADAM-937] Adding check for aligned read predicate or limit projection flags and non-parquet input path #938 (heuermh)
- [ADAM-934] Properly handle unset library name during duplicate marking #935 (fnothaft)
- [ADAM-911] Move to Spark 1.5.2 and Hadoop 2.6.0 as default versions. #932 (fnothaft)
- added start and end values to Interval Trait. Used for IntervalRDD #931 (akmorrow13)
- Removing buildinfo command #929 (heuermh)
- Removing symbolic test resource links, read from test classpath instead #927 (heuermh)
- Changed fisher strand bias field for VCF2Adam from SB to FS #924 (andrewmchen)
- [ADAM-920] Limit tag/orig qual flags in Transform. #921 (fnothaft)
- Change the README to use adam-shell -i instead of pasting #919 (andrewmchen)
- [ADAM-916] New strategy for writing header. #917 (fnothaft)
- [ADAM-914] Create a GenomicRegionPartitioner given a partition count. #915 (fnothaft)
- Squashed #907 and ran format-sources #908 (fnothaft)
- Various small fixes #907 (huitseeker)
- ADAM-599, 905: Move to bdg-formats:0.7.0 and migrate metadata #906 (fnothaft)
- Rewrote the getType method to handle all ploidy levels #904 (NeillGibson)
- Single file save from #733, rebased #901 (fnothaft)
- Added is* genotype methods from HTS-JDK Genotype to RichGenotype #895 (NeillGibson)
- [ADAM-891] Mark SparkContext as @transient. #894 (fnothaft)
- Update README URLs based on HTTP redirects #892 (ReadmeCritic)
- adding —version command line option #888 (heuermh)
- Add exception in move_to_scala_2.11.sh for maven-javadoc-plugin #887 (heuermh)
- Fix tightlist bug in Pandoc #885 (massie)
- [ADAM-883] Add caching to Transform pipeline. #884 (fnothaft)