Cluster-based Apache Spark implementation of the GATK DNA analysis pipeline | IEEE Conference Publication | IEEE Xplore