Abstract
In the last decades, the human genoma analysis for addressing health-care problems, has widely grown. With the high throughput of biological data and, needing of represent them, the Next-Generation Sequencing was introduced. In order to store genomic features without losing information, different data format (such as FAST-A, FAST-Q, SAM, VCF) have been proposed. To overcome the storing process issues of these data, several genomic compressors have been presented. A specific VCF compressor is analyzed. Due to the restricted hardware resources limit of multi-core architecture when input size dimension data are given, large execution times are required. Thanks to the well-known parallel nature of the most recent Graphic Process Units, in this work we present a Multi-GPU based parallel implementation by exploiting CUDA framework. An ad-hoc memory approach combined with a suitable work decomposition strategy are able to give a strong increase in performance. To observe the benefits in terms of performance, tests and experiments complete our work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Deorowicz, S., Danek, A.: VCF Shark: how to squeeze a VCF file. bioRxiv (2020)
De Luca, P., Formisano, A.: Haptic data accelerated prediction via multicore implementation. In: Science and Information Conference, pp. 110–121. Springer, Cham (2020)
Marcellino, L. et al.: Using GPGPU accelerated interpolation algorithms for marine bathymetry processing with on-premises and cloud based computational resources. In: International Conference on Parallel Processing and Applied Mathematics. Springer, Cham (2017)
De Luca, P., et al.: Accelerated Gaussian convolution in a data assimilation scenario. In: International Conference on Computational Science. Springer, Cham (2020)
De Luca, P., et al.: Recursive filter based GPU algorithms in a Data Assimilation scenario. J. Comput. Sci. 53, 101339 (2021)
De Luca, P., et al.: Performance analysis of a multicore implementation for solving a two-dimensional inverse anomalous diffusion problem. In: International Conference on Numerical Computations: Theory and Algorithms. Springer, Cham (2019)
Fiscale, S., et al.: A GPU Algorithm for Outliers Detection in TESS Light Curves. In: International Conference on Computational Science. Springer, Cham (2021)
De Luca, P., Galletti, A., Marcellino, L.: Parallel solvers comparison for an inverse problem in fractional calculus. In: 2020 Proceeding of 9th International Conference on Theory and Practice in Modern Computing (TPMC 2020) (2020, July)
Amich, M., et al.: Accelerated implementation of FQSqueezer novel genomic compression method. In: 2020 19th International Symposium on Parallel and Distributed Computing (ISPDC). pp. 158–163. IEEE (2020)
Levy, S., et al.: PLoS Biol. 5(10), e254
Shendure, J., et al.: Nat. Biotechnol. 26(10), 1135–1145 (2008)
Ansorge, W.J.: New Biotechnol. 25(4), 195–203 (2009)
De Luca, P., et al.: Distributed genomic compression in MapReduce paradigm. In: International Conference on Internet and Distributed Computing Systems. Springer, Cham (2019)
De Luca, P., Galletti, A., Ghehsareh, H.R., Marcellino, L., Raei, M.: A gpu-cuda framework for solving a two-dimensional inverse anomalous diffusion problem. In: Foster, I., Joubert, G.R., Kučera, L., Nagel, W.E., Peters, F. (eds) Parallel Computing: Technology Trends, Advances in Parallel Computing, vol 36, pp. 311–320. IOS Press (2020)
Zhang, X., et al.: Cancer Biol. Med. 16(1), 189
Danecek, Petr, et al.: The variant call format and VCFtools. Bioinformatics 27(15), 2156–2158 (2011)
Lan, D., et al.: Bioinformatics (Oxford, England) 36(13), 4091–4092
Deorowicz, S., et al.: Genome compression: a novel approach for large collections. Bioinformatics 29(20), 2572–2578 (2013)
Deorowicz, S., et al.: Bioinformatics 35(22), 4791–4793 (2019)
Chandak, S., et al.: SPRING: a next-generation compressor for FASTQ data. Bioinformatics 35(15), 2674–2676
Antonini, A., et al.: Low complexity algorithms for transmission of short blocks over the BSC with full feedback. In: (ISIT). pp. 2173–2178. IEEE
De Luca, P., Galletti, A., Marcellino, L.: A novel gpu implementation for image stripe noise removal. In: Intelligent Computing, pp. 232–243. Springer, Cham (2022)
https://www.hpc.cineca.it/hardware/marconi100 Accessed 30 June 30 2021
De Luca, P., Galletti, A., Marcellino, L.: A Gaussian recursive filter parallel implementation with overlapping. In: 2019 15th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS), pp. 641–648. IEEE (2019)
Acknowledgements
This paper has been supported for computational resources by project “Accelerated High Performance Methods for compressing Next-Generation sequencing data (AHNG20)” - CINECA.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
De Luca, P., Di Mauro, A., Fiscale, S. (2022). On Next-Generation Sequencing Compression via Multi-GPU. In: Camacho, D., Rosaci, D., Sarné, G.M.L., Versaci, M. (eds) Intelligent Distributed Computing XIV. IDC 2021. Studies in Computational Intelligence, vol 1026. Springer, Cham. https://doi.org/10.1007/978-3-030-96627-0_42
Download citation
DOI: https://doi.org/10.1007/978-3-030-96627-0_42
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-96626-3
Online ISBN: 978-3-030-96627-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)