On Next-Generation Sequencing Compression via Multi-GPU

  • Conference paper
Intelligent Distributed Computing XIV (IDC 2021)

Part of the book series: Studies in Computational Intelligence (SCI, volume 1026)

Abstract

In recent decades, the analysis of the human genome to address health-care problems has grown widely. Next-Generation Sequencing was introduced to cope with the high throughput of biological data and the need to represent it. To store genomic features without losing information, different data formats (such as FASTA, FASTQ, SAM and VCF) have been proposed, and several genomic compressors have been presented to overcome the storage issues these data raise. In this work, a specific VCF compressor is analyzed. Because the hardware resources of multi-core architectures are limited, large inputs require long execution times. Exploiting the well-known parallel nature of recent Graphics Processing Units (GPUs), we present a Multi-GPU parallel implementation based on the CUDA framework. An ad-hoc memory approach, combined with a suitable work decomposition strategy, yields a strong increase in performance. Tests and experiments complete the work and show the performance benefits.

Acknowledgements

The computational resources for this paper were provided by the project “Accelerated High Performance Methods for compressing Next-Generation sequencing data (AHNG20)” - CINECA.

Author information

Corresponding author

Correspondence to Pasquale De Luca.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

De Luca, P., Di Mauro, A., Fiscale, S. (2022). On Next-Generation Sequencing Compression via Multi-GPU. In: Camacho, D., Rosaci, D., Sarné, G.M.L., Versaci, M. (eds) Intelligent Distributed Computing XIV. IDC 2021. Studies in Computational Intelligence, vol 1026. Springer, Cham. https://doi.org/10.1007/978-3-030-96627-0_42
