Skip to main content

A Federated In-memory Database System for Life Sciences

  • Conference paper
  • First Online:
Real-Time Business Intelligence and Analytics (BIRTE 2015, BIRTE 2016, BIRTE 2017)

Abstract

Cloud computing has become a synonym for elastic provision of shared computing resources operated by a professional service provider. However, data needs to be transferred from local systems to shared resources for processing, which might results in significant process delays and the need to comply with special data privacy acts. Based on the concrete requirements of life sciences research, we share our experience in integrating existing decentralized computing resources to form a federated in-memory database system. Our approach combines advantages of cloud computing, such as efficient use of hardware resources and provisioning of managed software, whilst sensitive data are stored and processed on local hardware only.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amazon Web Services, Inc.: Amazon Elastic Computing Cloud (EC2), July 2015. http://aws.amazon.com/ec2/Jul27. Accessed 10 Dec 2018

  2. Armbrust, M., et al.: A view of cloud computing. Commun. ACM 53(4), 50–58 (2010)

    Article  Google Scholar 

  3. Bhuvaneshwar, K., et al.: A case study for cloud-based high-throughput analysis of NGS data using the globus genomics system. Comput. Struct. Biotechnol. J. 13, 64–74 (2015)

    Article  Google Scholar 

  4. Bundesärztekammer und Kassenärztliche Vereinigung: Empfehlungen zur ärztlichen Schweigepflicht, Datenschutz und Datenverarbeitung in der Arztpraxis. Deutsche Ärzteblatt 111(21), A963–A972 (2014)

    Google Scholar 

  5. CERN: gLite - Lightweight Middleware for Grid Computing, April 2014. http://grid-deployment.web.cern.ch/grid-deployment/glite-web/introductionJul27. Accessed 10 Dec 2018

  6. Everest Global, Inc.: Enterprise Cloud Adoption Survey, March 2014. http://www.everestgrp.com/wp-content/uploads/2014/03/2014-Enterprise-Cloud-Adoption-Survey.pdfDec17. Accessed 10 Dec 2018

  7. Färber, F., et al.: SAP HANA database: data management for modern business applications. SIGMOD Rec. 40(4), 45–51 (2012)

    Article  Google Scholar 

  8. Fears, R., et al.: Data protection regulation and the promotion of health research: getting the balance right. QJM 107(1), 3–5 (2013)

    Article  Google Scholar 

  9. Gartner, Inc.: 2014 Hype Cycle for Emerging Technologies Maps the Journey to Digital Business, August 2014. http://www.gartner.com/newsroom/id/2819918Dec11. Accessed 10 Dec 2018

  10. Goecks, J., Nekrutenko, A., Taylor, J., The Galaxy Team: Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 11(8), R86 (2010)

    Article  Google Scholar 

  11. Grondona, M.A.: Parallel Distributed Shell (PDSH), August 2011. https://code.google.com/p/pdsh/wiki/UsingPDSHNov26. Accessed 10 Dec 2018

  12. Hardt, D.: RFC6749: The OAuth 2.0 Authorization Framework, October 2012. http://tools.ietf.org/html/rfc6749/Nov26. Accessed 10 Dec 2018

  13. Jensen, P.B., Jensen, L.J., Brunak, S.: Mining electronic health records: towards better research applications and clinical care. Nat. Rev. Genet. 13(6), 395–405 (2012)

    Article  Google Scholar 

  14. Kalloniatis, C., Manousakis, V., Mouratidis, H., Gritzalis, S.: Migrating into the cloud: identifying the major security and privacy concerns. In: Douligeris, C., Polemi, N., Karantjias, A., Lamersdorf, W. (eds.) I3E 2013. IAICT, vol. 399, pp. 73–87. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37437-1_7

    Chapter  Google Scholar 

  15. Knöpfel, A., Grone, B., Tabeling, P.: Fundamental Modeling Concepts: Effective Communication of IT Systems. Wiley, Hoboken (2006)

    Google Scholar 

  16. Kundra, V.: Federal Cloud Computing Strategy, February 2011. http://www.whitehouse.gov/sites/default/files/omb/assets/egov_docs/federal-cloud-computing-strategy.pdfDec15. Accessed 10 Dec 2018

  17. Langmead, B., Salzberg, S.L.: Fast gapped read alignment with bowtie 2. Nat. Methods 9, 357–359 (2012)

    Article  Google Scholar 

  18. Li, H., Durbin, R.: Fast and accurate short read alignment with burrows-wheeler transformation. Bioinformatics 25, 1754–1760 (2009)

    Article  Google Scholar 

  19. National Institute of Standards and Technology: The NIST Definition of CloudComputing: Recommendations of the National Institute of Standards andTechnology. NIST Special Publication 800-145, September 2011

    Google Scholar 

  20. OpenVPN Technologies, Inc.: Site-to-Site Layer 3 Routing Using OpenVPNAccess Server and a Linux Gateway Client, February 2012. https://docs.openvpn.net/. Accessed 10 Dec 2018

  21. Plattner, H., Schapranow, M.-P. (eds.): High-Performance In-Memory Genome Data Analysis: How In-Memory Database Technology Accelerates Personalized Medicine. IDMR. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-03035-7

    Book  Google Scholar 

  22. Ryan, M.D.: Cloud computing privacy concerns on our doorstep. Commun. ACM 54(1), 36–38 (2011)

    Article  Google Scholar 

  23. SAP SE: Add Hosts Using the Command-Line Interface (2014). http://help.sap.com/saphelp_hanaplatform/helpdata/en/0d/9fe701e2214e98ad4f8721f6558c34/content.htm. Accessed 10 Dec 2018

  24. Schaffner, J.: Multi Tenancy for Cloud-Based In-Memory Column Databases: Workload Management and Data Placement. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-319-00497-6

    Book  Google Scholar 

  25. Schapranow, M.P., et al.: In-memory computing enabling real-time genome data analysis. Int. J. Adv. Life Sci. 6(1 and 2), 11–29 (2014)

    Google Scholar 

  26. Srinivasan, S.: Cloud computing evolution. In: Srinivasan, S. (ed.) Cloud Computing Basics. SECE, pp. 1–16. Springer, New York (2014). https://doi.org/10.1007/978-1-4614-7699-3_1

    Chapter  Google Scholar 

  27. The UNICORE Forum e.V.: UNICORE - Documentation, July 2015. https://www.unicore.eu/documentation/Jul27. Accessed 10 Dec 2018

  28. Wicks, P., et al.: Sharing health data for better outcomes on PatientsLikeMe. J. Med. Internet Res. 12(2), e19 (2010)

    Article  Google Scholar 

  29. Zhang, Q., Cheng, L., Boutaba, R.: Cloud computing: state-of-the-art and research challenges. J. Internet Serv. Appl. 1(1), 7–18 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Matthieu-P. Schapranow .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Schapranow, MP. et al. (2019). A Federated In-memory Database System for Life Sciences. In: Castellanos, M., Chrysanthis, P., Pelechrinis, K. (eds) Real-Time Business Intelligence and Analytics. BIRTE BIRTE BIRTE 2015 2016 2017. Lecture Notes in Business Information Processing, vol 337. Springer, Cham. https://doi.org/10.1007/978-3-030-24124-7_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-24124-7_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-24123-0

  • Online ISBN: 978-3-030-24124-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics