Abstract
Cloud computing has become a synonym for the elastic provisioning of shared computing resources operated by a professional service provider. However, data need to be transferred from local systems to shared resources for processing, which might result in significant process delays and the need to comply with special data privacy acts. Based on the concrete requirements of life sciences research, we share our experience in integrating existing decentralized computing resources to form a federated in-memory database system. Our approach combines advantages of cloud computing, such as efficient use of hardware resources and provisioning of managed software, while sensitive data are stored and processed on local hardware only.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Schapranow, MP. et al. (2019). A Federated In-memory Database System for Life Sciences. In: Castellanos, M., Chrysanthis, P., Pelechrinis, K. (eds) Real-Time Business Intelligence and Analytics. BIRTE 2015, BIRTE 2016, BIRTE 2017. Lecture Notes in Business Information Processing, vol 337. Springer, Cham. https://doi.org/10.1007/978-3-030-24124-7_2
DOI: https://doi.org/10.1007/978-3-030-24124-7_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-24123-0
Online ISBN: 978-3-030-24124-7
eBook Packages: Computer Science, Computer Science (R0)