Data Lakes

Mathis, Christian

doi:10.1007/s13222-017-0272-7

Data Lakes

Kurz erklärt
Published: 06 October 2017

Volume 17, pages 289–293, (2017)
Cite this article

Datenbank-Spektrum Aims and scope Submit manuscript

Christian Mathis ORCID: orcid.org/0000-0002-7530-5947¹

4801 Accesses
59 Citations
7 Altmetric
1 Mention
Explore all metrics

Abstract

By moving data into a centralized, scalable storage location inside an organization – the data lake – companies and other institutions aim to discover new information and to generate value from the data. The data lake can help to overcome organizational boundaries and system complexity. However, to generate value from the data, additional techniques, tools, and processes need to be established which help to overcome data integration and other challenges around this approach. Although there is a certain agreed-on notion of the central idea, there is no accepted definition what components or functionality a data lake has or how an architecture looks like. Throughout this article, we will start with the central idea and discuss various related aspects and technologies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Trends and Future Perspective Challenges in Big Data

Big Data Analytics: Applications, Prospects and Challenges

The use of Big Data Analytics in healthcare

Article Open access 06 January 2022

References

Dixon J (2010) Pentaho, hadoop, and data lakes. https://jamesdixon.wordpress.com/2010/10/14/pentaho-hadoop-and-data-lakes/
Google Scholar
Dong XL, Srivastava D (2015) Big Data Integration. Morgan and Claypool Publishers, San Rafael, CA
Google Scholar
Ramakrishnan R et al (2017) Azure data lake store: a hyperscale distributed file service for big data analytics. Proc ACM SIGMOD Int Conf Manag Data. https://doi.org/10.1145/3035918.3056100
Google Scholar
Maltzahn C, Molina-Estolano E, Khurana A, Nelson AJ, Brandt SA, Weil S (2010) Ceph as a scalable alternative to the Hadoop distributed file system. login 35(4):38–49
Cohen J, Dolan B, Dunlap M, Hellerstein JM, Welton C (2009) MAD skills: new analysis practices for big data. Proc VLDB Endow 2009:1481–1492
Article Google Scholar
Xin RS, Rosen J, Zahira M, Franklin MJ, Shenker S, Stoica I (2013) Shark: SQL and rich analytics at scale. Proc ACM SIGMOD Int Conf Manag Data 2013:13–24
Google Scholar
Kreps J (2014) Questioning the lambda architecture. http://milinda.pathirage.org/kappa-architecture.com/. Accessed: 30. Sept. 2017
Google Scholar
Marz N (2011) How to beat the CAP theorem. http://nathanmarz.com/blog/how-to-beat-the-cap-theorem.html. Accessed: 30. Sept. 2017
Google Scholar

Download references

Acknowledgements

I would like to thank Christian Sengstock and Martin Hartig for feedback and discussions while writing this article.

Author information

Authors and Affiliations

SAP SE, Dietmar-Hopp-Allee 16, 69190, Walldorf, Germany
Christian Mathis

Authors

Christian Mathis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Christian Mathis.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mathis, C. Data Lakes. Datenbank Spektrum 17, 289–293 (2017). https://doi.org/10.1007/s13222-017-0272-7

Download citation

Published: 06 October 2017
Issue Date: November 2017
DOI: https://doi.org/10.1007/s13222-017-0272-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Data Lakes

Abstract

Access this article

Similar content being viewed by others

Trends and Future Perspective Challenges in Big Data

Big Data Analytics: Applications, Prospects and Challenges

The use of Big Data Analytics in healthcare

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Data Lakes

Abstract

Access this article

Similar content being viewed by others

Trends and Future Perspective Challenges in Big Data

Big Data Analytics: Applications, Prospects and Challenges

The use of Big Data Analytics in healthcare

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation