skip to main content
10.1145/2811222.2811235acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Big Data Design

Published: 22 October 2015 Publication History

Abstract

It is widely accepted today that Relational databases are not appropriate in highly distributed shared-nothing architectures of commodity hardware, that need to handle poorly structured heterogeneous data. This has brought the blooming of NoSQL systems with the purpose of mitigating such problem, specially in the presence of analytical workloads. Thus, the change in the data model and the new analytical needs beyond OLAP take us to rethink methods and models to design and manage these newborn repositories. In this paper, we will analyze state of the art and future research directions.

References

[1]
R. Cattell. Scalable SQL and NoSQL data stores. SIGMOD Record, 39(4), 2010.
[2]
F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. E. Gruber. Bigtable: A distributed storage system for structured data. Trans. Comput. Syst., 26(2), 2008.
[3]
C. L. P. Chen and C. Zhang. Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. Inf. Sci., 275, 2014.
[4]
E. F. Codd. A relational model of data for large shared data banks. Commun. ACM, 13(6), 1970.
[5]
G. P. Copeland and S. Khoshafian. A decomposition storage model. In SIGMOD. ACM, 1985.
[6]
M. Fowler and P. J. Sadalage. Introduction to Polyglot Persistence: Using Different Data Storage Technologies for Varying Data Storage Needs. Addison-Wesley, 2012.
[7]
H. Garcia-Molina, J. D. Ullman, and J. Widom. Database Systems. Prentice Hall, 2009.
[8]
S. Ghemawat, H. Gobioff, and S. Leung. The Google file system. In SOSP. ACM, 2003.
[9]
M. Grover, T. Malaska, J. Seidman, and G. Shapira. Hadoop Application Architectures. O'Reilly, 2015.
[10]
H. Hultgren. Modeling the Agile Data Warehouse with Data Vault. New Hamilton, 2012.
[11]
H. V. Jagadish, J. Gehrke, A. Labrinidis, Y. Papakonstantinou, J. M. Patel, R. Ramakrishnan, and C. Shahabi. Big data and its technical challenges. Commun. ACM, 57(7), 2014.
[12]
D. Jardine. The ANSI/SPARC DBMS Model. North-Holland, 1977.
[13]
D. Karger, E. Lehman, T. Leighton, R. Panigrahy, M. Levine, and D. Lewin. Consistent hashing and random trees: Distributed caching protocols for relieving hot spots on the World Wide Web. In STOC. ACM, 1997.
[14]
N. Marz and J. Warren. Big Data: Principles and Best Practices of Scalable Realtime Data Systems. Manning Publications, 2015.
[15]
E. Meijer and G. M. Bierman. A co-relational model of data for large shared data banks. Commun. ACM, 54(4), 2011.
[16]
P. E. O'Neil, E. Cheng, D. Gawlick, and E. J. O'Neil. The Log-Structured Merge-Tree (LSM-Tree). Acta Inf., 33(4), 1996.
[17]
C. Ordonez, S. Maabout, D. S. Matusevich, and W. Cabrera. Extending ER models to capture database transformations to build data sets for data mining. Data & Know. Eng., 89, 2014.
[18]
O. Romero, V. Herrero, A. Abelló, and J. Ferrarons. Tuning small analytics on Big Data: Data partitioning and secondary indexes in the Hadoop ecosystem. Information Systems, 2015. In Press.
[19]
M. Stonebraker. Technical perspective - one size fits all: an idea whose time has come and gone. Commun. ACM, 51(12), 2008.
[20]
M. Stonebraker. What does "Big Data" mean? Blog@CACM, September 2012.
[21]
J. Varga, O. Romero, T. B. Pedersen, and C. Thomsen. Towards next generation BI systems: The analytical metadata challenge. In DaWaK. Springer, 2014.

Cited By

View all
  • (2022)Translating UML Class Diagram Models Into Key-Value Store Models2022 International Conference on Data and Software Engineering (ICoDSE)10.1109/ICoDSE56892.2022.9972034(155-160)Online publication date: 2-Nov-2022
  • (2022)Databases, Data Warehousing, and Data AnalyticsHandbook of Media and Communication Economics10.1007/978-3-658-34048-3_16-2(1-14)Online publication date: 23-Jun-2022
  • (2021)Run-time data analysis to drive compiler optimizationsCompanion Proceedings of the 2021 ACM SIGPLAN International Conference on Systems, Programming, Languages, and Applications: Software for Humanity10.1145/3484271.3484974(9-12)Online publication date: 17-Oct-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DOLAP '15: Proceedings of the ACM Eighteenth International Workshop on Data Warehousing and OLAP
October 2015
108 pages
ISBN:9781450337854
DOI:10.1145/2811222
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. big data
  2. database design
  3. nosql

Qualifiers

  • Short-paper

Funding Sources

  • Erasmus Mundus PhD program

Conference

CIKM'15
Sponsor:

Acceptance Rates

DOLAP '15 Paper Acceptance Rate 8 of 31 submissions, 26%;
Overall Acceptance Rate 29 of 79 submissions, 37%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Translating UML Class Diagram Models Into Key-Value Store Models2022 International Conference on Data and Software Engineering (ICoDSE)10.1109/ICoDSE56892.2022.9972034(155-160)Online publication date: 2-Nov-2022
  • (2022)Databases, Data Warehousing, and Data AnalyticsHandbook of Media and Communication Economics10.1007/978-3-658-34048-3_16-2(1-14)Online publication date: 23-Jun-2022
  • (2021)Run-time data analysis to drive compiler optimizationsCompanion Proceedings of the 2021 ACM SIGPLAN International Conference on Systems, Programming, Languages, and Applications: Software for Humanity10.1145/3484271.3484974(9-12)Online publication date: 17-Oct-2021
  • (2021)Model-Driven Engineering: From SQL Relational Database to Column—Oriented Database in Big Data ContextNetworking, Intelligent Systems and Security10.1007/978-981-16-3637-0_47(667-678)Online publication date: 2-Oct-2021
  • (2021)Datenbanken, Data Warehousing & Data AnalyticsHandbuch Medienökonomie10.1007/978-3-658-09560-4_16(313-327)Online publication date: 6-Jan-2021
  • (2020)Modeling and Management Big Data in Databases—A Systematic Literature ReviewSustainability10.3390/su1202063412:2(634)Online publication date: 15-Jan-2020
  • (2019)Big Data Processing and Big AnalyticsEmerging Technologies and Applications in Data Processing and Management10.4018/978-1-5225-8446-9.ch014(285-315)Online publication date: 2019
  • (2019)Integration of Relational and NoSQL DatabasesVietnam Journal of Computer Science10.1142/S219688881950021006:04(389-405)Online publication date: 6-Nov-2019
  • (2018)Formalizing the Mapping of UML Conceptual Schemas to Column-Oriented DatabasesInternational Journal of Data Warehousing and Mining10.4018/IJDWM.201807010314:3(44-68)Online publication date: 1-Jul-2018
  • (2018)SemLinker: automating big data integration for casual usersJournal of Big Data10.1186/s40537-018-0123-x5:1Online publication date: 26-Mar-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media