ABSTRACT
In this paper, we highlight open problems and actual research trends in the field of Data Warehousing and OLAP over Big Data, an emerging term in Data Warehousing and OLAP research. We also derive several novel research directions arising in this field, and put emphasis on possible contributions to be achieved by future research efforts.
- Cuzzocrea, A., Song, I.-Y., and Davis, K. C. Analytics over Large-Scale Multidimensional Data: The Big Data Revolution! Proc. of ACM DOLAP, 2011. Google ScholarDigital Library
- Gray, J., Chaudhuri, S., Bosworth, A., Layman, A., Reichart, D., Venkatrao, M., Pellow, F., and Pirahesh, H. Data Cube: A Relational Aggregation Operator Generalizing Group-by, Cross-Tab, and Sub Totals. Data Mining and Knowledge Discovery 1(1), 1997. Google ScholarDigital Library
- Harinarayan, V., Rajaraman, A., and Ullman, J. D. Implementing Data Cubes Efficiently. Proc. of SIGMOD Conference, 1996. Google ScholarDigital Library
- Chen, C., Yan, X., Zhu, F., Han, J., and Yu, P. S. Graph OLAP: A Multi-Dimensional Framework for Graph Data Analysis. Knowledge and Information Systems 21(1), 2009. Google ScholarDigital Library
- Jensen, M. R., Møller, T. H., and Pedersen, T. B. Specifying OLAP Cubes on XML Data. Proc. of SSDBM, 2001. Google ScholarDigital Library
- Zhao, P., Li, X., Xin, D., and Han, J. Graph Cube: On Warehousing And OLAP Multidimensional Networks. Proc. of ACM SIGMOD, 2011. Google ScholarDigital Library
- Yuan, Y., Lin, X., Liu, Q., Wang, W., Yu, J. X., and Zhang, Q. Efficient Computation of the Skyline Cube. Proc. of VLDB, 2005. Google ScholarDigital Library
- Dehne, F. K. H. A., Eavis,T., and Rau-Chaplin, A. The cgmCUBE Project: Optimizing Parallel Data Cube Generation for ROLAP. Distributed and Parallel Databases 19(1), 2006. Google ScholarDigital Library
- Sitaridi, E. A., and Ross, K. A. Ameliorating Memory Contention of OLAP Operators on GPU Processors. Proc. of ACM DaMoN, 2012. Google ScholarDigital Library
- Sarawagi, S., Agrawal, R., and Megiddo, N. Discovery-Driven Exploration of OLAP Data Cubes. Proc. of EDBT, 1998. Google ScholarDigital Library
- Abouzeid, A., Bajda-Pawlikowski, K., Abadi, D. J., Rasin, A., and Silberschatz, A. HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads. PVLDB 2(1), 2009. Google ScholarDigital Library
- Agrawal, D., Das, D., and El Abbadi, A. Big Data and Cloud Computing: Current State and Future Opportunities. Proc. of EDBT, 2011. Google ScholarDigital Library
- Cuzzocrea, A., and Bertino, E. Privacy Preserving OLAP over Distributed XML Data: A Theoretically-Sound Secure-Multiparty-Computation Approach. Journal of Computer and System Sciences 77(6), 2011. Google ScholarDigital Library
- Cattell, R. Scalable SQL and NoSQL Data Stores. SIGMOD Record 39(4), 2010. Google ScholarDigital Library
- Cuzzocrea, A., and Saccà, D. Balancing Accuracy and Privacy of OLAP Aggregations on Data Cubes. Proc. of DOLAP, 2010. Google ScholarDigital Library
- Bellatreche, L., Cuzzocrea, A., and Benkrid, S. Effectively and Efficiently Designing and Querying Parallel Relational Data Warehouses on Heterogeneous Database Clusters: The F&A Approach. Journal of Database Management 23(4), 2012.Google ScholarDigital Library
- Cohen, J., Dolan, B., Dunlap, M., Hellerstein, J. M., and Welton, C. MAD Skills: New Analysis Practices for Big Data. PVLDB 2(2), 2009. Google ScholarDigital Library
- Dean, J., and Ghemawat, S. MapReduce: Simplified Data Processing on Large Clusters. Communications of the ACM 51(1), 2008. Google ScholarDigital Library
- Khouri, S., Bellatreche, L., and Berkani, N. MODETL: A Complete MODeling and ETL Method for Designing Data Warehouses from Semantic Databases. Proc. of COMAD, 2012.Google ScholarDigital Library
- Khouri, S., and Bellatreche, L. DWOBS: Data Warehouse Design from Ontology-Based Sources. Proc. of DASFAA, 2011. Google ScholarDigital Library
- Herodotou, H., Lim, H., Luo, G., Borisov, N., Dong, L., Cetin, F. B., and Babu, S. Starfish: A Self-Tuning System for Big Data Analytics. Proc. of CIDIR, 2011.Google Scholar
- Jiang, D., Ooi, B.C., Shi, L., and Wu, S. The Performance of MapReduce: An In-depth Study. PVLDB 3(1), 2010. Google ScholarDigital Library
- Thusoo, A. Sarma, J.S., Jain, N., Shao, Z., Chakka, P. Zhang, N., Antony, S., Liu, H., and Murthy, R. Hive - A Petabyte Scale Data Warehouse Using Hadoop. Proc. of ICDE, 2010.Google ScholarCross Ref
- Bizer, C., Boncz, P. A., Brodie, M. L., and Erling, O. The Meaningful Use of Big Data: Four Perspectives - Four Challenges. SIGMOD Record 40(4), 2011. Google ScholarDigital Library
- Chen, Y., Alspaugh, S., and Katz, R. H. Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads. PVLDB 5(12), 2012. Google ScholarDigital Library
- Cuzzocrea, A., Saccà, D., and Serafino, P. Semantics-Aware Advanced OLAP Visualization of Multidimensional Data Cubes. International Journal of Data Warehousing and Mining 3(4), 2007.Google Scholar
- Cuzzocrea, A., Saccà, D., and Serafino, P. A Hierarchy-Driven Compression Technique for Advanced OLAP Visualization of Multidimensional Data Cubes. Proc. of DaWaK, 2006. Google ScholarDigital Library
- Cuzzocrea, A. Retrieving Accurate Estimates to OLAP Queries over Uncertain and Imprecise Multidimensional Data Streams. Proc. of SSDBM, 2011. Google ScholarDigital Library
- Cuzzocrea, A., and Chakravarthy, S. Event-based Lossy Compression for Effective and Efficient OLAP over Data Streams. Data and Knowledge Engineering 69(7), 2010. Google ScholarDigital Library
- Cuzzocrea, A. Providing Probabilistically-Bounded Approximate Answers to Non-Holistic Aggregate Range Queries in OLAP. Proc. of ACM DOLAP, 2005. Google ScholarDigital Library
Index Terms
- Data warehousing and OLAP over big data: current challenges and future research directions
Recommendations
Analytics over large-scale multidimensional data: the big data revolution!
DOLAP '11: Proceedings of the ACM 14th international workshop on Data Warehousing and OLAPIn this paper, we provide an overview of state-of-the-art research issues and achievements in the field of analytics over big data, and we extend the discussion to analytics over big multidimensional data as well, by highlighting open problems and ...
Warehousing and Protecting Big Data: State-Of-The-Art-Analysis, Methodologies, Future Challenges
ICC '16: Proceedings of the International Conference on Internet of things and Cloud ComputingThis paper proposes a comprehensive critical survey on the issues of warehousing and protecting big data, which are recognized as critical challenges of emerging big data research. Indeed, both are critical aspects to be considered in order to build ...
Data Warehousing in Big Data: From Multidimensional to Tabular Data Models
C3S2E '16: Proceedings of the Ninth International C* Conference on Computer Science & Software EngineeringData warehouses are central pieces in business intelligence and analytics as these repositories ensure proper data storage and querying, being supported by data models that allow the analysis of data by different perspectives. Those perspectives support ...
Comments