ABSTRACT
Data analysis process typically starts in an exploration phase, where the goal is to gain an understanding of the underlying data. In this phase, analysts make multiple queries and expect answers from the data services. Existing data services do not meet the needs for online data exploration in practice. Some services only support the data analysts to pull the whole dataset before analysis. Others allow analysts to make one-off queries and do not provide any guidance for exploring data. In this paper, we address these limitations of data services by proposing the Data Exploration as a Service (DEaaS) approach. Our RESTful service architecture and resource design provide explicit support for interactive data exploration. In addition, we use historical query information and predefined analytics semantics based on a multidimensional data model to recommend resources to analysts and guide them through the exploration process. We evaluate DEaaS using data exploration processes on both synthetic and real-life datasets. The experimental results show that our solution adapts to different data sources and the proposed resource navigation approach can make DEaaS outperform existing data services in data exploration.
- Iman Avazpour, John Grundy, and Liming Zhu. 2016. V for variety: Lessons learned from complex smart cities data harmonization and integration. In Pervasive Computing and Communication Workshops (PerCom Workshops), 2016 IEEE International Conference on. 1--6.Google ScholarCross Ref
- Ada Bagozi, Devis Bianchini, Valeria De Antonellis, Alessandro Marini, and Davide Ragazzi. 2017. Interactive data exploration as a service for the smart factory. In Web Services (ICWS), 2017 IEEE International Conference on. IEEE, 293--300.Google ScholarCross Ref
- Ada Bagozi, Devis Bianchini, Valeria De Antonellis, Alessandro Marini, and Davide Ragazzi. 2017. Summarisation and relevance evaluation techniques for big data exploration: the smart factory case study. In International Conference on Advanced Information Systems Engineering. Springer, 264--279.Google ScholarCross Ref
- Wolf-Tilo Balke and Matthias Wagner. 2003. Towards Personalized Selection of Web Services.. In WWW. 20--24.Google Scholar
- Leilani Battle, Michael Stonebraker, and Remco Chang. 2013. Dynamic reduction of query result sets for interactive visualizaton. In IEEE Workshop on Big Data Visualization. 1--8.Google ScholarCross Ref
- Marcello Buoncristiano, Giansalvatore Mecca, Elisa Quintarelli, Manuel Roveri, Donatello Santoro, and Letizia Tanca. 2015. Database challenges for exploratory computing. ACM SIGMOD Record 44, 2 (2015), 17--22. Google ScholarDigital Library
- Ugur Cetintemel, Mitch Cherniack, Justin DeBrabant, Yanlei Diao, Kyriaki Dimitriadou, Alexander Kalinin, Olga Papaemmanouil, and Stanley B Zdonik. 2013. Query Steering for Interactive Data Exploration.. In CIDR.Google Scholar
- Surajit Chaudhuri and Umeshwar Dayal. 1997. An overview of data warehousing and OLAP technology. ACM Sigmod record 26, 1 (1997), 65--74. Google ScholarDigital Library
- Alfredo Cuzzocrea. 2013. Analytics over big data: Exploring the convergence of datawarehousing, OLAP and data-intensive cloud infrastructures. In Computer Software and Applications Conference (COMPSAC), 2013 IEEE 37th Annual. Google ScholarDigital Library
- Kyriaki Dimitriadou, Olga Papaemmanouil, and Yanlei Diao. 2014. Explore-by-example: An automatic query steering framework for interactive data exploration. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. ACM, 517--528. Google ScholarDigital Library
- Marina Drosou and Evaggelia Pitoura. 2013. YmalDB: exploring relational databases via result-driven recommendations. The VLDB Journal 22, 6 (2013), 849--874. Google ScholarDigital Library
- Roy T Fielding and Richard N Taylor. 2000. Architectural styles and the design of network-based software architectures. University of California, Irvine Doctoral dissertation. Google ScholarDigital Library
- Arnaud Giacometti, Patrick Marcel, Elsa Negre, and Arnaud Soulet. 2009. Query recommendations for OLAP discovery driven analysis. In Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP. ACM, 81--88. Google ScholarDigital Library
- Stratos Idreos, Olga Papaemmanouil, and Surajit Chaudhuri. 2015. Overview of data exploration techniques. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data. ACM, 277--281. Google ScholarDigital Library
- Guosheng Kang, Jianxun Liu, Mingdong Tang, Xiaoqing Liu, Buqing Cao, and Yu Xu. 2012. AWSR: Active web service recommendation based on usage history. In Proceedings of international conference on Web services. IEEE, 186--193. Google ScholarDigital Library
- Liwei Liu, Freddy Lecue, and Nikolay Mehandjiev. 2013. Semantic content-based recommendation of software services using context. ACM Transactions on the Web (TWEB) 7, 3 (2013), 17. Google ScholarDigital Library
- Fabrizio Luccio, Antonio Mesa Enriquez, P Olivares Rieumont, and Linda Pagli. 2001. Exact rooted subtree matching in sublinear time.Google Scholar
- Liam O'Brien, Paulo Merson, and Len Bass. 2007. Quality attributes for service-oriented architectures. In Proceedings of the international Workshop on Systems Development in SOA Environments. IEEE Computer Society, 3. Google ScholarDigital Library
- Kai Petersen and Cigdem Gencel. 2013. Worldviews, research methods, and their relationship to validity in empirical software engineering research. In Software Measurement and the 2013 Eighth International Conference on Software Process and Product Measurement (IWSM-MENSURA), 2013 Joint Conference of the 23rd International Workshop on. IEEE, 81--89. Google ScholarDigital Library
- Suhrid Satyal, Ingo Weber, Hye-young Paik, Claudio Di Ciccio, and Jan Mendling. 2018. Business Process Improvement with the AB-BPM Methodology. Information Systems (2018).Google Scholar
- Rami Sellami, Sami Bhiri, and Bruno Defude. 2014. ODBAPI: a unified REST API for relational and NoSQL data stores. In 2014 IEEE International Congress on Big Data. Google ScholarDigital Library
- Hua Xiao, Ying Zou, Joanna Ng, and Leho Nigul. 2010. An approach for context-aware service discovery and recommendation. In Proceedings of international conference on Web services. 163--170. Google ScholarDigital Library
- Yun Zhang, Liming Zhu, Xiwei Xu, Shiping Chen, and An Binh Tran. 2018. Data Service API Design for Data Analytics. In International Conference on Services Computing. Springer, 87--102.Google Scholar
- Hao Ma Michael R. Lyu Zheng, Zibin and Irwin King. 2011. Qos-aware web service recommendation by collaborative filtering. IEEE Transactions on services computing 4, 2 (2011), 140--152. Google ScholarDigital Library
- Zibin Zheng, Jieming Zhu, and Michael R Lyu. {n. d.}. Service-generated big data and big data-as-a-service: an overview. In 2013 IEEE International Congress on Big Data. 403--410. Google ScholarDigital Library
Index Terms
- A RESTful architecture for data exploration as a service
Recommendations
Overview of Data Exploration Techniques
SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of DataData exploration is about efficiently extracting knowledge from data even if we do not know exactly what we are looking for. In this tutorial, we survey recent developments in the emerging area of database systems tailored for data exploration. We ...
Data exploration: a roll call of all user-data interaction functionality
ExploreDB '16: Proceedings of the Third International Workshop on Exploratory Search in Databases and the WebData exploration encompasses a variety of interaction types and data functionality, such as search, data analysis, curation, constraint satisfaction, data mining, and visualization. Data exploration naturally begins when a user is given a set of data ...
Interactive data exploration using semantic windows
SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of DataWe present a new interactive data exploration approach, called Semantic Windows (SW), in which users query for multidimensional "windows" of interest via standard DBMS-style queries enhanced with exploration constructs. Users can specify SWs using (i) ...
Comments