skip to main content
10.1145/3297280.3297444acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
research-article

A RESTful architecture for data exploration as a service

Authors Info & Claims
Published:08 April 2019Publication History

ABSTRACT

Data analysis process typically starts in an exploration phase, where the goal is to gain an understanding of the underlying data. In this phase, analysts make multiple queries and expect answers from the data services. Existing data services do not meet the needs for online data exploration in practice. Some services only support the data analysts to pull the whole dataset before analysis. Others allow analysts to make one-off queries and do not provide any guidance for exploring data. In this paper, we address these limitations of data services by proposing the Data Exploration as a Service (DEaaS) approach. Our RESTful service architecture and resource design provide explicit support for interactive data exploration. In addition, we use historical query information and predefined analytics semantics based on a multidimensional data model to recommend resources to analysts and guide them through the exploration process. We evaluate DEaaS using data exploration processes on both synthetic and real-life datasets. The experimental results show that our solution adapts to different data sources and the proposed resource navigation approach can make DEaaS outperform existing data services in data exploration.

References

  1. Iman Avazpour, John Grundy, and Liming Zhu. 2016. V for variety: Lessons learned from complex smart cities data harmonization and integration. In Pervasive Computing and Communication Workshops (PerCom Workshops), 2016 IEEE International Conference on. 1--6.Google ScholarGoogle ScholarCross RefCross Ref
  2. Ada Bagozi, Devis Bianchini, Valeria De Antonellis, Alessandro Marini, and Davide Ragazzi. 2017. Interactive data exploration as a service for the smart factory. In Web Services (ICWS), 2017 IEEE International Conference on. IEEE, 293--300.Google ScholarGoogle ScholarCross RefCross Ref
  3. Ada Bagozi, Devis Bianchini, Valeria De Antonellis, Alessandro Marini, and Davide Ragazzi. 2017. Summarisation and relevance evaluation techniques for big data exploration: the smart factory case study. In International Conference on Advanced Information Systems Engineering. Springer, 264--279.Google ScholarGoogle ScholarCross RefCross Ref
  4. Wolf-Tilo Balke and Matthias Wagner. 2003. Towards Personalized Selection of Web Services.. In WWW. 20--24.Google ScholarGoogle Scholar
  5. Leilani Battle, Michael Stonebraker, and Remco Chang. 2013. Dynamic reduction of query result sets for interactive visualizaton. In IEEE Workshop on Big Data Visualization. 1--8.Google ScholarGoogle ScholarCross RefCross Ref
  6. Marcello Buoncristiano, Giansalvatore Mecca, Elisa Quintarelli, Manuel Roveri, Donatello Santoro, and Letizia Tanca. 2015. Database challenges for exploratory computing. ACM SIGMOD Record 44, 2 (2015), 17--22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Ugur Cetintemel, Mitch Cherniack, Justin DeBrabant, Yanlei Diao, Kyriaki Dimitriadou, Alexander Kalinin, Olga Papaemmanouil, and Stanley B Zdonik. 2013. Query Steering for Interactive Data Exploration.. In CIDR.Google ScholarGoogle Scholar
  8. Surajit Chaudhuri and Umeshwar Dayal. 1997. An overview of data warehousing and OLAP technology. ACM Sigmod record 26, 1 (1997), 65--74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Alfredo Cuzzocrea. 2013. Analytics over big data: Exploring the convergence of datawarehousing, OLAP and data-intensive cloud infrastructures. In Computer Software and Applications Conference (COMPSAC), 2013 IEEE 37th Annual. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Kyriaki Dimitriadou, Olga Papaemmanouil, and Yanlei Diao. 2014. Explore-by-example: An automatic query steering framework for interactive data exploration. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. ACM, 517--528. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Marina Drosou and Evaggelia Pitoura. 2013. YmalDB: exploring relational databases via result-driven recommendations. The VLDB Journal 22, 6 (2013), 849--874. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Roy T Fielding and Richard N Taylor. 2000. Architectural styles and the design of network-based software architectures. University of California, Irvine Doctoral dissertation. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Arnaud Giacometti, Patrick Marcel, Elsa Negre, and Arnaud Soulet. 2009. Query recommendations for OLAP discovery driven analysis. In Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP. ACM, 81--88. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Stratos Idreos, Olga Papaemmanouil, and Surajit Chaudhuri. 2015. Overview of data exploration techniques. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data. ACM, 277--281. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Guosheng Kang, Jianxun Liu, Mingdong Tang, Xiaoqing Liu, Buqing Cao, and Yu Xu. 2012. AWSR: Active web service recommendation based on usage history. In Proceedings of international conference on Web services. IEEE, 186--193. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Liwei Liu, Freddy Lecue, and Nikolay Mehandjiev. 2013. Semantic content-based recommendation of software services using context. ACM Transactions on the Web (TWEB) 7, 3 (2013), 17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Fabrizio Luccio, Antonio Mesa Enriquez, P Olivares Rieumont, and Linda Pagli. 2001. Exact rooted subtree matching in sublinear time.Google ScholarGoogle Scholar
  18. Liam O'Brien, Paulo Merson, and Len Bass. 2007. Quality attributes for service-oriented architectures. In Proceedings of the international Workshop on Systems Development in SOA Environments. IEEE Computer Society, 3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Kai Petersen and Cigdem Gencel. 2013. Worldviews, research methods, and their relationship to validity in empirical software engineering research. In Software Measurement and the 2013 Eighth International Conference on Software Process and Product Measurement (IWSM-MENSURA), 2013 Joint Conference of the 23rd International Workshop on. IEEE, 81--89. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Suhrid Satyal, Ingo Weber, Hye-young Paik, Claudio Di Ciccio, and Jan Mendling. 2018. Business Process Improvement with the AB-BPM Methodology. Information Systems (2018).Google ScholarGoogle Scholar
  21. Rami Sellami, Sami Bhiri, and Bruno Defude. 2014. ODBAPI: a unified REST API for relational and NoSQL data stores. In 2014 IEEE International Congress on Big Data. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Hua Xiao, Ying Zou, Joanna Ng, and Leho Nigul. 2010. An approach for context-aware service discovery and recommendation. In Proceedings of international conference on Web services. 163--170. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Yun Zhang, Liming Zhu, Xiwei Xu, Shiping Chen, and An Binh Tran. 2018. Data Service API Design for Data Analytics. In International Conference on Services Computing. Springer, 87--102.Google ScholarGoogle Scholar
  24. Hao Ma Michael R. Lyu Zheng, Zibin and Irwin King. 2011. Qos-aware web service recommendation by collaborative filtering. IEEE Transactions on services computing 4, 2 (2011), 140--152. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Zibin Zheng, Jieming Zhu, and Michael R Lyu. {n. d.}. Service-generated big data and big data-as-a-service: an overview. In 2013 IEEE International Congress on Big Data. 403--410. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A RESTful architecture for data exploration as a service

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            SAC '19: Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing
            April 2019
            2682 pages
            ISBN:9781450359337
            DOI:10.1145/3297280

            Copyright © 2019 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 8 April 2019

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate1,650of6,669submissions,25%
          • Article Metrics

            • Downloads (Last 12 months)8
            • Downloads (Last 6 weeks)0

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader