Abstract
We witness a rapid increase in the number of social media streams due to development of Web2.0, IoT and Cloud Computing technology. These sources include both traditional relational databases and streaming data from messaging infrastructure. We would like to use multiple social media streams to answer complex queries to enable information sharing and intelligence gathering for better collaboration. For this purpose, we adopt data services as the basic abstraction for both traditional relational databases and data streams retrieval. A flexible continuous data service model with continuous query as service operation is proposed. Service operation instance is modeled as a view defined on data streams. In the view, data part and time synchronization part are separated from each other. Based on the continuous data service model, we proposed a continuous data service composition algorithm for answering queries across data streams and relational data. The main idea is to find the contained rewriting of user query on views satisfying both data part and time synchronization part containment relationship. We also present use case and experimental studies that indicate that the approach is effective and efficient.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Carey, M.J., Onose, N., Petropoulos, M.: Data services. Commun. ACM 55(6), 86–97 (2012)
VaculÃn, R., Chen, H., Neruda, R., Sycara, K.: Modeling and discovery of data providing services. In: 2008 IEEE International Conference on Web Services, pp. 54–61, September 2008
Barhamgi, M., Benslimane, D., Medjahed, B.: A query rewriting approach for web service composition. IEEE Trans. Serv. Comput. 3(3), 206–222 (2010)
Zhou, L., Chen, H., Yu, T., Ma, J., Wu, Z.: Ontology-based scientific data service composition: a query rewriting-based approach. In: AAAI Spring Symposium: Semantic Scientific Knowledge Integration, pp. 116–121 (2008)
Zhang, F., Wang, G., Han, Y.: Automatic generation of service composition plans for correlated queries. In: 2013 10th Web Information System and Application Conference, pp. 143–149, November 2013
Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pp. 1–16. ACM (2002)
Ghanem, T.M., Elmagarmid, A.K., Larson, P.Å., Aref, W.G.: Supporting views in data stream management systems. ACM Trans. Database Syst. 35(1), 1–47 (2008)
Zhao, W., Liu, C., Chen, J.: Automatic composition of information-providing web services based on query rewriting. Sci. China Inf. Sci. 55(11), 2428–2444 (2012)
Barhamgi, M., Benslimane, D., Ouksel, A.M.: Composing and optimizing data providing web services. In: Proceedings of the 17th International Conference on World Wide Web, pp. 1141–1142. ACM (2008)
Doan, A., Halevy, A., Ives, Z.: Principles of Data Integration, 1st edn. Morgan Kaufmann Publishers Inc., San Francisco (2012)
Lemos, A.L., Daniel, F., Benatallah, B.: Web service composition: a survey of techniques and tools. ACM Comput. Surv. 48(3), 33:1–33:41 (2015)
Pottinger, R., Halevy, A.: MiniCon: a scalable algorithm for answering queries using views. Int. J. Very Large Data Bases 10(2–3), 182–198 (2001)
Zaharia, M., Das, T., Li, H., Hunter, T., Shenker, S., Stoica, I.: Discretized streams: fault-tolerant streaming computation at scale. In: Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, SOSP 2013, pp. 423–438. ACM, New York (2013)
Wang, G., et al.: Building a replicated logging system with Apache Kafka. Proc. VLDB Endow. 8(12), 1654–1655 (2015)
Fielding, R.T.: Architectural styles and the design of network-based software architectures. Ph.D. thesis, University of California (2000)
Hickson, I.: Server-sent events. https://www.w3.org/TR/eventsource/. Accessed 25 Oct 2015
Newman, M.E.: Power laws, Pareto distributions and Zipf’s law. Contemp. Phys. 46(5), 323–351 (2005)
Han, Y., Liu, C., Su, S., Zhu, M., Zhang, Z., Zhang, S.: A proactive service model facilitating stream data fusion and correlation. Int. J. Web Serv. Res. (IJWSR) 14(3), 1–16 (2017)
Billet, B., Issarny, V., Texier, G.: Composing continuous services in a CoAP-based IoT. In: 2017 IEEE International Conference on AI Mobile Services (AIMS), pp. 46–53, June 2017
Han, Y., Wang, G., Yu, J., Liu, C., Zhang, Z., Zhu, M.: A service-based approach to traffic sensor data integration and analysis to support community-wide green commute in China. IEEE Trans. Intell. Transp. Syst. 17(9), 2648–2657 (2016)
Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., Tzoumas, K.: Apache Flink: stream and batch processing in a single engine. Bull. IEEE Comput. Soc. Tech. Comm. Data Eng. 36(4), 28–38 (2015)
Confluent: KSQL: Streaming SQL for Apache Kafka. https://www.confluent.io/product/ksql/. Accessed 25 July 2018
Genesereth, M.R., Keller, A.M., Duschka, O.M.: Infomaster: an information integration system. SIGMOD Rec. 26(2), 539–542 (1997)
Levy, A.Y., Rajaraman, A., Ordille, J.J.: The world wide web as a collection of views: query processing in the information manifold. In: VIEWS, pp. 43–55 (1996)
Levy, A.Y., Rajaraman, A., Ordille, J.J.: Querying heterogeneous information sources using source descriptions. In: Proceedings of the 22th International Conference on Very Large Data Bases, VLDB 1996, pp. 251–262. Morgan Kaufmann Publishers Inc., San Francisco (1996)
Benedikt, M., Lopez-Serrano, R., Tsamoura, E.: Biological web services: integration, optimization, and reasoning. CEUR Workshop Proceedings, pp. 21–27 (2016)
Thakkar, S., Ambite, J.L., Knoblock, C.A.: Composing, optimizing, and executing plans for bioinformatics web services. VLDB J. 14(3), 330–353 (2005)
Wang, G., Yang, S., Han, Y.: Mashroom: end-user mashup programming using nested tables. In: Proceedings of the 18th International Conference on World Wide Web, pp. 861–870. ACM (2009)
Han, Y., Wang, G., Ji, G., Zhang, P.: Situational data integration with data services and nested table. Serv. Oriented Comput. Appl. 7(2), 129–150 (2013)
Acknowledgments
This work is supported by Beijing Natural Science Foundation No. 4172018 (Building Stream Data Services for Spatio-Temporal Pattern Discovery in Cloud Computing Environment) and National Natural Science Foundation of China No. 61672042 (Models and Methodology of Data Services Facilitating Dynamic Correlation of Big Stream Data), and University Cooperation Projects Foundation of CETC Ocean Corp.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer-Verlag GmbH Germany, part of Springer Nature
About this chapter
Cite this chapter
Wang, G., Zuo, X., Hesenius, M., Xu, Y., Han, Y., Gruhn, V. (2019). A Data Services Composition Approach for Continuous Query on Social Media Streams. In: Hameurlain, A., Wagner, R., Morvan, F., Tamine, L. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems XL. Lecture Notes in Computer Science(), vol 11360. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-58664-8_2
Download citation
DOI: https://doi.org/10.1007/978-3-662-58664-8_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-58663-1
Online ISBN: 978-3-662-58664-8
eBook Packages: Computer ScienceComputer Science (R0)