Skip to main content

A Data Services Composition Approach for Continuous Query on Social Media Streams

  • Chapter
  • First Online:
Transactions on Large-Scale Data- and Knowledge-Centered Systems XL

Part of the book series: Lecture Notes in Computer Science ((TLDKS,volume 11360))

Abstract

We witness a rapid increase in the number of social media streams due to development of Web2.0, IoT and Cloud Computing technology. These sources include both traditional relational databases and streaming data from messaging infrastructure. We would like to use multiple social media streams to answer complex queries to enable information sharing and intelligence gathering for better collaboration. For this purpose, we adopt data services as the basic abstraction for both traditional relational databases and data streams retrieval. A flexible continuous data service model with continuous query as service operation is proposed. Service operation instance is modeled as a view defined on data streams. In the view, data part and time synchronization part are separated from each other. Based on the continuous data service model, we proposed a continuous data service composition algorithm for answering queries across data streams and relational data. The main idea is to find the contained rewriting of user query on views satisfying both data part and time synchronization part containment relationship. We also present use case and experimental studies that indicate that the approach is effective and efficient.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/declouddataservice/servicecomposition.

References

  1. Carey, M.J., Onose, N., Petropoulos, M.: Data services. Commun. ACM 55(6), 86–97 (2012)

    Article  Google Scholar 

  2. Vaculín, R., Chen, H., Neruda, R., Sycara, K.: Modeling and discovery of data providing services. In: 2008 IEEE International Conference on Web Services, pp. 54–61, September 2008

    Google Scholar 

  3. Barhamgi, M., Benslimane, D., Medjahed, B.: A query rewriting approach for web service composition. IEEE Trans. Serv. Comput. 3(3), 206–222 (2010)

    Article  Google Scholar 

  4. Zhou, L., Chen, H., Yu, T., Ma, J., Wu, Z.: Ontology-based scientific data service composition: a query rewriting-based approach. In: AAAI Spring Symposium: Semantic Scientific Knowledge Integration, pp. 116–121 (2008)

    Google Scholar 

  5. Zhang, F., Wang, G., Han, Y.: Automatic generation of service composition plans for correlated queries. In: 2013 10th Web Information System and Application Conference, pp. 143–149, November 2013

    Google Scholar 

  6. Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pp. 1–16. ACM (2002)

    Google Scholar 

  7. Ghanem, T.M., Elmagarmid, A.K., Larson, P.Å., Aref, W.G.: Supporting views in data stream management systems. ACM Trans. Database Syst. 35(1), 1–47 (2008)

    Article  Google Scholar 

  8. Zhao, W., Liu, C., Chen, J.: Automatic composition of information-providing web services based on query rewriting. Sci. China Inf. Sci. 55(11), 2428–2444 (2012)

    Article  Google Scholar 

  9. Barhamgi, M., Benslimane, D., Ouksel, A.M.: Composing and optimizing data providing web services. In: Proceedings of the 17th International Conference on World Wide Web, pp. 1141–1142. ACM (2008)

    Google Scholar 

  10. Doan, A., Halevy, A., Ives, Z.: Principles of Data Integration, 1st edn. Morgan Kaufmann Publishers Inc., San Francisco (2012)

    Google Scholar 

  11. Lemos, A.L., Daniel, F., Benatallah, B.: Web service composition: a survey of techniques and tools. ACM Comput. Surv. 48(3), 33:1–33:41 (2015)

    Article  Google Scholar 

  12. Pottinger, R., Halevy, A.: MiniCon: a scalable algorithm for answering queries using views. Int. J. Very Large Data Bases 10(2–3), 182–198 (2001)

    MATH  Google Scholar 

  13. Zaharia, M., Das, T., Li, H., Hunter, T., Shenker, S., Stoica, I.: Discretized streams: fault-tolerant streaming computation at scale. In: Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, SOSP 2013, pp. 423–438. ACM, New York (2013)

    Google Scholar 

  14. Wang, G., et al.: Building a replicated logging system with Apache Kafka. Proc. VLDB Endow. 8(12), 1654–1655 (2015)

    Article  Google Scholar 

  15. Fielding, R.T.: Architectural styles and the design of network-based software architectures. Ph.D. thesis, University of California (2000)

    Google Scholar 

  16. Hickson, I.: Server-sent events. https://www.w3.org/TR/eventsource/. Accessed 25 Oct 2015

  17. Newman, M.E.: Power laws, Pareto distributions and Zipf’s law. Contemp. Phys. 46(5), 323–351 (2005)

    Article  Google Scholar 

  18. Han, Y., Liu, C., Su, S., Zhu, M., Zhang, Z., Zhang, S.: A proactive service model facilitating stream data fusion and correlation. Int. J. Web Serv. Res. (IJWSR) 14(3), 1–16 (2017)

    Article  Google Scholar 

  19. Billet, B., Issarny, V., Texier, G.: Composing continuous services in a CoAP-based IoT. In: 2017 IEEE International Conference on AI Mobile Services (AIMS), pp. 46–53, June 2017

    Google Scholar 

  20. Han, Y., Wang, G., Yu, J., Liu, C., Zhang, Z., Zhu, M.: A service-based approach to traffic sensor data integration and analysis to support community-wide green commute in China. IEEE Trans. Intell. Transp. Syst. 17(9), 2648–2657 (2016)

    Article  Google Scholar 

  21. Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., Tzoumas, K.: Apache Flink: stream and batch processing in a single engine. Bull. IEEE Comput. Soc. Tech. Comm. Data Eng. 36(4), 28–38 (2015)

    Google Scholar 

  22. Confluent: KSQL: Streaming SQL for Apache Kafka. https://www.confluent.io/product/ksql/. Accessed 25 July 2018

  23. Genesereth, M.R., Keller, A.M., Duschka, O.M.: Infomaster: an information integration system. SIGMOD Rec. 26(2), 539–542 (1997)

    Article  Google Scholar 

  24. Levy, A.Y., Rajaraman, A., Ordille, J.J.: The world wide web as a collection of views: query processing in the information manifold. In: VIEWS, pp. 43–55 (1996)

    Google Scholar 

  25. Levy, A.Y., Rajaraman, A., Ordille, J.J.: Querying heterogeneous information sources using source descriptions. In: Proceedings of the 22th International Conference on Very Large Data Bases, VLDB 1996, pp. 251–262. Morgan Kaufmann Publishers Inc., San Francisco (1996)

    Google Scholar 

  26. Benedikt, M., Lopez-Serrano, R., Tsamoura, E.: Biological web services: integration, optimization, and reasoning. CEUR Workshop Proceedings, pp. 21–27 (2016)

    Google Scholar 

  27. Thakkar, S., Ambite, J.L., Knoblock, C.A.: Composing, optimizing, and executing plans for bioinformatics web services. VLDB J. 14(3), 330–353 (2005)

    Article  Google Scholar 

  28. Wang, G., Yang, S., Han, Y.: Mashroom: end-user mashup programming using nested tables. In: Proceedings of the 18th International Conference on World Wide Web, pp. 861–870. ACM (2009)

    Google Scholar 

  29. Han, Y., Wang, G., Ji, G., Zhang, P.: Situational data integration with data services and nested table. Serv. Oriented Comput. Appl. 7(2), 129–150 (2013)

    Article  Google Scholar 

Download references

Acknowledgments

This work is supported by Beijing Natural Science Foundation No. 4172018 (Building Stream Data Services for Spatio-Temporal Pattern Discovery in Cloud Computing Environment) and National Natural Science Foundation of China No. 61672042 (Models and Methodology of Data Services Facilitating Dynamic Correlation of Big Stream Data), and University Cooperation Projects Foundation of CETC Ocean Corp.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guiling Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer-Verlag GmbH Germany, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Wang, G., Zuo, X., Hesenius, M., Xu, Y., Han, Y., Gruhn, V. (2019). A Data Services Composition Approach for Continuous Query on Social Media Streams. In: Hameurlain, A., Wagner, R., Morvan, F., Tamine, L. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems XL. Lecture Notes in Computer Science(), vol 11360. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-58664-8_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-58664-8_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-58663-1

  • Online ISBN: 978-3-662-58664-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics