Abstract
Today many current and emerging applications require support for on-line analysis of rapidly changing data streams. Limitations of traditional DBMSs in supporting streaming applications have been recognized, prompting research to augment existing technologies and build new systems to manage streaming data. Stream-oriented systems are inherently geographically distributed and because distribution offers scalable load management and higher availability, future stream processing systems will operate in a distributed fashion. Moreover, service-based approaches have gained considerable attention recently for supporting distributed application development in e-business and e-science. In this paper, we present our innovative work to build a large scale distributed query processing over streaming data, this system has been designed as a WSRF-compliant application built on top of standard Web services technologies. Our distributed data stream Queries are written and evaluated over distributed resources discovered and accessed using emerging the WS-Resource Framework specifications. The data stream query processor has been designed and implemented as a collection of cooperating services, using the facilities of the WSRF to dynamically discover, access and use computational resources to support query compilation and evaluation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Liu, L., Pu, C., Tang, W.: Continual Queries for Internet-Scale Event-Driven Information Delivery. IEEE Trans. Knowledge and Data Eng. 11(4), 610–628 (1999)
Babcock., B., Babu., S., Datar., M., Motwani., R., Widom, J.: Models and Issues in Data Stream Systems. In: Proc. of the 2002 ACM Symp. on Principles of Database Systems (June 2002)
Carney., D., Cetintemel., U., Cherniack., M., Convey., C., Lee, S., Seidman., G., Stonebraker., M., Tatbul., N., Zdonik, S.: Monitoring Streams: A New Class of Data Management Applications. In: Proc. 28th Intl. Conf. on Very Large Data Bases (August 2002)
Stanford Stream Data Management (STREAM) Project, http://www-db.stanford.edu/stream
Hellerstein., J.M., Franklin., M.J., Chandrasekaran, S., Deshpande., A., Hildrum, K., Madden, S., Raman, V., Shah, M.A.: Adaptive Query Processing: Technology in Evolution. IEEE Data Engineering Bulletin 23(2), 7–18 (2000)
Johnson., T., Cranor., C., Spatsheck., O., Shkapenyuk, V.: Gigascope: A Stream Database for Network Applications. In: Proc. of the 2003 ACM SIGMOD Intl. Conf. on Management of Data (June 2003)
Cortes., C., Fisher., K., Pregibon., D., Rogers., A., Smith, F.: Hancock: A Language for Extracting Signatures from Data Streams. In: Proc. of the 2000 ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining, August 2000, pp. 9–17 (2000)
Niagara Project, http://www.cs.wisc.edu/niagara/
Parker, D.S., Muntz., R.R., Chau, H.L.: The Tangram Stream Query Processing System. In: Proc. of the 1989 Intl. Conf. on Data Engineering, February 1989, pp. 556–563 (1989)
Parker., D.S., Simon., E., Valduriez, P.: SVP: A Model Capturing Sets, Lists, Streams, and Parallelism. In: Proc. of the 1992 Intl. Conf. on Very Large Data Bases, August 1992, pp. 115–126 (1992)
Terry., D.B., Goldberg., D., Nichols., D., Oki, B.M.: Continuous Queries over Append-only Databases. In: SIGMOD Conference, pp. 321–330 (1992)
Sullivan, M.: Tribeca: A Stream Database Manager for Network Traffic Analysis. In: Proc. of the of the 1992 ACM SIGMOD Intl. Conf. on Management of Data, June 1992, pp. 321–330 (1992)
Foster, I., Frey, J., Graham, S., Tuecke, S., Czajkowski, K., Ferguson, D., Leymann, F., Nally, M., Storey, T., Weerawarana, S.: Modeling Stateful Resources with Web Services (2004), Available at http://www-106.ibm.com/developerworks/library/wsresource/ws-modelingresources.pdf
http://www-106.ibm.com/developerworks/webservices/library/ws-resource/
http://www.ibm.com/developerworks/webservices/library/ws-add/
Kossmann, D.: The State of The Art in Distributed Query Processing. ACM Computing Surveys 32(4), 422–469 (2000)
Graefe, G.: Encapsulation of Parallelism in The Volcano Query Processing System. In: ACM SIGMOD, pp. 102–111 (1990)
Graefe, G.: Query Evaluation Techniques for large databases. ACM Computing Surveys 25(4), 73–170 (1993)
Globus Project (2004), Argonne National Labs, http://www.globus.org
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Le, Jj., Liu, Jw. (2005). DDSQP: A WSRF-Based Distributed Data Stream Query System. In: Pan, Y., Chen, D., Guo, M., Cao, J., Dongarra, J. (eds) Parallel and Distributed Processing and Applications. ISPA 2005. Lecture Notes in Computer Science, vol 3758. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11576235_83
Download citation
DOI: https://doi.org/10.1007/11576235_83
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29769-7
Online ISBN: 978-3-540-32100-2
eBook Packages: Computer ScienceComputer Science (R0)