Abstract
Data streams are a prevalent and growing source of timely data. As streams become more prevalent, richer interrogation of the contents of the streams is required. Value of the content increases dramatically when streams are aggregated and distributed global behavior can be interrogated. In this paper, we demonstrate that access to multiple data streams should be viewed as one of deriving meaning from a distributed global snapshot. We define an architecture for a data streams resource based on the Data Access and Integration [2] model proposed in the Global Grid Forum. We demonstrate that access to streams by means of database queries can be intuitive. Finally, we discuss key research issues in realizing the data streams model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Altmel, M., Franklin, M.J.: Efficient filtering of XML documents for selective dissemination of information. In: Proceedings of 26th VLDB Conference (2000)
Antonioletti, M., Atkinson, M., Malaika, S., Laws, S., Paton, N., Pearson, D., Riccardi, G.: Grid data service specification. In: Global Grid Forum GWD-R (September 2003)
Antonioletti, M., Hong, N.C., Hume, A., Jackson, M., Krause, A., Nowell, J.: Experiences of designing and implementing grid database services in the ogsa-dai project. In: Global Grid Forum Workshop on Designing and Building Grid Services (September 2003)
Babu, S., Widom, J.: Continuous queries over data streams. In: International Conference on Management of Data, SIGMOD (2001)
Benford, S., et al.: e-Science from the antarctic to the GRID. In: Proceedings of UK e-Science All Hands Meeting (September 2003)
Beynon, M., Ferreira, R., Kurc, T., Sussman, A., Saltz, J.: Datacutter: Middleware for filtering very large scientific datasets on archival storage systems. In: Eighth Goddard Conference on Mass Storage Systems and Technologies/17th IEEE Symposium on Mass Storage Systems, College Park, Maryland (March 2000)
Cranoe, C., Johnson, T., Shkapenyuk, V., Spatscheck, O.: Gigascope: a stream database for network applications. In: International Conference on Management of Data, SIGMOD (2003)
Eisenhauer, G.: The ECho event delivery system. Technical Report GIT-CC-99- 08, College of Computing, Georgia Institute of Technology (1999), http://www.cc.gatech.edu/tech_reports
Fisher, S.: Relational model for information and monitoring. In: Global Grid Forum, GWD-Perf-7-1 (2001)
Fox, G., Pallickara, S.: An event service to support grid computational environments. Journal of Concurrency and Computation: Practice and Experience. Special Issue on Grid Computing Environments (2002)
Gawlick, D., Mishra, S.: Information sharing with the Oracle database (2003)
Golab, L., Ozsu, M.T.: Issues in data stream management. SIGMOD Record 32(2), 5–14 (2003)
Gupta, A.K., Suciu, D.: Stream processing of Xpath queries with predicates. In: International Conference on Management of Data, SIGMOD (2003)
Liu, L., Pu, C., Tang, W.: Continual queries for internet scale eventdriven information delivery. IEEE Transactions on Knowledge and Data Engineering, Special issue on Web Technologies (January 1999)
Madden, S., Franklin, M.J.: Fjording the stream: An architecture for queries over streaming sensor data. In: International Conference on Data Engineering ICDE (2002)
Nguyen, B., Abiteboul, S., Cobena, G., Preda, M.: Monitoring XML data on the web. In: International Conference on Management of Data, SIGMOD (2001)
Nippl, C., Rantzau, R., Mitschang, B.: Streamjon: A generic database approach to support the class of stream-oriented applications. In: International Database Engineering and Applications Symposium IDEAS (2000)
Plale, B., Schwan, K.: Dynamic querying of streaming data with the dQUOB system. IEEE Transactions in Parallel and Distributed Systems 14(4), 422–432 (2003)
Ribler, R., Vetter, J., Simitci, H., Reed, D.: Autopilot: Adaptive control of distributed applications. In: IEEE International High Performance Distributed Computing (HPDC) (August 1999)
(Plale) Schroeder, B., Aggarwal, S., Schwan, K.: Software approach to hazard detection using on-line analysis of safety constraints. In: Proceedings 16th Symposium on Reliable and Distributed Systems, SRDS 1997, October 1997, pp. 80–87. IEEE Computer Society, Los Alamitos (1997)
Sivaramakris, N., Kurc, T., Catalyurek, U., Saltz, J.: Database support for data-driven scientific applications in the grid. Parallel Processing Letters 13(2) (2003)
Yao, Y., Gehrke, J.: Query processing in sensor networks. In: First Biennial Conference on Innovative Data Systems Research, Asilomar, CA (January 2003)
Zhuang, S., Zhao, B., Joseph, A., Katz, R., Kubiatowicz, J.: Bayeux: An architecture for scalable and fault-tolerant wide area data dissemination. In: Proceedings Eleventh International Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV 2001) (June 2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Plale, B. (2004). Using Global Snapshots to Access Data Streams on the Grid. In: Dikaiakos, M.D. (eds) Grid Computing. AxGrids 2004. Lecture Notes in Computer Science, vol 3165. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28642-4_23
Download citation
DOI: https://doi.org/10.1007/978-3-540-28642-4_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22888-2
Online ISBN: 978-3-540-28642-4
eBook Packages: Springer Book Archive