Abstract
Motivated by automating enterprise information derivation processes, we propose a new kind of business process - Data-Continuous SQL Process (DCSP), which is data-stream driven and continuously running.
The basic operators of a DCSP are database User Defined Functions (UDFs). However, we introduce a special kind of UDFs - Relation Valued Functions (RVFs) with both input and return values specified as relations. An RVF represents a relational transformation and can be composed with other relational operators. We allow an RVF to be triggered repeatedly by stream inputs, timers or event-conditions.Thesequence of executions generates a data stream. To capture such data continuation semantics we introduce the notion of station for hosting a continuously-executed RVF, and the notion ofpipe as the FIFO stream container for asynchronous communication between stations. A station is specified with the triggering factors and the outgoing pipes. A pipe is strongly typed by a relation schema with a stream key for identifying its elements. As an abstract object, a pipe can be implemented as aqueue or stream table.
To allow a DCSP to be constructed from stations and pipes recursively, we introduce the notion of Data Continuous Query (DCQ) that is a query applied to a stream data source – a stream table, a station (via pipe) or recursively a DCQ, with well defined data continuation semantics. A DCQ itself can be treated as a station, meaning that stations can be constructed from existing ones recursively in terms of SQL. Based on these notions a DCSP is modeled as a graph of stations (nodes) and pipes (links) and represented by a set of correlated DCQs. Specifying DCSP in SQL allows us to take advantage of SQL in expressing relational transformations on the stream elements, and potentially in pushing DCSP execution down to the database layer for performance and scalability. The implementation issues based on parallel database technology are discussed.
The proposed approach represents a major shift in process management from one-time execution to data stream driven, open-ended execution, and an initial step in bringing BPM technology and database technology together under the data-continuation semantics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abadi, D., Carney, D., Stonebraker, M., Zdonik, S., et al.: Aurora: a new model and architecture for data stream management. VLDB Journal (2003)
Arasu, A., Babu, S., Widom, J.: The CQL Continuous Query Language: Semantic Foundations and Query Execution. VLDB J. (2004)
Arasu, B., Babcock, S., Babu, M., Datar, K., Ito, I., Nishizawa, J., Rosenstein, J.: STREAM: The Stanford Stream Data Manager. In: Proceedings of SIGMOD (2003)
Avnur, R., Hellerstein, J.M.: Eddies: Continuously adaptive query processing. In: ACM SIGMOD, Dallas, TX (May 2000)
Chen, J., DeWitt, D., Naughton, J.: Design and evaluation of alternative selection placement strategies in optimizing continuous queries. In: ICDE, CA (2002)
Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Principles of Database Systems (2002)
Chen, Q., Hsu, M.: User Defined Partitioning - Group Data based on Computation Model. In: Proc. 10th International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2008) (2008)
Chen, Q., Hsu, M.: Correlated Query Process and P2P Execution. In: Hameurlain, A. (ed.) Globe 2008. LNCS, vol. 5187, pp. 82–92. Springer, Heidelberg (2008)
Chen, Q., Hsu, M.: CPM Revisited – An Architecture Comparison. In: Proc. of 10th Int’l Conf on Cooperative Information Systems (Coopis 2002), USA (2002)
Chen, Q., Hsu, M.: Inter-Enterprise Collaborative Business Process Management. In: Proc. of 17th Int’l Conf on Data Engineering (ICDE 2001), Germany (2001)
Chen, Q., Dayal, U.: Multi-Agent Cooperative Transactions for E-Commerce. In: Proc. Fifth IFCIS Conference on Cooperative Information Systems (CoopIS 2000) (2000)
Qiming Chen, P., Chundi, U., Dayal, M., Hsu, M.: Dynamic-Agents for Dynamic Service Provision. In: Proc. of 3rd Int Conf. on Cooperative Information Systems (CoopIS 1998) (1998)
Chen, Q., Kambayashi, Y.: Nested Relation Based Database Knowledge Representation. In: Proc. of ACM SIGMOD 1991 (ACM SIGMOD Rec. 20(2)) (1991)
Dayal, U., Hsu, M., Ladin, R.: A Transaction Model for Long-Running Activities. In: VLDB 1991 (1991)
Gray, J., Liu, D.T., Nieto-Santisteban, M.A., Szalay, A.S., Heber, G., DeWitt, D.: Scientific Data Management in the Coming Decade. SIGMOD Record 34(4) (2005)
Hanlon, M., Klein, J., Van der Linden, R., Zeller, H.: Publish/subscribe in NonStop SQL: transactional streams in a relational context. In: ICDE 2004 (2004)
Hsu, M., Xiong, Y.: Building a Scalable Web Query System. DNIS (2007)
HP Neoview enterprise datawarehousing platform, http://h71028.www7.hp.com/ERC/downloads/4AA0-7932ENW.pdf
IBM DB2 Universal Database V8.1 (2004), http://www.ibm.com/software/data/db2
IBM, MQSeries An Introduction to Messaging and Queuing, ftp://ftp.software.ibm.com/software/mqseries/pdf/horaa101.pdf
Isard, M., Budiu, M., Yu, Y., Birrell, A., Fetterly, D.: Dryad: Distributed data-parallel programs from sequential building blocks. In: EuroSys 2007 (March 2007)
Law, Y.-N., Wang, H., Zaniolo, C.: Query languages and data models for database sequences and data streams. In: VLDB 2004 (2004)
Madden, S., Shah, M., Raman, J.M.H.V.: Continuously Adaptive Continuous Queries over Stream. ACM-SIGMOD (1992)
Carino, F., O’Connell, W.: “Plan-per-tuple Optimization Solution – Parallel Execution of Expensive User Defined Functions in Object-Relational DBMS. In: VLDB (1998)
O’Connell, W., et al.: A Teradata Content-Based Multimedia Object Manager for Massively Parallel Architectures. In: ACM-SIGMOD Conf., Canada (1996)
Olston, C., Reed, B., Srivastava, U., Kumar, R., Tomkins, A.: Pig Latin: A Not-So-Foreign Language for Data Processing. ACM SIGMOD (2008)
Oracle, Oracle Stream, Oracle 9i documentation
Oracle, Oracle Pipelined Table Functions, Oracle 9i documentation
Witkowski, A., et al.: Continuous Queries in Oracle. In: VLDB 2007 (2007)
Workflow Management Coalition, www.aiim.org/wfmc/mainframe.htm
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, Q., Hsu, M. (2008). Data-Continuous SQL Process Model. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems: OTM 2008. OTM 2008. Lecture Notes in Computer Science, vol 5331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88871-0_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-88871-0_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88870-3
Online ISBN: 978-3-540-88871-0
eBook Packages: Computer ScienceComputer Science (R0)