Abstract
A key issue in supporting the synthesis of data intensive computation and data management is to liberate users from low-level parallel programming, by specifying applications functionally independent of the underlying server infrastructure, and further, by providing high-level primitives to express the control flow of applying functions to data partitions. Currently only few such primitives, e.g. Map-Reduce and Cross-Apply, are available, and their expressive power is limited to “flat parallel computing”. To deal with “structured parallel computing” where a function is applied to multiple objects with execution order dependencies, a general framework for creating and combining such primitives is required.
We propose the SQL-FCF framework as the database centric solution to the above problem. We embed into SQL queries the Function Controlling Forms (FCFs) to specify the flow control of applying Table Valued Functions (TVFs) to multiple data partitions. We further support the extensibility of this framework by allowing new FCFs to be defined from existing ones with SQL phrases. Based on this approach, we provided a SQL based high-level interface for “structured parallel computing” in architecting a hydrologic scientific computation platform. Envisioning that the simple parallel computing primitives will evolve and form a general framework, our effort is a step towards that goal.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Asanovic, K., Bodik, R., Catanzo, B.C., Gebis, J.J., Husbands, P., Keutzer, K., Patterson, D.A., Plishker, W.L., Shalf, J., Williams, S.W., Yelick, K.A.: The landscape of parallel computing research: A view from Berkeley, Technical Report UCB/EECS-2006-183, U. C., Berkeley, December 18 (2006)
Backus, J.: Can Programming be Liberated from the von Neumann Style? A functional style and its algebra of programs. CACM 21(8) (1978)
Barclay, T., Gray, J., Chong, W.: TerraServer Bricks – A High Availability Cluster Alternative, Technical Report, MSR-TR-2004-107 (October 2004)
Barroso, L.A., Dean, J., H"olze, U.: Web search for a planet:”The Google cluster architecture. IEEE Micro 23(2), 22–28 (2003)
Ben-gan, I., et al.: Inside Microsoft SQL Server 2005: T-SQL Programming (2006)
Bryant, R.E.: Data-Intensive Supercomputing: The case for DISC, CMU-CS-07-128 (2007)
Chen, Q., Hsu, M., Dayal, U.: A Data-Warehouse/OLAP Framework for Scalable Telecommunication Tandem Traffic Analysis. In: ICDE 2000, pp. 201–210 (2000)
Chen, Q., Hsu, M.: Inter-Enterprise Collaborative Business Process Management. In: Proc. of 17th Int’l Conf. on Data Engineering (ICDE-2001), Germany (2001)
Chen, Q., Dayal, U., Hsu, M.: A Distributed OLAP Infrastructure for E-Commerce. In: Proc. Fourth IFCIS CoopIS Conference, UK (1999)
Chen, Q., Kambayashi, Y.: Nested Relation Based Database Knowledge Representation. In: ACM-SIGMOD Conference, pp. 328–337 (1991)
Chen, Q., Gardarin, G.: An Implementation Model for Reasoning with Complex Objects. In: ACM-SIGMOD Conference, pp. 164–172 (1988)
Dean, J.: Experiences with MapReduce, an abstraction for large-scale computation. In: International Conference on Parallel Architecture and Compilation Techniques. ACM, New York (2006)
DeWitt, D., Gray, J.: Parallel Database Systems: the Future of High Performance Database Systems. CACM 35(6) (June 1992)
Gray, J., Liu, D.T., Nieto-Santisteban, M.A., Szalay, A.S., Heber, G., DeWitt, D.: Scientific Data Management in the Coming Decade. SIGMOD Record 34(4) (2005)
Hsu, M., Xiong, Y.: Building a Scalable Web Query System. In: Bhalla, S. (ed.) DNIS 2007. LNCS, vol. 4777, Springer, Heidelberg (2007)
HP Neoview enterprise datawarehousing platform, http://h71028.www7.hp.com/ERC/downloads/4AA0-7932ENW.pdf
Netz, A., Chaudhuri, S., Bernhardt, J., Fayyad, U.: Integration of data mining and relational databases. In: Proceeding of the 26th Conference on Very Large Databases, pp. 719–722 (2000)
Olston, C., Reed, B., Srivastava, U., Kumar, R., Tomkins, A.: Pig Latin: A Not-So-Foreign Language for Data Processing. In: VLDB (2008)
Saarenvirta, G.: Operational Data Mining. DB2 Magazine 6 (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, Q., Hsu, M. (2008). SQL TVF Controlling Forms - Express Structured Parallel Data Intensive Computing. In: Bhowmick, S.S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2008. Lecture Notes in Computer Science, vol 5181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85654-2_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-85654-2_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85653-5
Online ISBN: 978-3-540-85654-2
eBook Packages: Computer ScienceComputer Science (R0)