Abstract
Data and Knowledge Grids represent emerging and attracting application scenarios for Grid Computing, and pose novel and previously-unrecognized challenges to the research community. Basically, Data and Knowledge Grids found on high-performance Grid infrastructures and add to the latter meaningful data- and knowledge-oriented abstractions and metaphors that perfectly marry with innovative requirements of modern complex Intelligent Information Systems. To this end, service-oriented architectures and paradigms are the most popular ones for Grids, and on the whole represent an active and widely-recognized area of Grid Computing research. In this paper, we introduce the so-called Grid-based RTSOA frameworks, which essentially combine Grid Computing with real-time service management and execution paradigms, and put the basis for novel research perspectives in data-intensive e-science Grid applications with real-time bound constraints. This novel framework is then specialized to the particular context of Data Transformation services over Grids, which play a relevant role for both Data and Knowledge Grids. Finally, we complete the main contribution of the paper with a rigorous theoretical model for efficiently supporting Grid-based RTSOA frameworks, with particular emphasis to the context of Data Transformation services over Grids, along with its preliminary experimental assessment.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Allcock, W.E., Bester, J., Bresnahan, J., Chervenak, A.L., Foster, I.T., Kesselman, C., Meder, S., Nefedova, V., Quesnel, D., Tuecke, S.: Data Management and Transfer in High-Performance Computational Grid Environments. Parallel Computing 28(5), 749–771 (2002)
Antonioletti, M., Atkinson, M., Baxter, R., Borley, A., Chue Hong, N., Collins, B., Hardman, N., Hume, A., Knox, A., Jackson, M., Krause, A., Laws, S., Magowan, J., Paton, N.W., Pearson, D., Sugden, T., Watson, P., Westhead, M.: The Design and Implementation of Grid Database Services in OGSA-DAI. Concurrency and Computation: Practice and Experience 17(2-4), 357–376 (2005)
Baker, M., Buyya, R., Laforenza, D.: Grids and Grid Technologies for Wide-Area Distributed Computing. International Journal of Software: Practice and Experience 32(15), 1437–1466 (2002)
Bodhare, S.: Optimizing Service Infrastructures, http://blogs.ittoolbox.com/eai/optimization/archives/optimizing-serviceinfrastructures-3928
Brezany, P., Janciak, I., Tjoa, A.M.: GridMiner: A Fundamental Infrastructure for Building Intelligent Grid Systems. In: IEEE/ACM WI 2005, pp. 150–156 (2005)
Cannataro, M., Talia, D.: The Knowledge Grid: An Architecture for Distributed Knowledge Discovery. Communications of the ACM 46(1), 89–93 (2003)
Cannataro, M., Talia, D., Trunfio, P.: Distributed Data Mining on the Grid. Future Generation Computer Systems 18(8), 1101–1112 (2002)
de Carvalho Costa, R.L., Furtado, P.: An SLA-Enabled Grid Data Warehouse. In: IEEE IDEAS 2007, pp. 285–289 (2007)
Congiusta, A., Pugliese, A., Talia, D., Trunfio, P.: Designing Grid Services for Distributed Knowledge Discovery. Web Intelligence and Agent Systems 1(2), 91–104 (2003)
Cuzzocrea, A.: Towards Real-Time Data Transformation Services over Grids. In: IEEE COMPSAC-RTSOAA 2008, pp. 1143–1149 (2008)
Cuzzocrea, A., Furfaro, F., Greco, S., Mazzeo, G.M., Masciari, E., Saccà, D.: A Distributed System for Answering Range Queries on Sensor Network Data. In: IEEE PerSeNS 2005, pp. 369–373 (2005)
Cuzzocrea, A., Furfaro, F., Masciari, E., Saccà, D., Sirangelo, C.: Approximate Query Answering on Sensor Network Data Streams. In: Stefanidis, A., Nittel, S. (eds.) GeoSensor Networks, pp. 53–72. CRC Press, Boca Raton (2004)
Cuzzocrea, A., Furfaro, F., Mazzeo, G.M., Saccà, D.: A Grid Framework for Approximate Aggregate Query Answering on Summarized Sensor Network Readings. In: Meersman, R., Tari, Z., Corsaro, A. (eds.) OTM-WS 2004. LNCS, vol. 3292, pp. 144–153. Springer, Heidelberg (2004)
Data Transformation Services – Microsoft SQL Server (2000), http://www.microsoft.com/technet/prodtechnol/sql/2000/deploy/dtssql2k.mspx
Fiser, B., Onan, U., Elsayed, I., Brezany, P., Tjoa, A.M.: On-Line Analytical Processing on Large Databases Managed by Computational Grids. In: IEEE DEXA Workshops 2004, pp. 556–560 (2004)
Foster, I., Kesselman, C., Nick, J.M., Tuecke, S.: Grid Services for Distributed System Integration. IEEE Computer 35(6), 37–46 (2002)
Foster, I., Kesselman, C., Tuecke, S.: The Anatomy of the Grid: Enabling Scalable Virtual Organizations. International Journal of High Performance Computing Applications 15(3), 200–222 (2001)
Fox, G., Aktas, M.S., Aydin, G., Bulut, H., Gadgil, H., Oh, S., Pallickara, S., Pierce, M.E., Sayar, A., Zhai, G.: Grids for Real Time Data Applications. In: Wyrzykowski, R., Dongarra, J., Meyer, N., Waśniewski, J. (eds.) PPAM 2005. LNCS, vol. 3911, pp. 320–332. Springer, Heidelberg (2006)
Fox, G., Aydin, G., Bulut, H., Gadgil, H., Pallickara, S., Pierce, M., Wu, W.: Management of Real-Time Streaming Data Grid Services. Concurrency and Computation: Practice and Experience 19(7), 983–998 (2007)
Fox, G., Pallickara, S., Pierce, M., Gadgil, H.: Building Messaging Substrates for Web and Grid Applications. Philosophical Transactions of the Royal Society: Mathematical, Physical and Engineering Sciences 363(1833), 1757–1773 (2005)
Gray, J., Chaudhuri, S., Bosworth, A., Layman, A., Reichart, D., Venkatrao, M.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals. Data Mining and Knowledge Discovery 1(1), 29–53 (1997)
Han, J.: OLAP Mining: An Integration of OLAP with Data Mining. IFIP 2.6 DS 1997, 1–9 (1997)
Helander, J., Sigurdsson, S.B.: Self-Tuning Planned Actions Time to Make Real-Time SOAP Real. In: IEEE ISORC 2005, pp. 80–89 (2005)
Ho, C.-T., Agrawal, R., Megiddo, N., Srikant, R.: Range Queries in OLAP Data Cubes. In: ACM SIGMOD 1997, pp. 73–88 (1997)
IBM: IBM SOA Foundation: An Architectural Introduction and Overview, http://download.boulder.ibm.com/ibmdl/pub/software/dw/webservices/ws-soa-whitepaper.pdf
Intel: Service-Oriented Enterprise, The Technology Path to Business Transformation, http://www.intel.com/business/bss/technologies/soe/soe_backgrounder.pdf
Iqbal, S., Bunn, J.J., Newman, H.B.: Distributed Heterogeneous Relational Data Warehouse in a Grid Environment. In: CHEP 2003 (2003), http://www.slac.stanford.edu/econf/C0303241/proc/papers/THAT007.pdf
Jiang, W.-S., Yu, J.-H.: Distributed Data Mining on the Grid. In: IEEE ICMLC 2005, pp. 2010–2014 (2005)
Kazi, A.: Enabling Real-Time Business Through Service-Oriented & Event-Driven Architecture. Business Integration Journal (April 2005), http://www.bijonline.com/index.cfm?section=article&aid=19
Kohlhoff, C., Steele, R.: Evaluating SOAP for High Performance Applications in Capital Markets. Computer Systems: Science & Engineering 19(4), 19–31 (2004)
Lawrence, M., Dehne, F.A., Rau-Chaplin, A.: Implementing OLAP Query Fragment Aggregation and Recombination for the OLAP Enabled Grid. In: IEEE IPDPS 2007, pp. 1–8 (2007)
Lawrence, M., Rau-Chaplin, A.: The OLAP-Enabled Grid: Model and Query Processing Algorithms. In: IEEE HPCS 2006, pp. 4–10 (2006)
Michelson, B.M.: Event-Driven Architecture Overview. Technical Report, Patricia Seybold Group (2006), http://soa.omg.org/Uploaded%20Docs/EDA/bda2-2-06cc.pdf
Moore, R.: Knowledge-based Grids. Technical Report, San Diego Supercomputer Center (2001)
Nguyen, M., Tjoa, A.M., Weippl, E., Brezany, P.: Toward a Grid-Based Zero-Latency Data Warehousing Implementation for Continuous Data Streams Processing. International Journal of Data Warehousing and Mining 1(4), 22–55 (2005)
Nieto-Santisteban, M.A., Gray, J., Szalay, A., Annis, J., Thakar, A.R., O’Mullane, W.: When Database Systems Meet the Grid. In: ACM CIDR 2005, pp. 154–161 (2005)
Papazoglou, M.P., Georgakapoulos, G.: Service-Oriented Computing. Communications of the ACM 46(10), 24–28 (2003)
Papazoglou, M.P., van den Heuvel, W.-J.: Service-Oriented Design and Development Methodology. International Journal of Web Engineering and Technology 2(4), 412–442 (2006)
Papazoglou, M.P., van den Heuvel, W.-J.: Service Oriented Architectures: Approaches, Technologies and Research Issues. VLDB Journal 16(3), 389–415 (2007)
Poess, M., Othayoth, R.: Large Scale Data Warehouses on Grid: Oracle Database 10g and HP ProLiant Systems. In: VLDB 2005, pp. 1055–1066 (2005)
Smith, J., Gounaris, A., Watson, P., Paton, N.W., Fernandes, A.A.A., Sakellariou, R.: Distributed Query Processing on the Grid. In: Parashar, M. (ed.) GRID 2002. LNCS, vol. 2536, pp. 279–290. Springer, Heidelberg (2002)
Son, S.H., Kang, K.-D.: Qos Management in Web-based Real-Time Data Services. In: IEEE WECWIS 2002, pp. 129–135 (2002)
Stahl, F., Berrar, D.P., Silva, C., Rodrigues, R.J., Brito, R.M.M., Dubitzky, W.: Grid Warehousing of Molecular Dynamics Protein Unfolding Data. In: IEEE CCGRID 2005, pp. 496–503 (2005)
Tsai, W.T., Lee, Y.-H., Cao, Z., Chen, Y., Xiao, B.: RTSOA: Real-Time Service-Oriented Architecture. In: IEEE SOSE 2006, pp. 49–56 (2006)
Wehrle, P., Miquel, M., Tchounikine, A.: A Model for Distributing and Querying a Data Warehouse on a Computing Grid. In: IEEE ICPADS 2005, pp. 203–209 (2005)
Wehrle, P., Miquel, M., Tchounikine, A.: A Grid Services-Oriented Architecture for Efficient Operation of Distributed Data Warehouses on Globus. In: IEEE AINA 2007, pp. 994–999 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cuzzocrea, A. (2008). Data Transformation Services over Grids with Real-Time Bound Constraints. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems: OTM 2008. OTM 2008. Lecture Notes in Computer Science, vol 5331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88871-0_60
Download citation
DOI: https://doi.org/10.1007/978-3-540-88871-0_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88870-3
Online ISBN: 978-3-540-88871-0
eBook Packages: Computer ScienceComputer Science (R0)