Data Transformation Services over Grids with Real-Time Bound Constraints

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5331)

Abstract

Data and Knowledge Grids are emerging and attractive application scenarios for Grid Computing, and they pose novel, previously unrecognized challenges to the research community. Data and Knowledge Grids build on high-performance Grid infrastructures and add to them meaningful data- and knowledge-oriented abstractions and metaphors that fit well with the innovative requirements of modern complex Intelligent Information Systems. Service-oriented architectures and paradigms are the most popular ones for Grids, and on the whole represent an active and widely recognized area of Grid Computing research. In this paper, we introduce so-called Grid-based RTSOA frameworks, which essentially combine Grid Computing with real-time service management and execution paradigms, and lay the basis for novel research perspectives in data-intensive e-science Grid applications with real-time bound constraints. This novel framework is then specialized to the particular context of Data Transformation services over Grids, which play a relevant role for both Data and Knowledge Grids. Finally, we complete the main contribution of the paper with a rigorous theoretical model for efficiently supporting Grid-based RTSOA frameworks, with particular emphasis on the context of Data Transformation services over Grids, along with its preliminary experimental assessment.

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cuzzocrea, A. (2008). Data Transformation Services over Grids with Real-Time Bound Constraints. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems: OTM 2008. OTM 2008. Lecture Notes in Computer Science, vol 5331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88871-0_60

  • DOI: https://doi.org/10.1007/978-3-540-88871-0_60

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-88870-3

  • Online ISBN: 978-3-540-88871-0

  • eBook Packages: Computer Science, Computer Science (R0)
