Skip to main content

Data Driven Infrastructure and Policy Selection to Enhance Scientific Applications in Grid

  • Conference paper
Scientific Applications of Grid Computing (SAG 2004)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3458))

Included in the following conference series:

  • 291 Accesses

Abstract

Most works on Grids have taken an approach where the system is a mixture of clusters and other resources put together with the help of some services. But this solution is a simplistic one that tries to grow from the cluster perspective. We think that the Grid model should be different and near to the p2p model, especially in the I/O field where the network and the heterogeneity of the infrastructure play an important role. In this paper we present a model to organize the DataGrid Infrastructure using concepts as data phases and a p2p approach, in order to select the adequate working policies. These concepts allow the definition of a clearer model for our DataGrid Architecture than a mixture of resources. We present a model relying on the former concepts, their implementation in an I/O middleware for Grids, called GridExpand, and the evaluation of some of the concepts presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Foster, I., Kesselman, C. (eds.): The Grid: Blueprint for a New Computing Infraestructure. Morgan Kaufmann, San Francisco (1999)

    Google Scholar 

  2. Patterson, D., Gibson, G., Katz, R.: A Case for Redundant arrays of Inexpensive Disks (RAID). In: Proc. of the ACM SIGMOD 1988, June 1988, pp. 109–116 (1988)

    Google Scholar 

  3. Sponsored by the European Union. The Data Grid Project, http://eu-datagrid.web.cern.ch/eu-datagrid

  4. Sponsored by the U.S. DOE Office of Science. The Earth System Grid, http://www.earthsystemgrid.org

  5. Wolfson, O., Jajodia, S., Huang, Y.: An Adaptive Data Replication Algorithm. ACM Transaction on Database Systems 22(2), 255–314 (1997)

    Article  Google Scholar 

  6. Pacitti, E., Minet, P., Simon, E.: Fast algorithms for maintaining replica consistency in lazy master replicated databases. In: VLDB, pp. 126–137 (1999)

    Google Scholar 

  7. Dahlin, M., Wang, R., Anderson, T., Patterson, D.: Cooperative Caching: Using Remote Client Memory to Improve File System Performance. In: OSDI (November 1994)

    Google Scholar 

  8. Dahlin, M., Mather, C., Wang, R., Anderson, T., Patterson, D.: A Quantitative Analysis of Cache Policies for Scalable Network File Systems. In: SIGMETRICS (1994)

    Google Scholar 

  9. Garcia, F., Carretero, J., Perez, F., de Miguel, P., Alonso, L.: High Performance Cache Management for Parallel File Systems. In: Hernández, V., Palma, J.M.L.M., Dongarra, J. (eds.) VECPAR 1998. LNCS, vol. 1573, pp. 466–479. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  10. Thain, D., Basney, J., Son, S.-C., Livny, M.: The Kangaroo Approach to Data Movement on the Gr4id. In: Proceedings of the Tenth IEEE Symposium on High Performance Distributed Computing

    Google Scholar 

  11. Simitci, H., Reed, D.A., Fox, T., Medina, M., Oly, J., Trand, N., Wang, G.: A Framework for Adaptive Storage Input/Output on Computational Grids. In: Proceedings of the 3rd Workshop on Runtime Systems for Parallel Programming (April 1999)

    Google Scholar 

  12. Madhyastha, T.M., Elford, C.L., Reed, D.A.: Optimizing Input/Output Using Adaptive File System Policies. In: Proceedings of the Fifth Goddard Conference on Mass Storage Systems and Technologies, College Park, MD, September 1996, pp. 493–514 (1996)

    Google Scholar 

  13. Ranganathan, K., Foster, I.: Identifying Dynamic Replication Strategies for a High Performance Data Grid. In: Proceedings of the International Grid Computing Workshop, Denve (November 2001)

    Google Scholar 

  14. Ranganathan, K., Iamnitchi, A., Foster, I.: Improving Data Availability through Dynamic Model-Driven Replication in Large Peer-to-Peer Communities. In: Global and Peer-to-Peer Computing on Large Scale Distributed Systems Workshop, Berlin (May 2002)

    Google Scholar 

  15. Thakur, R., Choudhary, A.: An Extended Two-Phase Method for Accessing Sections of Out-of-Core Arrays. Scientific Programming 4(5), 301–317 (Winter 1996)

    Google Scholar 

  16. Thakur, R., Gropp, W., Lusk, E.: Data Sieving and Collective I/O in ROMIO. In: Proceedings of the Seventh Symposium on the Frontiers of Massively Parallel Computation (1998)

    Google Scholar 

  17. Thain, D., Bent, J., Arpaci-Dusseau, A., Arpaci-Dusseau, R., Libny, M.: Gathering at the Well: Creating Communities for Grid I/O. In: Proceedings of Supercomputing 2001, Denver, Colorado (November 2001)

    Google Scholar 

  18. Stoica, I., Morris, R., Karger, D., Kaashoek, M.F., Balakrishnan, H.: Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications. In: Proceedings of the 2001 ACM SIGCOMM Conference, San Diego, California, USA (2001)

    Google Scholar 

  19. Garcia, F., Calderon, A., Carretero, J., Perez, J.M., Fernandez, J.: The Design of the Expand Parallel File System. International Journal of High Performance Computing Applications (2003)

    Google Scholar 

  20. Gropp, W., Takhur, R., Lusk, E.: An Abstract-Devide Interface for Implementing Portable Paralle-I/O Interfaces. In: Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation, October 1996, pp. 180–187 (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Perez, J.M., Garcia, F., Carretero, J., Garcia, J.D., Escolar, S. (2005). Data Driven Infrastructure and Policy Selection to Enhance Scientific Applications in Grid. In: Herrero, P., Pérez, M.S., Robles, V. (eds) Scientific Applications of Grid Computing. SAG 2004. Lecture Notes in Computer Science, vol 3458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11423287_6

Download citation

  • DOI: https://doi.org/10.1007/11423287_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25810-0

  • Online ISBN: 978-3-540-32010-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics