Abstract
Most works on Grids have taken an approach where the system is a mixture of clusters and other resources put together with the help of some services. But this solution is a simplistic one that tries to grow from the cluster perspective. We think that the Grid model should be different and near to the p2p model, especially in the I/O field where the network and the heterogeneity of the infrastructure play an important role. In this paper we present a model to organize the DataGrid Infrastructure using concepts as data phases and a p2p approach, in order to select the adequate working policies. These concepts allow the definition of a clearer model for our DataGrid Architecture than a mixture of resources. We present a model relying on the former concepts, their implementation in an I/O middleware for Grids, called GridExpand, and the evaluation of some of the concepts presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Foster, I., Kesselman, C. (eds.): The Grid: Blueprint for a New Computing Infraestructure. Morgan Kaufmann, San Francisco (1999)
Patterson, D., Gibson, G., Katz, R.: A Case for Redundant arrays of Inexpensive Disks (RAID). In: Proc. of the ACM SIGMOD 1988, June 1988, pp. 109–116 (1988)
Sponsored by the European Union. The Data Grid Project, http://eu-datagrid.web.cern.ch/eu-datagrid
Sponsored by the U.S. DOE Office of Science. The Earth System Grid, http://www.earthsystemgrid.org
Wolfson, O., Jajodia, S., Huang, Y.: An Adaptive Data Replication Algorithm. ACM Transaction on Database Systems 22(2), 255–314 (1997)
Pacitti, E., Minet, P., Simon, E.: Fast algorithms for maintaining replica consistency in lazy master replicated databases. In: VLDB, pp. 126–137 (1999)
Dahlin, M., Wang, R., Anderson, T., Patterson, D.: Cooperative Caching: Using Remote Client Memory to Improve File System Performance. In: OSDI (November 1994)
Dahlin, M., Mather, C., Wang, R., Anderson, T., Patterson, D.: A Quantitative Analysis of Cache Policies for Scalable Network File Systems. In: SIGMETRICS (1994)
Garcia, F., Carretero, J., Perez, F., de Miguel, P., Alonso, L.: High Performance Cache Management for Parallel File Systems. In: Hernández, V., Palma, J.M.L.M., Dongarra, J. (eds.) VECPAR 1998. LNCS, vol. 1573, pp. 466–479. Springer, Heidelberg (1999)
Thain, D., Basney, J., Son, S.-C., Livny, M.: The Kangaroo Approach to Data Movement on the Gr4id. In: Proceedings of the Tenth IEEE Symposium on High Performance Distributed Computing
Simitci, H., Reed, D.A., Fox, T., Medina, M., Oly, J., Trand, N., Wang, G.: A Framework for Adaptive Storage Input/Output on Computational Grids. In: Proceedings of the 3rd Workshop on Runtime Systems for Parallel Programming (April 1999)
Madhyastha, T.M., Elford, C.L., Reed, D.A.: Optimizing Input/Output Using Adaptive File System Policies. In: Proceedings of the Fifth Goddard Conference on Mass Storage Systems and Technologies, College Park, MD, September 1996, pp. 493–514 (1996)
Ranganathan, K., Foster, I.: Identifying Dynamic Replication Strategies for a High Performance Data Grid. In: Proceedings of the International Grid Computing Workshop, Denve (November 2001)
Ranganathan, K., Iamnitchi, A., Foster, I.: Improving Data Availability through Dynamic Model-Driven Replication in Large Peer-to-Peer Communities. In: Global and Peer-to-Peer Computing on Large Scale Distributed Systems Workshop, Berlin (May 2002)
Thakur, R., Choudhary, A.: An Extended Two-Phase Method for Accessing Sections of Out-of-Core Arrays. Scientific Programming 4(5), 301–317 (Winter 1996)
Thakur, R., Gropp, W., Lusk, E.: Data Sieving and Collective I/O in ROMIO. In: Proceedings of the Seventh Symposium on the Frontiers of Massively Parallel Computation (1998)
Thain, D., Bent, J., Arpaci-Dusseau, A., Arpaci-Dusseau, R., Libny, M.: Gathering at the Well: Creating Communities for Grid I/O. In: Proceedings of Supercomputing 2001, Denver, Colorado (November 2001)
Stoica, I., Morris, R., Karger, D., Kaashoek, M.F., Balakrishnan, H.: Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications. In: Proceedings of the 2001 ACM SIGCOMM Conference, San Diego, California, USA (2001)
Garcia, F., Calderon, A., Carretero, J., Perez, J.M., Fernandez, J.: The Design of the Expand Parallel File System. International Journal of High Performance Computing Applications (2003)
Gropp, W., Takhur, R., Lusk, E.: An Abstract-Devide Interface for Implementing Portable Paralle-I/O Interfaces. In: Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation, October 1996, pp. 180–187 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Perez, J.M., Garcia, F., Carretero, J., Garcia, J.D., Escolar, S. (2005). Data Driven Infrastructure and Policy Selection to Enhance Scientific Applications in Grid. In: Herrero, P., Pérez, M.S., Robles, V. (eds) Scientific Applications of Grid Computing. SAG 2004. Lecture Notes in Computer Science, vol 3458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11423287_6
Download citation
DOI: https://doi.org/10.1007/11423287_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25810-0
Online ISBN: 978-3-540-32010-4
eBook Packages: Computer ScienceComputer Science (R0)