Abstract
In a Data Grid, replication of data is critical for maximizing the overall job throughput. Such replication involves the creation of copies of data files at different sites according to specific Replica Optimization strategies that define when and where replicas should be created or deleted on a per-site basis, and which replicas should be used by Grid jobs. To be really effective these strategies have to take into account the available network bandwidth as a primary resource, prior to any consideration about storage or processing power. We present a novel replica management service, integrated within the GlueDomains active network monitoring architecture, designed and implemented within the centralized collective middleware framework of the SCoPE project to provide network-aware replica optimization for data intensive applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allcock, B., et al.: Efficient data transport and replica management for high-performance data-intensive computing. In: Proceedings of the 18th IEEE Symposium on Mass Storage Systems and 9th NASA Goddard Conference on Mass Storage Systems and Technologies, S. Diego (2001)
Chervenak, A., et al.: Giggle: a framework for constructing scalable replica location services. In: SC 2002, Baltimore (2002)
Ciuffoletti, A., et al.: Architecture of monitoring elements for the network element modeling in a grid infrastructure. In: Proc. of Workskop on Computing in High Energy and Nuclear Physics (2003)
INFN Grid, http://grid.infn.it/
LCG middleware, http://lcg.web.cern.ch/LCG/
gLite middleware, http://glite.web.cern.ch/glite/
McClatchey, R., et al.: Data Intensive and Network Aware (DIANA) Grid Scheduling. Jouunal of Grid Computing, DOI 10.1007/s10723-006-9059-z
Mathis, S., Mahdavi, O.: The macroscopic behavior of the TCP congestion avoidance algorithm. Computer Communications Rev. 27(3), 62–82 (1997)
Foster, I., et al.: The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration, Globus Project (2002), http://www.globus.org/research/papers/ogsa.pdf
Czajkowski, K., et al.: From Open Grid Services Infrastructure to WS-Resource Framework: Refactoring and Evolution (2004), http://www.globus.org
NuSoap - SOAP Toolkit for PHP, http://sourceforge.net/projects/nusoap
Alifieri, R., et al.: An authorization system for virtual organizations. In: European Across Grids Conference, Santiago de Compostela, Spain (2003)
Bell, W., Cameron, D., Capozza, L., Millar, A.P., Stockinger, K., Zini, F.: Design of a Replica Optimisation Framework, Technical Report DataGrid-02-TED-021215, Geneva, Switzerland (2002)
Cameron, D., Casey, J., Guy, L., Kunszt, P., Lemaitre, S., McCance, G., Stockinger, H., Stockinger, K., et al.: Replica Management in the EU DataGrid Project. International Journal of Grid Computing 2(4), 341–351 (2004)
Andreetto, P., et al.: Practical Approaches to Grid Workload & Resource Management in the EGEE Project. In: CHEP 2004, Interlaken, Switzerland (2004)
Yin, D., Chen, B., Fang, Y.: A Fast Replica Selection Algorithm for Data Grid, compsac. In: 31st Annual International Computer Software and Applications Conference (COMPSAC 2007), vol. 1, pp. 383–387 (2007)
Ciglan, M., Hluchy, L.: Towards Scalable Grid Replica Optimization Framework, ispdc. In: The 4th International Symposium on Parallel and Distributed Computing (ISPDC 2005), pp. 43–50 (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Palmieri, F., Pardi, S. (2008). Network-Aware Replica Optimization in the SCoPE Grid Infrastructure. In: Gervasi, O., Murgante, B., Laganà, A., Taniar, D., Mun, Y., Gavrilova, M.L. (eds) Computational Science and Its Applications – ICCSA 2008. ICCSA 2008. Lecture Notes in Computer Science, vol 5073. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69848-7_67
Download citation
DOI: https://doi.org/10.1007/978-3-540-69848-7_67
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69840-1
Online ISBN: 978-3-540-69848-7
eBook Packages: Computer ScienceComputer Science (R0)