Abstract
Data replication and consistency refer to the same data being stored in distributed sites, and kept consistent when one or more copies are modified. A good file maintenance and consistency strategy can reduce file access times and access latencies, and increase download speeds, thus reducing overall computing times. In this paper, we propose dynamic services for replicating and maintaining data in grid environments, and directing replicas to appropriate locations for use. To address a problem with the Bandwidth Hierarchy-based Replication (BHR) algorithm, a strategy for maintaining replicas dynamically, we propose the Dynamic Maintenance Service (DMS). We also propose a One-way Replica Consistency Service (ORCS) for data grid environments, a positive approach to resolving consistency maintenance issues we hope will strike a balance between improving data access performance and replica consistency. Experimental results show that our services are more efficient than other strategies.
Similar content being viewed by others
References
Allcock B, Bester J, Bresnahan J, Chervenak A, Foster I, Kesselman C, Meder S, Nefedova V, Quesnel D, Tuecke S (2002) Data management and transfer in high-performance computational grid environments. Parallel Comput 28(5):749–771
Allcock B, Bester J, Bresnahan J, Chervenak A, Foster I, Kesselman C, Meder S, Nefedova V, Quesnel D, Tuecke S (2001) Secure, efficient data transport and replica management for high-performance data-intensive computing. In: Proceedings of the eighteenth IEEE symposium on mass storage systems and technologies, pp 13–28
Chang RS, Chang JS (2006) Adaptable replica consistency service for data grids. In: Proceeding of the third international conference of information technology (ITNG’06), pp 646–651
Chervenak A, Foster I, Kesselman C, Salisbury C, Tuecke S (2001) The data grid: towards an architecture for the distributed management and analysis of large scientific datasets. J Netw Comput Appl 23:187–200
Czajkowski K, Fitzgerald S, Foster I, Kesselman C (2001) Grid information services for distributed resource sharing, In: Proceedings of the tenth IEEE international symposium on high-performance distributed computing (HPDC-10’01), August 2001, pp 181–194
Domenici A, Donno F, Pucciani G, Stockinger H, Stockinger K (2004) Replica consistency in a data grid. Nucl Instr Methods Phys Res A 534(1–2):24–28
Düllmann D, Hoschek W, Martinez JJ, Segal B (2001) Models for replica synchronisation and consistency in a data grid. In: Proceedings of the 10th IEEE international symposium on high performance distributed computing (HPDC-10’01), October 2001, pp 67
Fathali J (2006, to appear) A genetic algorithm for the p-median problem with pos/neg weights. Appl Math Comput 8 (August)
Fisher ML (1981) The Lagrangian relaxation method for solving integer programming problems. Manag Sci 27:1–18
Foster I (2002) The grid: a new infrastructure for 21st century science. Phys Today 55(2):42–47
Foster I, Kesselman C (1999) The grid 2: blueprint for a new computing infrastructure, 2nd edn. Morgan Kaufmann, San Mateo (Elsevier series in grid computing)
Foster I, Kesselman C (1997) Globus: a metacomputing infrastructure toolkit. Int J Supercomput Appl High Perform Comput 11(2):115–128
Huang CQ, Xu FY, Hu XY (2006) Massive data oriented replication algorithms for consistency maintenance in data grids. ICCS 2006, Part I, LNCS 3991, pp 838–841
Hu JF, Xiao N, Zhao YJ, Fu W (2005) An asynchronous replica consistency model in data grid. In: Parallel and distributed processing and applications (ISPA 2005 workshops), pp 475–484
Jackson LE, Rouskas GN, Stallmann MFM (2007) The directional p-median problem: definition, complexity, and algorithms. Eur J Oper Res 179:1097–1108. http://people.engr.ncsu.edu/mfms/Publications/2007-EJOR-Jackson.pdf
Java CoG. http://www-unix.globus.org/cog/
OptorSim. http://edg-wp2.web.cern.ch/edg-wp2/optimization/optorsim.html
Park SM, Kim JH, Ko YB, Yoon W-S (2003) Dynamic data grid replication strategy based on Internet hierarchy. In: The second international workshop on grid and cooperative computing (GCC2003), pp 838–846
Park SM, Kim JH (2003) Chameleon: a resource scheduler in a data grid environment. In: Proceedings of third international symposium on cluster computing and the grid, p. 258. http://portal.acm.org/citation.cfm?id=792481
Rahman RM, Barker K, Alhajj R (2006) Replica placement design with static optimality and dynamic maintainability. In: Proceedings of the sixth IEEE international symposium on cluster computing and the grid (CCGRID’06), pp 434–437
Rahman RM, Barker K, Alhajj R (2006) Effective dynamic replica maintenance algorithm for the grid environment. In: Proceedings of advances in grid and pervasive computing, vol 3947: Grid and pervasive computing 2006 (GPC2006), pp 336–345
Ranganathan K, Foster I Design and evaluation of dynamic replication strategies for a high performance data grid. In: Proceedings of international conference on computing in high energy and nuclear physics
Shi XH, Jin H, Qiang WZ, Zou DQ (2003) An adaptive meta-scheduler for data-intensive applications. In: Proceedings of grid and cooperative computing (GCC’03), pp 830–837
Stockinger H, Samar A, Allcock B, Foster I, Holtman K, Tierney B (2002) File and object replication in data grids. J Cluster Comput 5(3):305–314
The Globus Alliance. http://www.globus.org/
Vazhkudai S, Tuecke S, Foster I (2001) Replica selection in the globus data grid. In: Proceedings of the 1st international symposium on cluster computing and the grid (CCGRID 2001), pp 106–113
Venugopal S, Buyya R, Ramamohanarao K (2006) A taxonomy of data grids for distributed data sharing, management, and processing. ACM computing surveys, vol 38, Article 3, March 2006
Yang CT, Yang IH, Li KC, Wang SY (2007) Improvements on dynamic adjustment mechanism in co-allocation data grid environments. J Supercomput 40(3):269–280
Yang CT, Wang SY, Fu CP (2007) A dynamic adjustment mechanism for data transfer in data grids. In: Network and parallel computing: IFIP international conference, NPC 2007. Lecture notes in computer science, vol 4672. Springer, Berlin, pp 61–70. ISSN 1611-3349
Yang CT, Yang MF, Chiang WC (2008) Implementation of a cyber transformer for parallel download in co-allocation data grid environments. In: Proceedings of the 7th international conference on grid and cooperative computing (GCC2008) and second EchoGRID conference, October 24–26, 2008 in Shenzhen, Guangdong, China, pp 242–253
Yang CT, Yang IH, Chen CH, Wang SY (2006) Implementation of a dynamic adjustment mechanism with efficient replica selection in co-allocation data grid environments. In: Proceedings of the 21st annual ACM symposium on applied computing (SAC 2006) – distributed systems and grid computing (DSGC) track, vol 1, pp 797–804, Dijon, France, April 23–27, 2006
Yang CT, Yang IH, Wang SY, Li KC, Hsu CH (2009) A recursively-adjusting co-allocation scheme with cyber-transformer in data grids. Future Gener Comput Syst 25(7):695–703
Ganglia. http://ganglia.info/
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yang, CT., Fu, CP. & Hsu, CH. File replication, maintenance, and consistency management services in data grids. J Supercomput 53, 411–439 (2010). https://doi.org/10.1007/s11227-009-0302-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-009-0302-9