Abstract
Science collaborations use computer grids to run expensive computational tasks on large data sets. Tasks as jobs across the network demand data and thereby workload management and data allocation to maintain the computational workflow. Data allocation includes data placement with different replication factors (multiplicity) of data.
The proposed data replication & allocation model can place multitudes of subsets of a data population in a distributed system, such as a computer cluster or computer grid. A stochastic simulation with a data and computing example from the ATLAS Physics Collaboration shows its potential usability in one of the largest Computing Grids. This paper showcases data allocation with different replica factors and various numbers of subsets to improve the overall situation in a computer network.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Boru, D., Kliazovich, D., Granelli, F., Bouvry, P., Zomaya, A.Y.: Energy-efficient data replication in cloud computing datacenters. Cluster Comput. 18(1), 385–402 (2015). https://doi.org/10.1007/s10586-014-0404-x
Brezany, P., Mueck, T.A., Schikuta, E.: A software architecture for massively parallel input-output. In: Waśniewski, J., Dongarra, J., Madsen, K., Olesen, D. (eds.) PARA 1996. LNCS, vol. 1184, pp. 85–96. Springer, Heidelberg (1996). https://doi.org/10.1007/3-540-62095-8_10
Casey, R.G.: Allocation of copies of a file in an information network. In: Proceedings of the May 16–18, 1972, Spring Joint Computer Conference, pp. 617–625 (1971)
Lamehamedi, H., Szymanski, B., Shentu, Z., Deelman, E.: Data replication strategies in grid environments. In: Fifth International Conference on Algorithms and Architectures for Parallel Processing, 2002, Proceedings, pp. 378–383. IEEE (2002)
Loukopoulos, T., Ahmad, I.: Static and adaptive distributed data replication using genetic algorithms. J. Parallel Distrib. Comput. 64(11), 1270–1285 (2004)
Vamosi, R., Lassnig, M., Schikuta, E.: Data allocation based on evolutionary data popularity clustering. In: Shi, Y., et al. (eds.) ICCS 2018. LNCS, vol. 10860, pp. 153–166. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93698-7_12
Wolfson, O., Jajodia, S., Huang, Y.: An adaptive data replication algorithm. ACM Trans. Database Syst. (TODS) 22(2), 255–314 (1997)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Vamosi, R., Schikuta, E. (2023). Dynamic Data Replication for Short Time-to-Completion in a Data Grid. In: Mikyška, J., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M. (eds) Computational Science – ICCS 2023. ICCS 2023. Lecture Notes in Computer Science, vol 10475. Springer, Cham. https://doi.org/10.1007/978-3-031-36024-4_52
Download citation
DOI: https://doi.org/10.1007/978-3-031-36024-4_52
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36023-7
Online ISBN: 978-3-031-36024-4
eBook Packages: Computer ScienceComputer Science (R0)