Dynamic Data Replication for Short Time-to-Completion in a Data Grid

Vamosi, Ralf; Schikuta, Erich

doi:10.1007/978-3-031-36024-4_52

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 10475))

Included in the following conference series:

International Conference on Computational Science

470 Accesses

Abstract

Science collaborations use computer grids to run expensive computational tasks on large data sets. Tasks as jobs across the network demand data and thereby workload management and data allocation to maintain the computational workflow. Data allocation includes data placement with different replication factors (multiplicity) of data.

The proposed data replication & allocation model can place multitudes of subsets of a data population in a distributed system, such as a computer cluster or computer grid. A stochastic simulation with a data and computing example from the ATLAS Physics Collaboration shows its potential usability in one of the largest Computing Grids. This paper showcases data allocation with different replica factors and various numbers of subsets to improve the overall situation in a computer network.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Boru, D., Kliazovich, D., Granelli, F., Bouvry, P., Zomaya, A.Y.: Energy-efficient data replication in cloud computing datacenters. Cluster Comput. 18(1), 385–402 (2015). https://doi.org/10.1007/s10586-014-0404-x
Article Google Scholar
Brezany, P., Mueck, T.A., Schikuta, E.: A software architecture for massively parallel input-output. In: Waśniewski, J., Dongarra, J., Madsen, K., Olesen, D. (eds.) PARA 1996. LNCS, vol. 1184, pp. 85–96. Springer, Heidelberg (1996). https://doi.org/10.1007/3-540-62095-8_10
Chapter Google Scholar
Casey, R.G.: Allocation of copies of a file in an information network. In: Proceedings of the May 16–18, 1972, Spring Joint Computer Conference, pp. 617–625 (1971)
Google Scholar
Lamehamedi, H., Szymanski, B., Shentu, Z., Deelman, E.: Data replication strategies in grid environments. In: Fifth International Conference on Algorithms and Architectures for Parallel Processing, 2002, Proceedings, pp. 378–383. IEEE (2002)
Google Scholar
Loukopoulos, T., Ahmad, I.: Static and adaptive distributed data replication using genetic algorithms. J. Parallel Distrib. Comput. 64(11), 1270–1285 (2004)
Article Google Scholar
Vamosi, R., Lassnig, M., Schikuta, E.: Data allocation based on evolutionary data popularity clustering. In: Shi, Y., et al. (eds.) ICCS 2018. LNCS, vol. 10860, pp. 153–166. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93698-7_12
Chapter Google Scholar
Wolfson, O., Jajodia, S., Huang, Y.: An adaptive data replication algorithm. ACM Trans. Database Syst. (TODS) 22(2), 255–314 (1997)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Computer Science, University of Vienna, Vienna, Austria
Ralf Vamosi & Erich Schikuta

Authors

Ralf Vamosi
View author publications
You can also search for this author in PubMed Google Scholar
Erich Schikuta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Erich Schikuta .

Editor information

Editors and Affiliations

Czech Technical University in Prague, Prague, Czech Republic
Jiří Mikyška
University of Amsterdam, Amsterdam, The Netherlands
Clélia de Mulatier
AGH University of Science and Technology, Krakow, Poland
Maciej Paszynski
University of Amsterdam, Amsterdam, The Netherlands
Valeria V. Krzhizhanovskaya
University of Tennessee at Knoxville, Knoxville, TN, USA
Jack J. Dongarra
University of Amsterdam, Amsterdam, The Netherlands
Peter M.A. Sloot

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vamosi, R., Schikuta, E. (2023). Dynamic Data Replication for Short Time-to-Completion in a Data Grid. In: Mikyška, J., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M. (eds) Computational Science – ICCS 2023. ICCS 2023. Lecture Notes in Computer Science, vol 10475. Springer, Cham. https://doi.org/10.1007/978-3-031-36024-4_52

Download citation

DOI: https://doi.org/10.1007/978-3-031-36024-4_52
Published: 26 June 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36023-7
Online ISBN: 978-3-031-36024-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Dynamic Data Replication for Short Time-to-Completion in a Data Grid