Skip to main content
Log in

Data Replication and Placement Strategies in Distributed Systems: A State of the Art Survey

  • Published:
Wireless Personal Communications Aims and scope Submit manuscript

Abstract

Data replication is an important mechanism for managing data distribution. The general purpose of data replication is to place data in different locations so that it is available to the requester as quickly as possible. In replication, multiple copies of data are stored in multiple locations, which increases availability, fault tolerance, load balancing, and scalability. It also decreases bandwidth consumption and response time. In this paper, data replication scenarios classify to the types of methods, the number of replications, locations, best replication based on energy, types of architecture, replication management, and criteria selection for best replication. The parameters of a good replication strategy are identified and expressed, which simultaneously include reducing access time, reducing bandwidth consumption, increasing the availability of storage resources, and balancing replication. By examining and analyzing of recent papers, solutions classify to the dynamic solution, meta-heuristic solution, multiple criteria decision-making solution, and machine learning solution. Our research show that the best of which is the multiple criteria decision-making solution methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data Availability

All materials included in this paper are available and referenced properly.

Notes

  1. A problem is NP-hard if an algorithm for solving it can be translated into one for solving any NP-problem (nondeterministic polynomial time) problem.

References

  1. Goel, S., & Buyya, R. (2006). Data replication strategies in wide-area distributed systems, in Enterprise Service Computing: From Concept to Deployment, vol.17

  2. Huang, K., Li, D., & Sun, Y. (2014). CRMS: a Centralized Replication Management Scheme for Cloud Storage System, in Proc. IEEE/CIC International Conference on Communications in China (ICCC), pp. 344–348, October

  3. Goel, S., & Buyya, R. (2006). Data replication strategies in wide-area distributed systems, in Enterprise Service Computing: From Concept to Deployment, vol. 17.

  4. Bin, L., Jiong, Y., Hua, S., & Mei, N. (2012). A QoS-aware Dynamic Data Replica Deletion Strategy for Distributed Storage Systems under Cloud Computing Environments, in Proc. Second International Conference on Cloud and Green Computing (CGC), pp. 219–225. November.

  5. Ranganathan, K., & Foster, I. (2001) Design and Evaluation of Dynamic Replication Strategies for a High-Performance Data Grid, in The International Conference on Computing.

  6. Kingsy Grace, R., & Manimegalai, R. (Feb. 2014). Dynamic replica placement and selection strategies in data grids: a comprehensive survey. Journal of Parallel and Distributed Computing, 74(2), 2099–2108.

  7. Chang, R. S., & Hui-Ping, C. (2008). A dynamic data replication strategy using access-weights in data grids. The Journal of Supercomputing, 45(3), 277–295.

    Article  Google Scholar 

  8. Seguela, M., Mokadem, R., & Pierson, J. M. (2019). Comparing energy-aware vs. cost-aware data replication strategy. In 2019 Tenth International Green and Sustainable Computing Conference (IGSC) (pp. 1–8). IEEE.

  9. Ghemawat, S., Gobioff, H., & Leung, S. T. (2003). The Google File System, in Proc. Nineteenth ACM symposium on Operating systems principles (SOSP), Vol. 37, No. 5, pp. 29–43, December

  10. Hongxia, W. (2016). Application of VCG in Replica Placement Strategy of Cloud Storage. Int J Grid Distrib Comput, 9(4), 27–40.

    Article  Google Scholar 

  11. Yin, Y., & Deng, L. (2022). A dynamic decentralized strategy of replica placement on edge computing. International Journal of Distributed Sensor Networks, 18(8), 15501329221115064.

    Article  Google Scholar 

  12. Rajalakshmi, D., Vijayakumar, & Srinivasagan, K. G. (2014). An improved dynamic data replica selection and placement in cloud, 2014 Int. Conf. Recent Trends Inf. Technol. ICRTIT vol. 3, no. 3, 2014.

  13. Shvachko, K., Kuang, H., Radia, S., & Chansler, R. (2010). The Hadoop Distributed File System, in Proc. IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), pp. 1–10, May

  14. Zhang, Y., Zheng, Z., & Lyu, M. R. (2011) BFTCloud: A byzantine fault tolerance framework for voluntary-resource cloud computing, in Proc. IEEE International Conference on Cloud Computing (CLOUD), pp. 444–451.

  15. Veronese, G. S., Correia, M., Bessani, A. N., Lung, L. C., & Verissimo, P. (January 2013). Efficient byzantine Fault-Tolerance. Journal of IEEE Transactions on Computers, 62(1), 16–30.

  16. Tan, Y., Luo, D., & Wang, J. (2010). Cc-vit: Virtualization intrusion tolerance based on cloud computing, in Proc. 2nd International Conference on Information Engineering and Computer Science (ICIECS), pp. 1–6, December

  17. Sun, D. W., Chang, G. R., Gao, S., et al. (2012). Modeling a dynamic data replication strategy to increase system availability in Cloud Computing environments. Journal of Computer Science and Technology, 27(2), 256–272.

  18. Wei, Q., Veeravalli, B., Gong, B., Zeng, L., & Feng, D., CDRM: A cost-effective dynamic replication management scheme for cloud storage cluster, in Proc. IEEE International Conference on Cluster Computing (CLUSTER), pp. 188–196, September 2010.

  19. Vashisht, P., Kumar, V., Kumar, R., & Sharma, A. (2019). Optimizing replica creation using agents in data grids, in Proceedings of the 2019 Amity International Conference on Artificial Intelligence (AICAI), pp. 542–547, Dubai, UAE, February

  20. Li, C., Song, M., Zhang, M., & Luo, Y. (2020). Effective replica management for improving reliability and availability in edgecloud computing environment. Journal of Parallel and Distributed Computing, 143, 107–128.

    Article  Google Scholar 

  21. Gill, N. K., & Singh, S. (2016). A dynamic, cost-aware, optimized data replication strategy for heterogeneous cloud data centers. Future Generation Computer Systems, 65, 10–32.

    Article  Google Scholar 

  22. Mesbahi, M., & Rahmani, A. M. (2016). Load balancing in Cloud Computing: a state-of-the-art survey. International Journal of Modern Education and Computer Science, 8, 64–78.

    Article  Google Scholar 

  23. Kumar, K. A., Quamar, A., Deshpande, A., & Khuller, S. (December 2014). SWORD: workload-aware data placement and replica selection for cloud data management systems, VLDB Journal, 23(6), 845–870,

  24. Yanzhen, Q., Naixue, X., & Resilient, R. F. H. A. (2012). Fault-Tolerant and High-efficient Replication Algorithm for Distributed Cloud Storage, in Proc. 41st International Conference on Parallel Processing (ICPP), pp. 520–529, September

  25. Janpet, J., & Wen, Y. F. (2013). Reliable and Available Data Replication Planning for Cloud Storage, in Proc. IEEE 27th International Conference on Advanced Information Networking and Applications (AINA), pp. 772–779, March

  26. Bin, L., Jiong, Y., Hua, S., & Mei, N. (2012). A QoS-aware Dynamic Data Replica Deletion Strategy for Distributed Storage Systems under Cloud Computing Environments, in Proc. Second International Conference on Cloud and Green Computing (CGC), pp. 219–225. November

  27. Boru, D., Kliazovich, D., Granelli, F., Bouvry, P., & Zomaya, A. Y. (2015). Energy-efficient data replication in cloud computing datacenters. Journal of Cluster Computing, 18(1), 385–402.

  28. Kliazovich, D., Bouvry, P., & Khan, S. U. (2012). GreenCloud: a packet-level Simulator of Energy-aware Cloud Computing Data Centers,. Journal of Supercomputing, 62(3), 1263–1283.

    Article  Google Scholar 

  29. Mansouri, N., Javidi, M. M., & Zade, B. M. H. (2021). A CSO-based approach for secure data replication in cloud computing environment. The Journal of Supercomputing, 77(6), 5882–5933.

    Article  Google Scholar 

  30. Xhafa, F., Kolici, V., Potlog, A. D., Spaho, E., Barolli, L., & Takizawa, M. (2012). Data replication in P2P collaborative systems. In 2012 7th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, pp 49–57. IEEE

  31. Wei, Q., Veeravalli, B., Gong, B., Zeng, L., & Feng, D., CDRM: A Cost-effective Dynamic Replication Management Scheme for Cloud Storage Cluster, in Proc. IEEE International Conference on Cluster Computing (CLUSTER), pp. 188–196, September 2010.

  32. Ranganathan, K., Iamnitchi, A., & Foster, I. (2002). Improving data availability through dynamic model-driven replication in large peer-to-peer communities, in 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid, pp. 376, May

  33. Challal, Z. (2010). Bouabana Tebibel, a priori replica placement strategy in data grid, in Proceedings of International Conference on Machine and Web Intelligence, ICMWI, pp. 402–406.

  34. Abdullah, A., Othman, M., Ibrahim, H., Sulaiman, M. N., & Othman, A. T., Decentralized replication strategies for P2P based scientific data grid, in Information Technology, 2008, ITSim 2008, International Symposium, vol. 3, pp. 1–8.

  35. Ranganathan, K., & Foster, I. (2001). Identifying dynamic replication strategies for a high-performance Data Grid, in Lecture Notes in Computer Science, vol. 2242, Springer, pp.75–86,

  36. Ranganathan, K., & Foster, I. (2001). Design and evaluation of dynamic replication strategies for a high-performance data grid, in International Conference on Computing in High Energy and Nuclear Physics, vol. 2001.

  37. Yuan, Y., Wu, Y., Yang, G., & Yu, F. (2007) Dynamic data replication based on local optimization principle in data grid.

  38. Chen, D. W., Zhou, S. T., Ren, X. Y., & Kong, Q. (2010). Methods for replica creation in datagrids using complex network. The Journal of China Universities of Posts and Telecommunications, 17(4).

  39. Chang, R. S., Chang, J. S., & Lin, S. Y. (2007). Job scheduling and data replication on data grids. Future Generation Computer System, 23(7), 846–860.

  40. Mansouri, N., & Dastghaibyfard, G. H. (2012). A dynamic replica management strategy in datagrid. Journal of Network and Computer Applications, 35(4), 1297–1303.

    Article  Google Scholar 

  41. Taheri, J., Zomaya, A. Y., Bouvry, P., & Khan, S. U. (2013). Hopfield neural network for simultaneous job scheduling and data replication in grids. Future Generation Computer Systems, 29(8), 1885–1900.

    Article  Google Scholar 

  42. Pérez, J. M., García-Carballeira, F., Carretero, J., Calderón, A., & Fernández, J. (2010). Branch replication scheme: a new model for data replication in large scale data grids. Future Generation Computer Systems, 26(1), 12–20.

    Article  Google Scholar 

  43. Mansouri, N, Dastghaibyfard, G. H., & Mansouri, E. (2013) Combination of data replication and scheduling algorithm for improving data availability in Data Grids, Available online 3 January 2013.

  44. Heravi, S., & Naji, H. (2014). Improving content reproduction in hybrid networks Delivery of peer-to-peer content considering users leaving the network, 9th International Symposium on Science and Technology Advances, Mashhad. https://civilica.com/doc/841605

  45. Tu, M., Li, P., Yen, I. L., Thuraisingham, B., & Khan, L. (2010). Secure data objects replication in data grid, IEEE Trans. Dependable and Secure Computing, 7(1), 50–64

  46. Rahman, R., Barker, K., & Alhajj, R. (2005). Replica placement in data grid: a multi-objective approach. In H. Zhuge, & C. F. Geoffrey (Eds.), Grid and cooperative computing-GCC 2005 (pp. 645–656). Berlin Heidelberg: Springer.

    Chapter  Google Scholar 

  47. Mohammad Khanlari, L., & Hosseinzadeh, R. (2011). Elnaz, Study and comparison of the performance of graph scheduling algorithms in the grid, the first national conference of computer and information technology scholars, Tabriz. https://civilica.com/doc/ 132203

  48. Lei, M., Vrbsky, S. V., & Hong, X. (2008). An on-line replication strategy to increase availability in data

  49. Globus Alliance (2008). GT 4.0 Data Management: Replica Location Service (RLS). http://www.globus.org/toolkit/data/rls,

  50. Mansouri, N., & Dastghaibyfard, G. H. (2012). A dynamic replica management strategy in data grid. Journal of network and computer applications, 35(4), 1297–1303.

    Article  Google Scholar 

  51. Kunszt, P., Laure, E., Stockinger, H., & Stockinger, K. (2005). File-based replica management. Future Generation Computer Systems, 21(1), 115–123.

    Article  Google Scholar 

  52. Bsoul, M., Al-Khasawneh, A., Abdallah, E. E., & Kilani, Y. (2011). Enhanced fast spread replication strategy for data grid. Journal of Network and Computer Applications, 34(2), 575–580.

    Article  Google Scholar 

  53. Zhong, H., Zhang, Z., & Zhang, X. (2010). A dynamic replica management strategy based on Data Grid. In 2010 9th International Conference on Grid and Cooperative Computing (GCC) (pp. 18–23). IEEE.

  54. Nukarapu, D. T., Tang, B., Wang, L., & Lu, S. (2011). Data replication in data intensive scientific applications with performance guarantee. IEEE Transactions on Parallel and Distributed Systems, 22(8),1299–1306.

  55. Lin, Y., Yang Chen, Guodong Wang, and Beixing Deng (2010) Rigel: A Scalable and Lightweight Replica Selection Service for Replicated Distributed File System, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

  56. Rahman, R. M., Alhajj, R., & Barker, K. (2008). Replica selection strategies in data grid. Journal of Parallel and Distributed Computing, 68(12), 1561–1574.

    Article  MATH  Google Scholar 

  57. Husni Hamad, E., & Mistarihi, A. L. (2009). Yong, on fairness, optimizing replica selection in data grids. IEEE Transactions On Parallel And Distributed Systems, 20(8), 1102–1111.

    Article  Google Scholar 

  58. Komai, Y., Sasaki, Y., Hara, T., & Nishio, S. (2015). K nearest neighbor search for location-dependent sensor data in MANETs. IEEE Access: Practical Innovations, Open Solutions, 3, 942–954.

    Article  Google Scholar 

  59. Zhong, H., Zhang, Z., & Zhang, X. (2010). A dynamic replica management strategy based on data grid, in: Proceedings of the 9th International Conference on Grid and Cloud Computing, GCC, pp. 18–23.

  60. Unceta, I., Nin, J., & Pujol, O. (2020). Environmental adaptation and Differential Replication in Machine Learning. Entropy, 22(10), 1122.

    Article  Google Scholar 

  61. Ribeiro, M. T., Singh, S., & Guestrin, C. (2018, April). Anchors: High-precision model-agnostic explanations. In Proceedings of the AAAI conference on artificial intelligence (Vol. 32, No. 1).

  62. Lee, M. C., Leu, F. Y., & Chen, Y. P. (2012). PFRF: an adaptive data replication algorithm based on star-topology data grids. Future Generation.

  63. Shorfuzzaman, M., Graham, P., & Eskicioglu, R. (2010). Adaptive popularity-driven replica placement in hierarchical data grids. The Journal of Supercomputing, 51(3), 374.

    Article  Google Scholar 

  64. Tang, M., Lee, B., & Yeo, C. K. (2005). Dynamic replication algorithm for the multi-tier data grid. Future Generation Computer Systems, 21(5), 775–790.

    Article  Google Scholar 

  65. Słota, R., Skitał, Ł., Nikolow, D., & Kitowski, J. (2006). Algorithms for automatic data replication in grid environment. Parallel Processing and Applied Mathematics (pp. 707–714). Berlin Heidelberg: Springer.

    Chapter  Google Scholar 

  66. Abdurrab, A. R., & Xie, T. (2010, May). Fire: A file reunion-based data replication strategy for data grids. In Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (pp. 215–223). IEEE Computer Society.

  67. Saadat, N., & Rahmani, A. M. (2012). PDDRA: a new pre-fetching based dynamic data replication algorithm in data grids. Future Generation Computer Systems, 28(4), 666–681.

    Article  Google Scholar 

  68. Park, S., Kim, J., Ko, Y., & Yoon, W. (2004). Dynamic data replication strategy based on internet hierarchy BHR (vol. 3033, pp. 838–846). Heidelberg: Springer Verlag. in: Lecture Notes in Computer Science Publisher.

    Google Scholar 

  69. Doğan, A. (2009). A study on performance of dynamic file replication algorithms for real- time file access in data grids. Future Generation Computer Systems, 25(8), 829–839.

    Article  Google Scholar 

  70. Zhang, Y., Nie, X., Jiang, J., Wang, W., Xu, K., Zhao, Y., … Yao, G. (2021). Bds+:An inter-datacenter data replication system with dynamic bandwidth separation. IEEE/ACM Transactions on Networking, 29(2), 918–934.

  71. Cui, L., Zhang, J., Yue, L., Shi, Y., Li, H., & Yuan, D. (2018). A genetic algorithm-based data replica placement strategy for scientific applications in clouds. IEEE Transactions On Services Computing, 11(4), 727–739.

    Article  Google Scholar 

  72. Shvachko, K., Hairong, K., Radia, S., & Chansler (2010). The Hadoop distributed file system. In Proceedings of the 26th symposium on mass storage systems and technologies, pp 1–10

  73. Junfeng, T., & Weiping, L. (2016). Pheromone-based genetic algorithm adaptive selection algorithm in cloud storage. Int J Grid Distrib Comput, 9(6), 269–278.

    Article  Google Scholar 

  74. Al Jadaan, O., Abdulal, W., Abdul Hameed, M., & Jabas, A. (2010). Enhancing data selection using genetic algorithm. In International conference on computational intelligence and communication networks

  75. Almomani, O., & Madi, M. (2014). A GA-based replica placement mechanism for data grid. Int J Adv Comput Sci Appl, 5(10), 1–6.

    Google Scholar 

  76. Grace, K., Rajkuma, M., Sumeetha, S., & Selvanayaki, P. (2014). GA based replica selection in data grid. In International conference on advances in engineering and technology

  77. Xu, Q., Xu, Z., & Wang, T. (2015). A data-placement strategy based on genetic algorithm in cloud computing. Int J Intell Sci, 5, 145–157.

    Google Scholar 

  78. Li, R., Hu, Y., & Lee, P. (2017). Enabling efficient and reliable transition from replication to erasure coding for clustered file systems. IEEE Transactions On Parallel And Distributed Systems, 28(9), 2500–2513.

    Article  Google Scholar 

  79. Wu, X. (2017). Combination replicas placements strategy for data sets from cost-effective view in the cloud. Int J Comput Intell Syst, 10, 521–539.

    Google Scholar 

  80. Chunlin, L., Ping, W. Y., Hengliang, T., & Youlong, L. (2019). Dynamic multi-objective optimized replica placement and migration strategies for SaaS applications in edge cloud. Future Gener Comput Syst, 100, 921–937.

    Article  Google Scholar 

  81. Li, T., Xie, Q., & Zhang, H. (2022). Design of college scheduling algorithm based on improved genetic ant colony hybrid optimization. Security and Communication Networks, 2022.

  82. Wang, L., Luo, J., Shen, J., & Dong, F. (2013). Cost and time aware ant colony algorithm for data replica in alpha magnetic spectrometer experiment. In IEEE international congress on big data, pp 247–254

  83. Sun, M., Sun, J., Lu, E., & Yu, C. (2005). Ant algorithm for file replica selection in data grid. In First international conference on semantics, knowledge and grid

  84. Yang, L., Lin, J., & Zheng, Y. (2013). A replica selection strategy on antalgorithm in data-intensive applications. Int J Online Eng, 9, 38–41.

    Article  Google Scholar 

  85. Jafari Navimipour, N., & Alami Milani, B. (2016). Replica selection in the cloud environments using an ant colony algorithm. In Third international conference on digital information processing, data mining, and wireless communications, pp 105–110

  86. Shojaatmand, A., Saghiri, N., Hashemi, S., & Abbasi Dezfoli, M. (2011). Improving replica selection in data grid using a dynamic ant algorithm. Int J Inf Stud, 3(4), 139.

    Google Scholar 

  87. Khalili Azimi, S. (2019). A bee colony (beehive) based approach for data replication in cloud environments. In S. M. Kouhsari (Ed.), Fundamental research in electrical engineering (pp. 1039–1052). Singapore: Springer.

    Chapter  Google Scholar 

  88. Taheri, J., Choon Lee, Y., Zomaya, A. Y., & Jay Siegel, H. (2013). A bee colony-based optimization approach for simultaneous job scheduling and data replication in grid environments. Computers & Operations Research, 40(6), 1564–1578.

    Article  MathSciNet  MATH  Google Scholar 

  89. Salem, R., Salam, M. A., Abdelkader, H., Awad, A., & Arafa, A. (2019). An artificial bee colony algorithm for data replication optimization in cloud environments. IEEE Access: Practical Innovations, Open Solutions, 7, 1–12.

    Google Scholar 

  90. Yang, X. S. (2013). Firefly algorithm: recent advances and applications. Int J Swarm Intell, 1(1), 36–50.

    Article  Google Scholar 

  91. Sadeghzadeh, M., & Navaezadeh, S. (2014). Improving replica in data grid by using firefly algorithm. In International conference on challenges in IT, engineering and technology (ICCIET’2014),pp 17–18

  92. Kchaou, H., Kechaou, Z., & Alimi, A. M. (2022). A PSO task scheduling and IT2FCM fuzzy data placement strategy for scientific cloud workflows. Journal of Computational Science, 64, 101840.

    Article  Google Scholar 

  93. Jayasree, P., & Saravanan, V. (2018). Apsdrdo: adaptive particle swarm division and replication of data optimization for security in cloud computing. IOSR J Eng.

  94. Ebadi, Y., & Jafari Navimipour, N. (2018). An energy-aware method for data replication in the cloud environments using a Tabu search and particle swarm optimization algorithm. Concurr Comput Pract Exp, 31, e4757.

    Article  Google Scholar 

  95. Mun˜oz, V. M., & Carballeira, F. G. (2006). PSO-LRU algorithm for data grid replication service. In: International conference on high performance computing for computational science, pp 656–669

  96. Awad, A., Salem, R., Abdelkader, H., & Salam, M. A.A (2021) Swarm Intelligence-based Approach for Dynamic Data Replication in a Cloud Environment. International Journal of Intelligent Engineering and Systems14(2), 271-284

  97. A.S. Tanenbaum and M. van Steen (2007) Distributed systems: principles and paradigms, Pearson Prentice Hall

  98. Lamehamedi, H., Szymanski, B., Shentu, Z., & Deelman, E. (2002). Data replication strategies in grid environments. In Proceedings Fifth International Conference on Algorithms and Architectures for Parallel Processing, pp. 378–383. IEEE

  99. Mardani, A., Zavadskas, E. K., Khalifah, Z., Zakuan, N., Jusoh, A., Nor, K. M., & Khoshnoudi, M. (2017). A review of multi-criteria decision-making applications to solve energy management problems: two decades from 1995 to 2015. Renewable And Sustainable Energy Reviews, 71, 216–256.

    Article  Google Scholar 

  100. Rigo, P. D., Rediske, G., Rosa, C. B., Gastaldo, N. G., Michels, L., Neuenfeldt, A. L. Jr., & Siluk, J. C. M. (2020). Renewable energy problems:exploring the methods to support the decision-making process. Sustainability, 12, 195.

    Article  Google Scholar 

  101. Bhardwaja, A., Joshia, M., Khoslaa, R., & Dubash, N. K. (2019). More priorities, more problems? Decision-making with multiple energy,development and climate objectives. Energy Res Soc Sci, 49, 143–157.

    Article  Google Scholar 

  102. Tranfield, D., Denyer, D., & Smart, P. (2003). Towards a methodology for developing evidence-informed management knowledge by means of systematic review. British Journal Of Management, 14, 207–222.

    Article  Google Scholar 

  103. Danesh, D., Ryan, M., & Abbasi, A. (2017). Multi-criteria decision-making methods for project portfolio management: a literature review. Int J Manag Decis Making, 17, 75–94.

    Google Scholar 

  104. Gabus, A., & Fontela, E. (1972). World Problems, an Invitation to Further Thought within the Framework of DEMATEL. Battelle Geneva Research Centre: Geneva, Switzerland,

  105. Brans, J. P., & Vincke, P. (1985). A preference ranking organization method (the PROMETHEE method for multiple criteria decision-making). Manag Sci, 31, 647–656.

    Article  MATH  Google Scholar 

  106. Hemam, S. M., Hioual, O., & Hioual, O. (2022). Dynamic load balancing upon the replication and deletion of cloud services. Journal of Intelligent & Fuzzy Systems, (Preprint),1–13.

  107. Saaty, T. (2004). Fundamentals of the analytic network process-dependence and feedback in decision-making with a single network.J. Syst Sci Syst Eng, 13, 129–157.

    Article  Google Scholar 

  108. Tharani, R. (2016). Balanced ant colony optimization algorithm for job scheduling in grid computing. Int J Eng Res Technol, 4(11), 1–6.

    Google Scholar 

  109. Zavadskas, E. K., Govindan, K., Antucheviciene, J., & Turskis, Z. (2016). Hybrid multiple criteria decision-making methods: a review of applications for sustainability issues. Econ Res Ekon Istraž, 29, 857–887.

    Google Scholar 

  110. Shen, Zavadskas, K. Y., & Tzeng, E. K. (2018). Updated discussions on ‘hybrid multiple criteria decision-making methods: a review of applications for sustainability issues. Econ Res Ekon Istraž, 31, 1437–1452.

    Google Scholar 

  111. Mukhametzyanov, I., & Pamucar, D. (2018). A sensitivity analysis in MCDM problems: a statistical approach. Decis Mak Appl Manag Eng, 2, 1–20.

    Google Scholar 

  112. Zadeh, L. A. (1965). Fuzzy sets. Information And Control, 8, 338–353.

    Article  MathSciNet  MATH  Google Scholar 

  113. Pisano, U., & Berger, G. (2016). Stakeholders activities in support of the 2030 agenda for Sustainable Development and the SDGs implementation: a view on current activities towards implementation. Vienna, Austria: ESDN Office.

    Google Scholar 

  114. Kara¸san, A., & Kahraman, C. (2018). A novel interval-valued neutrosophic EDAS method: prioritization of the United Nations National Sustainable Development Goals. Soft Computing, 22, 4891–4906.

    Article  Google Scholar 

  115. Oliveira, A., Calili, R., Almeida, M. F., & Sousa, M. (2019). A systemic and contextual framework to define a country’s 2030 Agenda from a foresight perspective. Sustainability, 11, 6360.

    Article  Google Scholar 

  116. Resce, G., & Schiltz, F. (2020). Sustainable development in Europe: A multicriteria decision analysis. Rev. Income Wealth

  117. Breu, T., Bergöö, M., Ebneter, L., Pham-Trufert, M., Bieri, S., Messerli, P., Ott, C., & Bader, C. (2020). Where to begin? Defining national strategies for implementing the 2030 Agenda: The case of Switzerland. Sustain. Sci. 16.

  118. Benítez, R., & Liern, V. (2020) Unweighted TOPSIS: A new multicriteria tool for sustainability analysis. Int. J. Sustain. Dev. World Ecol., 1–13.

  119. Jayaraman, R., Colapinto, C., Liuzzi, D., & La Torre, D. (2016). Planning sustainable development through a scenario-based stochastic goal programming model. Operations Research, 17, 789–805.

    Article  Google Scholar 

  120. Mukherjee, S. (2017). Selection of alternative fuels for sustainable urban transportation under multi-criteria intuitionistic fuzzy environment. Fuzzy Inf Eng, 9, 117–135.

    Article  Google Scholar 

  121. Karabulut, A. A., Udias, A., & Vigiak, O. (2019). Assessing the policy scenarios for the Ecosystem Water Food Energy (EWFE) nexus in the Mediterranean region. Ecosyst Serv, 35, 231–240.

    Article  Google Scholar 

  122. De, P., & Majumder, M. (2020). Allocation of energy in surface water treatment plants for maximum energy conservation. Environment, Development And Sustainability, 22, 3347–3370.

    Article  Google Scholar 

  123. Mandeh, A., Khamforoosh, K., & Vafa Maihami. (2015). Data fusion in wireless sensor networks using fuzzy systems. International Journal of Computer Applications, 125, 12.

    Article  Google Scholar 

  124. Sabaghian, K., Khamforoosh, K., & Ghaderzadeh, A. (2021). Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments. Plos one, 16(7), e0254210.

  125. Beigrezaei, M., Haghighat, A. T., & Mirtaheri, S. L. (2021). Improve Performance by a Fuzzy-Based Dynamic Replication Algorithm. In Grid, Cloud, and Fog. Mathematical Problems in Engineering, 2021.

  126. Shorfuzzaman, M., Graham, P., & Eskicioglu, R. (2010). Adaptive popularity-driven replica placement in hierarchical data grids. The Journal of Supercomputing, 51(3), 374.

    Article  Google Scholar 

  127. Bokhari, S. M. A., & Theel, O. (2020). A genetic programming-based multi-objective optimization approach to data replication strategies for distributed systems. In 2020 IEEE Congress on Evolutionary Computation (CEC) (pp. 1–9). IEEE.

  128. Sattar, A. M., Ertuğrul, Ö. F., Gharabaghi, B., McBean, E. A., & Cao, J. (2019). Extreme learning machine model for water network management. Neural Computing And Applications, 31(1), 157–169.

    Article  Google Scholar 

  129. Sethi, K., Jaiswal, V., & Ansari, M. D. (2020). Machine learning based support system for students to select stream (subject). Recent Advances in Computer Science and Communications (Formerly: Recent Patents on Computer Science), 13(3), 336–344.

  130. Ribeiro, M. T., Singh, S., & Guestrin, C. (2018). Anchors: High-precision model-agnostic explanations. In Proceedings of the AAAI conference on artificial intelligence (Vol. 32, No. 1).

  131. Pengcheng Xiong, Y., Chi, S., Zhu, H. J., & Moon (2011). Calton Pu, and Hakan Hacigumus. Intelligent management of virtualized resources for database systems in cloud environment. In International Conference on Data Engineering, pages 87–98, ISBN 9781424489589. doi: https://doi.org/10.1109/ICDE.2011.5767928.

  132. Han, D., Wooldridge, M., Rogers, A., Tople, S., Ohrimenko, O., & Tschiatschek, S. (2020). Replication-Robust Payoff-Allocation for Machine Learning Data Markets. arXiv preprint arXiv:2006.14583.

  133. Zhao, Y., & Hu, Y. (2003) GRESS: a grid replica selection service, in Proceedings of the 16th International Conference on Parallel and Distributed Computing Systems, pp. 423–429.

Download references

Funding

The authors did not receive support from any organization for the submitted work.

Author information

Authors and Affiliations

Authors

Contributions

KS and KK have proposed the idea of doing a survey on Data Replication. They have done the literature review and classification. KK as a supervisor managed the process, reviewed the article critically. AG proposed several useful revisions.

Corresponding author

Correspondence to Keyhan Khamforoosh.

Ethics declarations

Conflict of Interest

All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sabaghian, K., Khamforoosh, K. & Ghaderzadeh, A. Data Replication and Placement Strategies in Distributed Systems: A State of the Art Survey. Wireless Pers Commun 129, 2419–2453 (2023). https://doi.org/10.1007/s11277-023-10240-7

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11277-023-10240-7

Keywords

Navigation