Abstract
Data replication is an important mechanism for managing data distribution. The general purpose of data replication is to place data in different locations so that it is available to the requester as quickly as possible. In replication, multiple copies of data are stored in multiple locations, which increases availability, fault tolerance, load balancing, and scalability. It also decreases bandwidth consumption and response time. In this paper, data replication scenarios classify to the types of methods, the number of replications, locations, best replication based on energy, types of architecture, replication management, and criteria selection for best replication. The parameters of a good replication strategy are identified and expressed, which simultaneously include reducing access time, reducing bandwidth consumption, increasing the availability of storage resources, and balancing replication. By examining and analyzing of recent papers, solutions classify to the dynamic solution, meta-heuristic solution, multiple criteria decision-making solution, and machine learning solution. Our research show that the best of which is the multiple criteria decision-making solution methods.
Similar content being viewed by others
Data Availability
All materials included in this paper are available and referenced properly.
Notes
A problem is NP-hard if an algorithm for solving it can be translated into one for solving any NP-problem (nondeterministic polynomial time) problem.
References
Goel, S., & Buyya, R. (2006). Data replication strategies in wide-area distributed systems, in Enterprise Service Computing: From Concept to Deployment, vol.17
Huang, K., Li, D., & Sun, Y. (2014). CRMS: a Centralized Replication Management Scheme for Cloud Storage System, in Proc. IEEE/CIC International Conference on Communications in China (ICCC), pp. 344–348, October
Goel, S., & Buyya, R. (2006). Data replication strategies in wide-area distributed systems, in Enterprise Service Computing: From Concept to Deployment, vol. 17.
Bin, L., Jiong, Y., Hua, S., & Mei, N. (2012). A QoS-aware Dynamic Data Replica Deletion Strategy for Distributed Storage Systems under Cloud Computing Environments, in Proc. Second International Conference on Cloud and Green Computing (CGC), pp. 219–225. November.
Ranganathan, K., & Foster, I. (2001) Design and Evaluation of Dynamic Replication Strategies for a High-Performance Data Grid, in The International Conference on Computing.
Kingsy Grace, R., & Manimegalai, R. (Feb. 2014). Dynamic replica placement and selection strategies in data grids: a comprehensive survey. Journal of Parallel and Distributed Computing, 74(2), 2099–2108.
Chang, R. S., & Hui-Ping, C. (2008). A dynamic data replication strategy using access-weights in data grids. The Journal of Supercomputing, 45(3), 277–295.
Seguela, M., Mokadem, R., & Pierson, J. M. (2019). Comparing energy-aware vs. cost-aware data replication strategy. In 2019 Tenth International Green and Sustainable Computing Conference (IGSC) (pp. 1–8). IEEE.
Ghemawat, S., Gobioff, H., & Leung, S. T. (2003). The Google File System, in Proc. Nineteenth ACM symposium on Operating systems principles (SOSP), Vol. 37, No. 5, pp. 29–43, December
Hongxia, W. (2016). Application of VCG in Replica Placement Strategy of Cloud Storage. Int J Grid Distrib Comput, 9(4), 27–40.
Yin, Y., & Deng, L. (2022). A dynamic decentralized strategy of replica placement on edge computing. International Journal of Distributed Sensor Networks, 18(8), 15501329221115064.
Rajalakshmi, D., Vijayakumar, & Srinivasagan, K. G. (2014). An improved dynamic data replica selection and placement in cloud, 2014 Int. Conf. Recent Trends Inf. Technol. ICRTIT vol. 3, no. 3, 2014.
Shvachko, K., Kuang, H., Radia, S., & Chansler, R. (2010). The Hadoop Distributed File System, in Proc. IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), pp. 1–10, May
Zhang, Y., Zheng, Z., & Lyu, M. R. (2011) BFTCloud: A byzantine fault tolerance framework for voluntary-resource cloud computing, in Proc. IEEE International Conference on Cloud Computing (CLOUD), pp. 444–451.
Veronese, G. S., Correia, M., Bessani, A. N., Lung, L. C., & Verissimo, P. (January 2013). Efficient byzantine Fault-Tolerance. Journal of IEEE Transactions on Computers, 62(1), 16–30.
Tan, Y., Luo, D., & Wang, J. (2010). Cc-vit: Virtualization intrusion tolerance based on cloud computing, in Proc. 2nd International Conference on Information Engineering and Computer Science (ICIECS), pp. 1–6, December
Sun, D. W., Chang, G. R., Gao, S., et al. (2012). Modeling a dynamic data replication strategy to increase system availability in Cloud Computing environments. Journal of Computer Science and Technology, 27(2), 256–272.
Wei, Q., Veeravalli, B., Gong, B., Zeng, L., & Feng, D., CDRM: A cost-effective dynamic replication management scheme for cloud storage cluster, in Proc. IEEE International Conference on Cluster Computing (CLUSTER), pp. 188–196, September 2010.
Vashisht, P., Kumar, V., Kumar, R., & Sharma, A. (2019). Optimizing replica creation using agents in data grids, in Proceedings of the 2019 Amity International Conference on Artificial Intelligence (AICAI), pp. 542–547, Dubai, UAE, February
Li, C., Song, M., Zhang, M., & Luo, Y. (2020). Effective replica management for improving reliability and availability in edgecloud computing environment. Journal of Parallel and Distributed Computing, 143, 107–128.
Gill, N. K., & Singh, S. (2016). A dynamic, cost-aware, optimized data replication strategy for heterogeneous cloud data centers. Future Generation Computer Systems, 65, 10–32.
Mesbahi, M., & Rahmani, A. M. (2016). Load balancing in Cloud Computing: a state-of-the-art survey. International Journal of Modern Education and Computer Science, 8, 64–78.
Kumar, K. A., Quamar, A., Deshpande, A., & Khuller, S. (December 2014). SWORD: workload-aware data placement and replica selection for cloud data management systems, VLDB Journal, 23(6), 845–870,
Yanzhen, Q., Naixue, X., & Resilient, R. F. H. A. (2012). Fault-Tolerant and High-efficient Replication Algorithm for Distributed Cloud Storage, in Proc. 41st International Conference on Parallel Processing (ICPP), pp. 520–529, September
Janpet, J., & Wen, Y. F. (2013). Reliable and Available Data Replication Planning for Cloud Storage, in Proc. IEEE 27th International Conference on Advanced Information Networking and Applications (AINA), pp. 772–779, March
Bin, L., Jiong, Y., Hua, S., & Mei, N. (2012). A QoS-aware Dynamic Data Replica Deletion Strategy for Distributed Storage Systems under Cloud Computing Environments, in Proc. Second International Conference on Cloud and Green Computing (CGC), pp. 219–225. November
Boru, D., Kliazovich, D., Granelli, F., Bouvry, P., & Zomaya, A. Y. (2015). Energy-efficient data replication in cloud computing datacenters. Journal of Cluster Computing, 18(1), 385–402.
Kliazovich, D., Bouvry, P., & Khan, S. U. (2012). GreenCloud: a packet-level Simulator of Energy-aware Cloud Computing Data Centers,. Journal of Supercomputing, 62(3), 1263–1283.
Mansouri, N., Javidi, M. M., & Zade, B. M. H. (2021). A CSO-based approach for secure data replication in cloud computing environment. The Journal of Supercomputing, 77(6), 5882–5933.
Xhafa, F., Kolici, V., Potlog, A. D., Spaho, E., Barolli, L., & Takizawa, M. (2012). Data replication in P2P collaborative systems. In 2012 7th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, pp 49–57. IEEE
Wei, Q., Veeravalli, B., Gong, B., Zeng, L., & Feng, D., CDRM: A Cost-effective Dynamic Replication Management Scheme for Cloud Storage Cluster, in Proc. IEEE International Conference on Cluster Computing (CLUSTER), pp. 188–196, September 2010.
Ranganathan, K., Iamnitchi, A., & Foster, I. (2002). Improving data availability through dynamic model-driven replication in large peer-to-peer communities, in 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid, pp. 376, May
Challal, Z. (2010). Bouabana Tebibel, a priori replica placement strategy in data grid, in Proceedings of International Conference on Machine and Web Intelligence, ICMWI, pp. 402–406.
Abdullah, A., Othman, M., Ibrahim, H., Sulaiman, M. N., & Othman, A. T., Decentralized replication strategies for P2P based scientific data grid, in Information Technology, 2008, ITSim 2008, International Symposium, vol. 3, pp. 1–8.
Ranganathan, K., & Foster, I. (2001). Identifying dynamic replication strategies for a high-performance Data Grid, in Lecture Notes in Computer Science, vol. 2242, Springer, pp.75–86,
Ranganathan, K., & Foster, I. (2001). Design and evaluation of dynamic replication strategies for a high-performance data grid, in International Conference on Computing in High Energy and Nuclear Physics, vol. 2001.
Yuan, Y., Wu, Y., Yang, G., & Yu, F. (2007) Dynamic data replication based on local optimization principle in data grid.
Chen, D. W., Zhou, S. T., Ren, X. Y., & Kong, Q. (2010). Methods for replica creation in datagrids using complex network. The Journal of China Universities of Posts and Telecommunications, 17(4).
Chang, R. S., Chang, J. S., & Lin, S. Y. (2007). Job scheduling and data replication on data grids. Future Generation Computer System, 23(7), 846–860.
Mansouri, N., & Dastghaibyfard, G. H. (2012). A dynamic replica management strategy in datagrid. Journal of Network and Computer Applications, 35(4), 1297–1303.
Taheri, J., Zomaya, A. Y., Bouvry, P., & Khan, S. U. (2013). Hopfield neural network for simultaneous job scheduling and data replication in grids. Future Generation Computer Systems, 29(8), 1885–1900.
Pérez, J. M., García-Carballeira, F., Carretero, J., Calderón, A., & Fernández, J. (2010). Branch replication scheme: a new model for data replication in large scale data grids. Future Generation Computer Systems, 26(1), 12–20.
Mansouri, N, Dastghaibyfard, G. H., & Mansouri, E. (2013) Combination of data replication and scheduling algorithm for improving data availability in Data Grids, Available online 3 January 2013.
Heravi, S., & Naji, H. (2014). Improving content reproduction in hybrid networks Delivery of peer-to-peer content considering users leaving the network, 9th International Symposium on Science and Technology Advances, Mashhad. https://civilica.com/doc/841605
Tu, M., Li, P., Yen, I. L., Thuraisingham, B., & Khan, L. (2010). Secure data objects replication in data grid, IEEE Trans. Dependable and Secure Computing, 7(1), 50–64
Rahman, R., Barker, K., & Alhajj, R. (2005). Replica placement in data grid: a multi-objective approach. In H. Zhuge, & C. F. Geoffrey (Eds.), Grid and cooperative computing-GCC 2005 (pp. 645–656). Berlin Heidelberg: Springer.
Mohammad Khanlari, L., & Hosseinzadeh, R. (2011). Elnaz, Study and comparison of the performance of graph scheduling algorithms in the grid, the first national conference of computer and information technology scholars, Tabriz. https://civilica.com/doc/ 132203
Lei, M., Vrbsky, S. V., & Hong, X. (2008). An on-line replication strategy to increase availability in data
Globus Alliance (2008). GT 4.0 Data Management: Replica Location Service (RLS). http://www.globus.org/toolkit/data/rls,
Mansouri, N., & Dastghaibyfard, G. H. (2012). A dynamic replica management strategy in data grid. Journal of network and computer applications, 35(4), 1297–1303.
Kunszt, P., Laure, E., Stockinger, H., & Stockinger, K. (2005). File-based replica management. Future Generation Computer Systems, 21(1), 115–123.
Bsoul, M., Al-Khasawneh, A., Abdallah, E. E., & Kilani, Y. (2011). Enhanced fast spread replication strategy for data grid. Journal of Network and Computer Applications, 34(2), 575–580.
Zhong, H., Zhang, Z., & Zhang, X. (2010). A dynamic replica management strategy based on Data Grid. In 2010 9th International Conference on Grid and Cooperative Computing (GCC) (pp. 18–23). IEEE.
Nukarapu, D. T., Tang, B., Wang, L., & Lu, S. (2011). Data replication in data intensive scientific applications with performance guarantee. IEEE Transactions on Parallel and Distributed Systems, 22(8),1299–1306.
Lin, Y., Yang Chen, Guodong Wang, and Beixing Deng (2010) Rigel: A Scalable and Lightweight Replica Selection Service for Replicated Distributed File System, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.
Rahman, R. M., Alhajj, R., & Barker, K. (2008). Replica selection strategies in data grid. Journal of Parallel and Distributed Computing, 68(12), 1561–1574.
Husni Hamad, E., & Mistarihi, A. L. (2009). Yong, on fairness, optimizing replica selection in data grids. IEEE Transactions On Parallel And Distributed Systems, 20(8), 1102–1111.
Komai, Y., Sasaki, Y., Hara, T., & Nishio, S. (2015). K nearest neighbor search for location-dependent sensor data in MANETs. IEEE Access: Practical Innovations, Open Solutions, 3, 942–954.
Zhong, H., Zhang, Z., & Zhang, X. (2010). A dynamic replica management strategy based on data grid, in: Proceedings of the 9th International Conference on Grid and Cloud Computing, GCC, pp. 18–23.
Unceta, I., Nin, J., & Pujol, O. (2020). Environmental adaptation and Differential Replication in Machine Learning. Entropy, 22(10), 1122.
Ribeiro, M. T., Singh, S., & Guestrin, C. (2018, April). Anchors: High-precision model-agnostic explanations. In Proceedings of the AAAI conference on artificial intelligence (Vol. 32, No. 1).
Lee, M. C., Leu, F. Y., & Chen, Y. P. (2012). PFRF: an adaptive data replication algorithm based on star-topology data grids. Future Generation.
Shorfuzzaman, M., Graham, P., & Eskicioglu, R. (2010). Adaptive popularity-driven replica placement in hierarchical data grids. The Journal of Supercomputing, 51(3), 374.
Tang, M., Lee, B., & Yeo, C. K. (2005). Dynamic replication algorithm for the multi-tier data grid. Future Generation Computer Systems, 21(5), 775–790.
Słota, R., Skitał, Ł., Nikolow, D., & Kitowski, J. (2006). Algorithms for automatic data replication in grid environment. Parallel Processing and Applied Mathematics (pp. 707–714). Berlin Heidelberg: Springer.
Abdurrab, A. R., & Xie, T. (2010, May). Fire: A file reunion-based data replication strategy for data grids. In Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (pp. 215–223). IEEE Computer Society.
Saadat, N., & Rahmani, A. M. (2012). PDDRA: a new pre-fetching based dynamic data replication algorithm in data grids. Future Generation Computer Systems, 28(4), 666–681.
Park, S., Kim, J., Ko, Y., & Yoon, W. (2004). Dynamic data replication strategy based on internet hierarchy BHR (vol. 3033, pp. 838–846). Heidelberg: Springer Verlag. in: Lecture Notes in Computer Science Publisher.
Doğan, A. (2009). A study on performance of dynamic file replication algorithms for real- time file access in data grids. Future Generation Computer Systems, 25(8), 829–839.
Zhang, Y., Nie, X., Jiang, J., Wang, W., Xu, K., Zhao, Y., … Yao, G. (2021). Bds+:An inter-datacenter data replication system with dynamic bandwidth separation. IEEE/ACM Transactions on Networking, 29(2), 918–934.
Cui, L., Zhang, J., Yue, L., Shi, Y., Li, H., & Yuan, D. (2018). A genetic algorithm-based data replica placement strategy for scientific applications in clouds. IEEE Transactions On Services Computing, 11(4), 727–739.
Shvachko, K., Hairong, K., Radia, S., & Chansler (2010). The Hadoop distributed file system. In Proceedings of the 26th symposium on mass storage systems and technologies, pp 1–10
Junfeng, T., & Weiping, L. (2016). Pheromone-based genetic algorithm adaptive selection algorithm in cloud storage. Int J Grid Distrib Comput, 9(6), 269–278.
Al Jadaan, O., Abdulal, W., Abdul Hameed, M., & Jabas, A. (2010). Enhancing data selection using genetic algorithm. In International conference on computational intelligence and communication networks
Almomani, O., & Madi, M. (2014). A GA-based replica placement mechanism for data grid. Int J Adv Comput Sci Appl, 5(10), 1–6.
Grace, K., Rajkuma, M., Sumeetha, S., & Selvanayaki, P. (2014). GA based replica selection in data grid. In International conference on advances in engineering and technology
Xu, Q., Xu, Z., & Wang, T. (2015). A data-placement strategy based on genetic algorithm in cloud computing. Int J Intell Sci, 5, 145–157.
Li, R., Hu, Y., & Lee, P. (2017). Enabling efficient and reliable transition from replication to erasure coding for clustered file systems. IEEE Transactions On Parallel And Distributed Systems, 28(9), 2500–2513.
Wu, X. (2017). Combination replicas placements strategy for data sets from cost-effective view in the cloud. Int J Comput Intell Syst, 10, 521–539.
Chunlin, L., Ping, W. Y., Hengliang, T., & Youlong, L. (2019). Dynamic multi-objective optimized replica placement and migration strategies for SaaS applications in edge cloud. Future Gener Comput Syst, 100, 921–937.
Li, T., Xie, Q., & Zhang, H. (2022). Design of college scheduling algorithm based on improved genetic ant colony hybrid optimization. Security and Communication Networks, 2022.
Wang, L., Luo, J., Shen, J., & Dong, F. (2013). Cost and time aware ant colony algorithm for data replica in alpha magnetic spectrometer experiment. In IEEE international congress on big data, pp 247–254
Sun, M., Sun, J., Lu, E., & Yu, C. (2005). Ant algorithm for file replica selection in data grid. In First international conference on semantics, knowledge and grid
Yang, L., Lin, J., & Zheng, Y. (2013). A replica selection strategy on antalgorithm in data-intensive applications. Int J Online Eng, 9, 38–41.
Jafari Navimipour, N., & Alami Milani, B. (2016). Replica selection in the cloud environments using an ant colony algorithm. In Third international conference on digital information processing, data mining, and wireless communications, pp 105–110
Shojaatmand, A., Saghiri, N., Hashemi, S., & Abbasi Dezfoli, M. (2011). Improving replica selection in data grid using a dynamic ant algorithm. Int J Inf Stud, 3(4), 139.
Khalili Azimi, S. (2019). A bee colony (beehive) based approach for data replication in cloud environments. In S. M. Kouhsari (Ed.), Fundamental research in electrical engineering (pp. 1039–1052). Singapore: Springer.
Taheri, J., Choon Lee, Y., Zomaya, A. Y., & Jay Siegel, H. (2013). A bee colony-based optimization approach for simultaneous job scheduling and data replication in grid environments. Computers & Operations Research, 40(6), 1564–1578.
Salem, R., Salam, M. A., Abdelkader, H., Awad, A., & Arafa, A. (2019). An artificial bee colony algorithm for data replication optimization in cloud environments. IEEE Access: Practical Innovations, Open Solutions, 7, 1–12.
Yang, X. S. (2013). Firefly algorithm: recent advances and applications. Int J Swarm Intell, 1(1), 36–50.
Sadeghzadeh, M., & Navaezadeh, S. (2014). Improving replica in data grid by using firefly algorithm. In International conference on challenges in IT, engineering and technology (ICCIET’2014),pp 17–18
Kchaou, H., Kechaou, Z., & Alimi, A. M. (2022). A PSO task scheduling and IT2FCM fuzzy data placement strategy for scientific cloud workflows. Journal of Computational Science, 64, 101840.
Jayasree, P., & Saravanan, V. (2018). Apsdrdo: adaptive particle swarm division and replication of data optimization for security in cloud computing. IOSR J Eng.
Ebadi, Y., & Jafari Navimipour, N. (2018). An energy-aware method for data replication in the cloud environments using a Tabu search and particle swarm optimization algorithm. Concurr Comput Pract Exp, 31, e4757.
Mun˜oz, V. M., & Carballeira, F. G. (2006). PSO-LRU algorithm for data grid replication service. In: International conference on high performance computing for computational science, pp 656–669
Awad, A., Salem, R., Abdelkader, H., & Salam, M. A.A (2021) Swarm Intelligence-based Approach for Dynamic Data Replication in a Cloud Environment. International Journal of Intelligent Engineering and Systems14(2), 271-284
A.S. Tanenbaum and M. van Steen (2007) Distributed systems: principles and paradigms, Pearson Prentice Hall
Lamehamedi, H., Szymanski, B., Shentu, Z., & Deelman, E. (2002). Data replication strategies in grid environments. In Proceedings Fifth International Conference on Algorithms and Architectures for Parallel Processing, pp. 378–383. IEEE
Mardani, A., Zavadskas, E. K., Khalifah, Z., Zakuan, N., Jusoh, A., Nor, K. M., & Khoshnoudi, M. (2017). A review of multi-criteria decision-making applications to solve energy management problems: two decades from 1995 to 2015. Renewable And Sustainable Energy Reviews, 71, 216–256.
Rigo, P. D., Rediske, G., Rosa, C. B., Gastaldo, N. G., Michels, L., Neuenfeldt, A. L. Jr., & Siluk, J. C. M. (2020). Renewable energy problems:exploring the methods to support the decision-making process. Sustainability, 12, 195.
Bhardwaja, A., Joshia, M., Khoslaa, R., & Dubash, N. K. (2019). More priorities, more problems? Decision-making with multiple energy,development and climate objectives. Energy Res Soc Sci, 49, 143–157.
Tranfield, D., Denyer, D., & Smart, P. (2003). Towards a methodology for developing evidence-informed management knowledge by means of systematic review. British Journal Of Management, 14, 207–222.
Danesh, D., Ryan, M., & Abbasi, A. (2017). Multi-criteria decision-making methods for project portfolio management: a literature review. Int J Manag Decis Making, 17, 75–94.
Gabus, A., & Fontela, E. (1972). World Problems, an Invitation to Further Thought within the Framework of DEMATEL. Battelle Geneva Research Centre: Geneva, Switzerland,
Brans, J. P., & Vincke, P. (1985). A preference ranking organization method (the PROMETHEE method for multiple criteria decision-making). Manag Sci, 31, 647–656.
Hemam, S. M., Hioual, O., & Hioual, O. (2022). Dynamic load balancing upon the replication and deletion of cloud services. Journal of Intelligent & Fuzzy Systems, (Preprint),1–13.
Saaty, T. (2004). Fundamentals of the analytic network process-dependence and feedback in decision-making with a single network.J. Syst Sci Syst Eng, 13, 129–157.
Tharani, R. (2016). Balanced ant colony optimization algorithm for job scheduling in grid computing. Int J Eng Res Technol, 4(11), 1–6.
Zavadskas, E. K., Govindan, K., Antucheviciene, J., & Turskis, Z. (2016). Hybrid multiple criteria decision-making methods: a review of applications for sustainability issues. Econ Res Ekon Istraž, 29, 857–887.
Shen, Zavadskas, K. Y., & Tzeng, E. K. (2018). Updated discussions on ‘hybrid multiple criteria decision-making methods: a review of applications for sustainability issues. Econ Res Ekon Istraž, 31, 1437–1452.
Mukhametzyanov, I., & Pamucar, D. (2018). A sensitivity analysis in MCDM problems: a statistical approach. Decis Mak Appl Manag Eng, 2, 1–20.
Zadeh, L. A. (1965). Fuzzy sets. Information And Control, 8, 338–353.
Pisano, U., & Berger, G. (2016). Stakeholders activities in support of the 2030 agenda for Sustainable Development and the SDGs implementation: a view on current activities towards implementation. Vienna, Austria: ESDN Office.
Kara¸san, A., & Kahraman, C. (2018). A novel interval-valued neutrosophic EDAS method: prioritization of the United Nations National Sustainable Development Goals. Soft Computing, 22, 4891–4906.
Oliveira, A., Calili, R., Almeida, M. F., & Sousa, M. (2019). A systemic and contextual framework to define a country’s 2030 Agenda from a foresight perspective. Sustainability, 11, 6360.
Resce, G., & Schiltz, F. (2020). Sustainable development in Europe: A multicriteria decision analysis. Rev. Income Wealth
Breu, T., Bergöö, M., Ebneter, L., Pham-Trufert, M., Bieri, S., Messerli, P., Ott, C., & Bader, C. (2020). Where to begin? Defining national strategies for implementing the 2030 Agenda: The case of Switzerland. Sustain. Sci. 16.
Benítez, R., & Liern, V. (2020) Unweighted TOPSIS: A new multicriteria tool for sustainability analysis. Int. J. Sustain. Dev. World Ecol., 1–13.
Jayaraman, R., Colapinto, C., Liuzzi, D., & La Torre, D. (2016). Planning sustainable development through a scenario-based stochastic goal programming model. Operations Research, 17, 789–805.
Mukherjee, S. (2017). Selection of alternative fuels for sustainable urban transportation under multi-criteria intuitionistic fuzzy environment. Fuzzy Inf Eng, 9, 117–135.
Karabulut, A. A., Udias, A., & Vigiak, O. (2019). Assessing the policy scenarios for the Ecosystem Water Food Energy (EWFE) nexus in the Mediterranean region. Ecosyst Serv, 35, 231–240.
De, P., & Majumder, M. (2020). Allocation of energy in surface water treatment plants for maximum energy conservation. Environment, Development And Sustainability, 22, 3347–3370.
Mandeh, A., Khamforoosh, K., & Vafa Maihami. (2015). Data fusion in wireless sensor networks using fuzzy systems. International Journal of Computer Applications, 125, 12.
Sabaghian, K., Khamforoosh, K., & Ghaderzadeh, A. (2021). Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments. Plos one, 16(7), e0254210.
Beigrezaei, M., Haghighat, A. T., & Mirtaheri, S. L. (2021). Improve Performance by a Fuzzy-Based Dynamic Replication Algorithm. In Grid, Cloud, and Fog. Mathematical Problems in Engineering, 2021.
Shorfuzzaman, M., Graham, P., & Eskicioglu, R. (2010). Adaptive popularity-driven replica placement in hierarchical data grids. The Journal of Supercomputing, 51(3), 374.
Bokhari, S. M. A., & Theel, O. (2020). A genetic programming-based multi-objective optimization approach to data replication strategies for distributed systems. In 2020 IEEE Congress on Evolutionary Computation (CEC) (pp. 1–9). IEEE.
Sattar, A. M., Ertuğrul, Ö. F., Gharabaghi, B., McBean, E. A., & Cao, J. (2019). Extreme learning machine model for water network management. Neural Computing And Applications, 31(1), 157–169.
Sethi, K., Jaiswal, V., & Ansari, M. D. (2020). Machine learning based support system for students to select stream (subject). Recent Advances in Computer Science and Communications (Formerly: Recent Patents on Computer Science), 13(3), 336–344.
Ribeiro, M. T., Singh, S., & Guestrin, C. (2018). Anchors: High-precision model-agnostic explanations. In Proceedings of the AAAI conference on artificial intelligence (Vol. 32, No. 1).
Pengcheng Xiong, Y., Chi, S., Zhu, H. J., & Moon (2011). Calton Pu, and Hakan Hacigumus. Intelligent management of virtualized resources for database systems in cloud environment. In International Conference on Data Engineering, pages 87–98, ISBN 9781424489589. doi: https://doi.org/10.1109/ICDE.2011.5767928.
Han, D., Wooldridge, M., Rogers, A., Tople, S., Ohrimenko, O., & Tschiatschek, S. (2020). Replication-Robust Payoff-Allocation for Machine Learning Data Markets. arXiv preprint arXiv:2006.14583.
Zhao, Y., & Hu, Y. (2003) GRESS: a grid replica selection service, in Proceedings of the 16th International Conference on Parallel and Distributed Computing Systems, pp. 423–429.
Funding
The authors did not receive support from any organization for the submitted work.
Author information
Authors and Affiliations
Contributions
KS and KK have proposed the idea of doing a survey on Data Replication. They have done the literature review and classification. KK as a supervisor managed the process, reviewed the article critically. AG proposed several useful revisions.
Corresponding author
Ethics declarations
Conflict of Interest
All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sabaghian, K., Khamforoosh, K. & Ghaderzadeh, A. Data Replication and Placement Strategies in Distributed Systems: A State of the Art Survey. Wireless Pers Commun 129, 2419–2453 (2023). https://doi.org/10.1007/s11277-023-10240-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11277-023-10240-7