Abstract
Replication of data on variant sites is a well-known technique used to face many challenges in data grid systems. In this respect, selecting the appropriate site to place a new replica is crucial to improve data grid performance. Many replicas placement strategies have been proposed in the literature. Each one of them targets a specific goal. In this paper, we try to reach a new goal which is the improvement of the replicas distribution quality. To this purpose, a new placement strategy is proposed in which the barycenter method is applied. This strategy allows to reduce the total remote accesses cost, to increase the quality of service, and to ensure that the benefits of the new replica placement will remain for a long-term. We evaluate the strategy through the OptorSim simulator. Results show that the new strategy achieves significant improvement in terms of replicas distribution quality and total execution time, in the short-term as well as in the long-term.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
WLCG: Worldwide LHC Computing Grid. http://www.wlcgwebcernch/. Accessed Sep 5 2016
Adya, A., Bolosky, W.J., Castro, M., Cermak, G., Chaiken, R., Douceur, J.R., Howell, J., Lorch, J.R., Theimer, M., Wattenhofer, R.P.: FARSITE: Federated, available, and reliable storage for an incompletely trusted environment. In: Proceedings of the 5th Symposium on Operating Systems Design and Implementation, pp. 1–14 (2002)
Al Mistarihi, H.H.E., Yong, C.H.: Replica management in data grid. Int. J. Comput. Sci. Network Secur. 8(6), 22–23 (2008)
Amjad, T., Sher, M., Dau, A.: A survey of dynamic replication strategies for improving data availability in data grids. Future Gener. Comput. Syst. 28(2), 337–349 (2012)
Andreeva, J., Campana, S., Fanzago, F., Herrala, J.: High-energy physics on the grid: the ATLAS and CMS experience. Int. J. Grid Comput. 6(1), 3–13 (2008)
Antony, R., Peter, D.: Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility. Operating Syst. Rev. 35(5), 188–201 (2001)
Bell, W.H., Cameron, D.G., Capozza, L., Millar, A.P., Stockinger, K., Zini, F.: OptorSim: A grid simulator for studying dynamic data replication strategies. Int. J. High Perform. Comput. Appl. 17(4), 403–416 (2003)
Ben Charrada, F., Ounelli, H., Chettaoui, H.: An efficient replica placement strategy in highly dynamic data grids. Int. J. Grid Util. Comput. 2(2), 156–163 (2011)
Berger, M., Fahringer, T.: Practical experience from porting and executing the Wien2k application on the EGEE production grid infrastructure. Int. J. Grid Comput. 8(2), 261–279 (2010)
Cameron, D.G.: Replica Management and Optimisation for Data Grids. Ph.D. thesis, University of Glasgow, Scotland (2004)
Challal, Z., Tebibel, B.T.: A priori replica placement strategy in data grid. In: Proceedings of the International Conference on Machine and Web Intelligence, pp. 402–406 (2010)
Dabek, F., Li, J., Sit, E., Robertson, J., Kaashoek, M.F., Morris, R.: Designing a DHT for low latency and high throughput. In: Proceedings of the 1st Symposium on Networked Systems Design and Implementation, pp. 85–98 (2004)
EGEE: Enable Grids for E-science. http://www.eu-egee-orgwebcernch/eu-egee-org/indexhtml. Accessed Sep 5 2016
Fadaie, Z., Rahmani, A.M.: A new replica placement algorithm in data grid. Int. J. Comput. Sci. Issues 9(2), 491–507 (2012)
Grace, R.K., Manimegalai, R.: Dynamic replica placement and selection strategies in data grids - A comprehensive survey. Int. J. Parallel Distrib. Comput. 74, 2099–2108 (2014)
Grace, R.K., Priya, S.S., Surya, S.: A survey on grid simulators. Int. J. Comput. Sci. Inf. Technol. Secur. 2(6), 1224–1230 (2012)
Hamdeni, C., Hamrouni, T., Ben Charrada, F.: New evaluation criterion of file replicas placement for replication strategies in data grids. In: Proceedings of the 9th IEEE International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, pp. 1–8 (2014)
Hamrouni, T., Hamdeni, C., Ben Charrada, F.: Impact of the distribution quality of file replicas on replication strategies. J. Network Comput. Appl. 56(3), 60–76 (2015)
Hamrouni, T., Hamdeni, C., Ben Charrada, F.: Placement strategies in data grids. Multiagent Grid Syst. 11(2), 95–105 (2015)
Hamrouni, T., Hamdeni, C., Ben Charrada, F.: Objective assessment of the performance of data grid replication strategies based on distribution quality. Int. J. Web Eng. Technol. 11(1), 3–28 (2016)
Lei, M., Vrbsky, S.V.: A data replication strategy to increase data availability in data grids. In: Proceedings of the 2006 International Conference on Grid Computing and Applications, pp. 221–227 (2006)
Lei, M., Vrbsky, S.V., Hong, X.: An on-line replication strategy to increase availability in data grids. Future Gener. Comput. Syst. 24(2), 85–98 (2008)
Lin, Y.F., Liu, P., Wu, J.: Optimal placement of replicas in data grid environments with locality assurance. In: Proceedings of the 12th International Conference on Parallel and Distributed Systems, pp. 465–474 (2006)
Liu, P., Wu, J.: Optimal replica placement strategy for hierarchical data grid systems. In: Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid, pp. 4–420 (2006)
Manghui, T.: Secure data objects replication in data grid dependable and secure computing. IEEE Comput. Soc. 7, 50–64 (2010)
Mansouri, N.: QDR: a QoS-aware data replication algorithm for data grids considering security factors. Cluster Comput. 19, 1–17 (2016)
Maranzana, F.E.: On the location of supply points to minimize transport costs. J. Oper. Res. Soc. 15, 261–270 (1964)
McClain, W., Vallado, D.: Fundamentals of Astrodynamics and Applications. Space Technology Library
Meroufel, B., Belalem, G.: Managing data replication and placement based on availability. AASRI Procedia 5, 147–155 (2013)
Navaz, A.S.S., Prabhadevi, C., Sangeetha, V.: Data grid concepts for data security in distributed computing. Int. J. Comput. Appl. 61(13), 6–11 (2013)
Navimipour, N.J., Rahmani, A.M., Navin, A.H., Hosseinzadeh, M.: Resource discovery mechanisms in grid systems: A survey. Int. J. Network Comput. Appl. 41, 389–410 (2014)
On, G.: Quality of Availability for Widely Distributed and Replicated Content Stores. Ph.D. thesis, University of Darmstadt, Germany (2004)
Pace, A., Quema, V., Schiavoni, V.: Exploiting node connection regularity for DHT replication. In: Proceedings of the 30th IEEE Symposium on Reliable Distributed Systems, pp. 111–120 (2011)
Park, S., Kim, J., Ko, Y., Yoon, W.: Dynamic data grid replication strategy based on internet hierarchy. Grid Coop. Comput. Lect. Notes Comput. Sci. 3033, 838–846 (2004)
Piton, I.: Marketing course. http://www.marketingthusch/loaderphp?page=Piton-IV. Accessed Sept 5 2016
Rahman, R.M., Barker, K., Alhajj, R.: Replica placement strategies in data grid. J. Grid Comput. 6(1), 103–123 (2008)
Ranganathan, K., Foster, I.: Identifying dynamic replication strategies for a high performance data grid. In: Proceeding of the 2nd International Workshop on Grid Computing, pp. 75–86 (2001)
Reddy, S.C., Prasad, P.V.N., Laxmi, A.J.: Optimal number and location of DGs to improve power quality of distribution system using particle swarm optimization. Int. J. Eng. Res. Appl. 2, 3077–3082 (2012)
Saadat, N., Rahmani, A.M.: PDDRA: A new pre-fetching based dynamic data replication algorithm in data grids. Future Gener. Comput. Syst. 28(4), 666–681 (2012)
Shi, Y.: Application of multi-barycenter method to Chinas iron and steel industry. Master’s thesis, Tianjin University, China (2007)
Shorfuzzaman, M., Graham, P., Eskicioglu, R.: Distributed placement of replicas in hierarchical data grids with user and system QoS constraints. In: Proceedings of the International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, pp. 177–186 (2011)
Simon, V., Monnet, S., Feuillet, M., Robert, P., Sens, P.: SPLAD: scattering and placing data replicas to enhance long-term durability. Research Report RR-8533 (2014)
Suri, P.K., Singh, M.: DR2: A two-stage dynamic replication strategy for data grid. Int. J. Recent Trends Eng. 2(4), 201–203 (2009)
Thamarai Selvi, S., Kumar, R., Balachandar, R.A., Balakrishnan, P., Rajendar, K.: Trust based grid scheduling algorithm for commercial grids. Int. Conf. Comput. Intell. Multimedia Appl. 1, 545–558 (2007)
Tian, T., Luo, J., Wu, Z., Song, A.: A pre-fetching-based replication algorithm in data grid. In: Proceedings of the 3th International Conference on Pervasive Computing and Applications, pp. 526–531 (2008)
Tu, M., Ma, H., Xiao, L., Yen, I.L., Bastani, F.B., Xu, D.: Data placement in P2P data grids considering the availability, security, access performance and load balancing. Int. J. Grid Comput. 11(1), 103–127 (2013)
Webster, R., Barnes, D.: Future of astronomy: enormous datasets, massive computing, innovative instrumentation. Massive Comput. 4, 931–979 (2011)
Zhang, Z., Zhang, C., Zuo, M., Wang, Z.: Dynamic data grid replication algorithm based on weight and cost of replica. Indonesian J. Electr. Eng. 12(4), 2860–2867 (2014)
Acknowledgements
We are grateful to the anonymous reviewers for their helpful comments and suggestions. We would like to express our sincere thanks to A.M. Rahmani and N. Saadat for their collaboration and for providing us with the source codes of PDDRA and PRA. We also thank H. Chettaoui for providing us the implementation of PO as well as an implementation of DR2.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Hamdeni, C., Hamrouni, T., Ben Charrada, F. (2016). Barycenter-Based Placement Strategy Towards Improving Replicas Distribution Quality. In: Debruyne, C., et al. On the Move to Meaningful Internet Systems: OTM 2016 Conferences. OTM 2016. Lecture Notes in Computer Science(), vol 10033. Springer, Cham. https://doi.org/10.1007/978-3-319-48472-3_34
Download citation
DOI: https://doi.org/10.1007/978-3-319-48472-3_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48471-6
Online ISBN: 978-3-319-48472-3
eBook Packages: Computer ScienceComputer Science (R0)