Abstract
The rapid advancement of network technology has changed the way the world operates and has produced a large number of network application services for users. In order to provide more convenient services, the network service providers need to provide a more stable and high-capacity system. Therefore, cloud computing technology has been developed in the recent decade. The network service providers can reduce the cost related to the cloud computing services by using the virtualizing and data replicating techniques. Besides, an efficient information duplication strategy is necessary to reduce the workload and enhance the ability of the system. Therefore, a three-phase Dynamic Data Replication Algorithms (DDRA) for deploying the resources has been proposed in this paper to improve the efficiency of the information duplication under the cloud storage system. For the first two phases, the proposed algorithm is designed to determine the suitable service nodes to achieve the balance of workload according to the service nodes’ workloads. In the third phase, a dynamic duplication deployment scheme has been designed to achieve the higher access performance and better load balancing between service nodes for the overall environment. As a result, the proposed algorithm can enhance the availability, access efficiency and load balancing under the hierarchical cloud computing environment.
Similar content being viewed by others
References
Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I., Zaharia, M: A view of cloud computing. Commun. ACM 53(4), 50–58 (2010)
Buyya, R., Ranjan, R., Calheiros, R.N.: Modeling and simulation of scalable cloud computing environments and the CloudSim toolkit: Challenges and opportunities. In: International Conference on High Performance Computing &, Simulation, pp. 1–11 (2009)
Gunarathne, T., Wu, T.L., Qiu, J., Fox, G.: MapReduce in the clouds for science. In: IEEE Second International Conference on Cloud Computing Technology and Science (CloudCom), pp. 565–572 (2010)
Almuttairi, R.M., Wankar, R., Negi, A., Chillarige, R.R., Almahna, M.S.: New replica selection technique for binding replica sites in data grids. In: International Conference on Energy, Power and Control (EPC-IQ), pp. 187–194 (2010)
Chang, R.S., Chang, H.P., Wang, Y.T.: A dynamic weighted data replication strategy in data grids. In: IEEE/ACS International Conference on Computer Systems and Applications (AICCSA), pp. 414–421 (2008)
Li, W, Yang, Y, Chen, J, Yuan, D.: A Cost-effective mechanism for cloud data reliability management based on proactive replica checking. In: IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pp. 564–571 (2012)
Mansouri, Y., Monsefi, R.: Optimal number of replicas with QoS assurance in data grid environment. In: Second Asia International Conference on Modeling &, Simulation(AICMS), pp.168–173 (2008)
Ranganathann, K., Foster, I.: Identifying dynamic replication strategies for a high-performance data grid. In: Proceedings of the Second International Workshop on Grid Computing, pp. 75–86 (2001)
Qaisar, E.J.: Introduction to cloud computing for developers: Key concepts, the players and their offerings. In: Information Technology Professional Conference (TCF Pro IT), pp. 1–6 (2012)
Mell, P., Grance, T.: The NIST Definition of Cloud Computing Technical report (2011)
Muthunagai, S.U., Karthic, C.D., Sujatha, S: Efficient access of Cloud Resources through virtualization techniques. In: International Conference on Recent Trends In Information Technology (ICRTIT), pp. 174–178 (2012)
Kertesz, A., Kecskemeti, G., Oriol, M., Kotcauer, P., Ace, S., Rodriguez, M., Merce, O., Marosi, A.Cs., Marco, J., Franch, X.: Enhancing federated cloud management with an integrated service monitoring approach. J. Grid Comput. 11(4), 699720 (2013)
Ghemawat, S., Gobioff, H., Leung, S.T.: The Google file system Proceedings of the nineteenth. In: ACM Symposium on Operating Systems Principles, pp. 29–43 (2003)
Yu, H., Xiqian, G, Lan, Y, Ren, H, Chen, Y: The research about file-level trace technology. In: International Conference on Computational and Information Sciences (ICCIS), pp. 1131–1134 (2012)
Qingsong, W., Veeravalli, B., Gong, B, Zeng, L, Feng, D: CDRM: a cost-effective dynamic replication management scheme for cloud storage cluster. In: IEEE International Conference on Cluster Computing (CLUSTER), pp 188–196 (2010)
Cai, B, Xie, C., Zhu, G: EDRFS: An effective distributed replication file system for small-file and data-intensive application. In: International Conference on Communication Systems Software and Middleware, pp. 1–7 (2007)
Kaviani, N., Wohlstadter, E, Lea, R: MANTICORE: a framework for partitioning software services for hybrid cloud. In: IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom), pp. 333–340 (2012)
Rasool, Q., Li, J, Zhang, G.S., Yang, S, Oreku, D: A load balancing replica placement strategy in data grid. In: Third International Conference on Digital Information Management ICDIM, pp. 751–756 (2008)
Sun, Y, Luo, X, Wang, Z, Wang, B, Zhang, Y, Chen, H, Li, X: Fast live cloning of virtual machine based on Xen. In: IEEE International Conference on High Performance Computing and Communications, pp. 392–399 (2009)
Khezr, SN., Navimipour, N.J.: MapReduce and its applications challenges, and architecture: a comprehensive review and directions for future research. J. Grid Comput. 15(3), 295–321 (2017)
Tran, X.T., Do, T.V., Rotter, C., Hwang, D.: A new data layout scheme for energy-efficient MapReduce processing tasks. J. Grid Comput., 1–14 (2018)
Liu, Y., Li, Y, Xiong, L., Zhu, N, Xu, L., Yang, K.T.: The resource locating strategy based on sub-domain hybrid P2P network model. In: IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), pp. 1–8 (2010)
Wang, N., Xu, D: Resource summary for pay-as-you-go dataspace systems. In: 9th International Conference on Signal Processing, pp. 2842–2845 (2008)
Li, N., Zhang, L.J., Xu, P, Wang, L., Zheng, J, Guo, Y.: Research on pricing model of cloud storage. In: IEEE Ninth World Congress on Services (SERVICES), pp. 412–419 (2013)
Huang, Z., Peng, Y., Lin, Y.: Reducing service cost based on the skewness of data popularity for cloud storage systems. In: IEEE 7th International Symposium on Service Oriented System Engineering (SOSE), pp. 373–378 (2013)
Apache Hadoop Wiki, http://en.wikipedia.org/wiki/ApacheHadoop [accessed. 17. 2.25]
The Apache Software Foundation Hadoop, http://hadoop.apache.org/core/ [accessed. 2016. 12. 20]
Computer Programming Projects, http://programming-project.blogspot.tw/2014/04/general-architecture-of-google-file.html [accessed. 2018. 3. 1]
Narayan, S., Bailey, S., Daga, A.: Hadoop acceleration in an OpenFlow based cluster. Companion High Perform. Comput. Network. Storage Anal. (SCC), 535–538 (2012)
Bjorkqvist, M., Chen, L.Y., Binder, W.: Optimizing service replication in clouds. In: Proceedings of the Winter Simulation Conference (WSC) 2011, pp. 3307–3317 (2011)
Jeon, M., Lim, K.H., Ahn, H., Lee, B.D.: Dynamic data replication scheme in the cloud computing environment. In: Second Symposium on Network Cloud Computing and Applications (NCCA), pp. 40–47 (2012)
Kamali, S., Ghodsnia, P., Daudjee, K: Dynamic data allocation with replication in distributed systems. In: IEEE 30th International Performance Computing and Communications Conference (IPCCC), pp. 1–8 (2011)
Li, W., Yang, Y, Yuan, D.: A novel cost-effective dynamic data replication strategy for reliability in cloud data centres. In: IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing (DASC), pp. 496–502 (2011)
Myint, J., Naing, T.T.: A data placement algorithm with binary weighted tree on PC cluster-based cloud storage system. In: International Conference on Cloud and Service Computing (CSC), pp. 315–320 (2011)
Vlassov, V., Li, D., Haridi, K.S.: A scalable autonomous replica management framework for grids. In: IEEE John Vincent Atanasoff Popov International Symposium on Modern Computing 2006, pp. 33–40 (2006)
Xu, C., Huang, X., Yang, G., Zhou, Y.: A two-layered replica management method. In: IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom) 2011, pp. 14311436
Rahmani, A.M., Azari, L., Daniel, H.A.: A file group data replication algorithm for data grids. J. Grid Comput. 15(3), 379–393 (2017)
Wu, S, Chen, G, Gao, T., Xu, L, Song, C: Replica pre-adjustment strategy based on trend analysis of file popularity within cloud environment. In: International Conference on Computer and Information Technology (CIT), pp. 219–223 (2012)
Berger, A.W.: Comparison of call gapping and percent Blocking for overload control in distributed switching systems and telecommunications networks. IEEE Trans Commun, 574–580 (1991)
Chang, R.S., Guo, M.H., Lin, H.C.: A multiple parallel downlaod scheme with server throughput and client bandwidth considerations for data grids. Fut. Gen. Comput. Syst. 24(8), 798–805 (2008)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Hsieh, HC., Chiang, ML. The Incremental Load Balance Cloud Algorithm by Using Dynamic Data Deployment. J Grid Computing 17, 553–575 (2019). https://doi.org/10.1007/s10723-019-09474-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10723-019-09474-2