skip to main content
10.1145/3416921.3416926acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccbdcConference Proceedingsconference-collections
research-article

MERP: A Multi-index Evaluation Replication Placement Strategy for Cloud Storage Cluster

Authors Info & Claims
Published:24 September 2020Publication History

ABSTRACT

With the advent of the big data era, the research focuses on how to enhance the reliability, availability and high performance of the cloud storage system. Aiming to cope with extensive data storage, a replication placement strategy based on rack-awareness is applied in Hadoop Distributed File System (HDFS). Without synthetically considering the heterogeneity of cloud storage cluster and the load differences of each service node, however, the emergence of load imbalance is inevitable in HDFS. Focusing on the deficiency of the default replication placement method of HDFS, a multi-index evaluation replication placement scheme referred to as MERP is proposed in this paper. MERP takes a holistic view of the load characteristic, hardware performance and network topological distance of each datanode and leverages a combination-weighting TOPSIS model to comprehensively evaluate candidate datanodes and select the optimal one for replication placement. The simulation results conclusively demonstrated that our MERP outperforms the default replication placement of HDFS in terms of load balancing for distributed cloud storage cluster.

References

  1. Wei, Q., Veeravalli, B., Gong, B., Zeng, L. and Feng, D. 2010. CDRM: A Cost-Effective Dynamic Replication Management Scheme for Cloud Storage Cluster. Proceedings of the 2010 IEEE International Conference on Cluster Computing, Heraklion, Crete, Greece, 20-24 September, 2010. IEEE.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Shvachko, K., Kuang, H., Radia, S. and Chansler, R. 2010. The Hadoop Distributed File System. 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST). IEEE.Google ScholarGoogle Scholar
  3. Lin, C. Y. and Lin, Y. C. 2015. A Load-Balancing Algorithm for Hadoop Distributed File System. International Conference on Network-based Information Systems. IEEE. Tavel, P. 2007. Modeling and Simulation Design. AK Peters Ltd., Natick, MA.Google ScholarGoogle Scholar
  4. Jafarnejad Ghomi, E., Masoud Rahmani, A. and Nasih Qader, N. 2017. Load-balancing algorithms in cloud computing: a survey. Journal of Network and Computer Applications, S1084804517301480.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Chung, H. Y., Chang, C. W., Hsiao, H. C. and Chao, Y. C. 2012. The Load Rebalancing Problem in Distributed File Systems. Cluster Computing (CLUSTER), 2012 IEEE International Conference on. IEEE.Google ScholarGoogle Scholar
  6. Kun, L. and Wen-Liang, N. 2013. An improved data load balancing algorithm for Hadoop. Journal of Henan Polytechnic University (Natural Science).Google ScholarGoogle Scholar
  7. Hsiao, H. C., Chung, H. Y., Shen, H. and Chao, Y. C. 2013. Load rebalancing for distributed file systems in clouds. IEEE Transactions on Parallel and Distributed Systems, 24(5), 951--962.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Nishanth, S., Radhikaa, B., Ragavendar, T. J., Babu, C. and Prabavathy, B. 2013. CoHadoop++: A load balanced data co-location in Hadoop Distributed File System. 2013 Fifth International Conference on Advanced Computing (ICoAC). IEEE.Google ScholarGoogle Scholar
  9. Khaneghah, E. M., Mirtaheri, S. L., Grandinetti, L., Memaripour, A. S. and Sharifi, M. 2013. A Dynamic Replication Mechanism to Reduce Response-Time of I/O Operations in High Performance Computing Clusters. International Conference on Social Computing. IEEE Computer Society.Google ScholarGoogle Scholar
  10. Long, S. Q., Zhao, Y. L. and Chen, W. 2014. Morm: a multi-objective optimized replication management strategy for cloud storage cluster. Journal of Systems Architecture, 60(2), 234--244.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. MERP: A Multi-index Evaluation Replication Placement Strategy for Cloud Storage Cluster

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICCBDC '20: Proceedings of the 2020 4th International Conference on Cloud and Big Data Computing
        August 2020
        130 pages
        ISBN:9781450375382
        DOI:10.1145/3416921

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 24 September 2020

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited
      • Article Metrics

        • Downloads (Last 12 months)4
        • Downloads (Last 6 weeks)0

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader