Skip to main content
Log in

A novel dynamic data replication strategy to improve access efficiency of cloud storage

  • Original Article
  • Published:
Information Systems and e-Business Management Aims and scope Submit manuscript

Abstract

Cloud computing provides on demand services to cloud users, and one among them is storage. Currently, large amount of data gets generated and demand an enormous storage. Users can avail the privilege to store their data remotely and can access them through Internet. Of course, the adoption of cloud lends the kind of storage that the user wants. Since data gets accumulated, the time it takes to store and retrieve the data is very long and difficult. Also, unfortunately the existing method of storage is to be optimized for better performance. The factors that affect the performance of cloud storage are response time, data availability and migration cost. Hence to improve these factors the data can be replicated to multiple locations. The decision on which data to be replicated, number of replicas to be created, where the replica has to be placed, management of the replicated data and the provision of optimal replica to the user are the major challenges involved in dynamic replication. We intend to propose, a novel dynamic data replication strategy with intelligent water drop (IWD) algorithm to address the challenges of replication and for the management of cloud storage. The popularity and size of the data are considered for replication. A swarm intelligence based optimization algorithm named IWD algorithm is used to optimize the process of replication and management of storage in cloud. We have compared our D2R-IWD algorithm with popular optimization techniques such as PSO, GA and found out that our methodology gives better result in terms of access efficiency for several test cases thereby improve the performance of cloud.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Baesens B (2014) Analytics in a big data world: the essential guide to data science and its applications. Wiley

  • Bai X, Jin H, Liao X, Shi X, Shao Z (2016) RTRM: response time based replica management strategy for cloud storage system. In: Proceedings of the grid and pervasive computing. Springer, pp 124–33

  • Beaver D, Kumar S, Li HC, Sobel J, Vajgel P et al (2010) Finding a needle in haystack: Facebook's photo storage. In: OSDI, vol 10. pp 1–8

  • Gantz J, Reinsel D (2012) The digital universe in 2020: Big data, bigger digital shadows, and biggest growth in the far east. IDC iView: IDC analyze the future, vol 2007. pp 1–16

  • Gill NK, Singh S (2016) A dynamic, cost-aware, optimized data replication strategy for heterogeneous cloud data centers. Future Gener Comput Syst 65:10–32

    Article  Google Scholar 

  • Grace RK, Manimegalai R (2014) Dynamic replica placement and selection strategies in data grids a comprehensive survey. J Parallel Distrib Comput 74(2):2099–2108

    Article  Google Scholar 

  • Jeon M, Lim KH, Ahn H, Lee BD (2012) Dynamic data replication scheme in the cloud computing environment. In: 2012 Second symposium on network cloud computing and applications (NCCA). IEEE, pp 40–47

  • Lee J, Chung J, Lee D (2015) Efficient data replication scheme based on hadoop distributed file system. Int J Softw Eng Appl 9(12):177–186

    Google Scholar 

  • Li W, Yang Y, Yuan D (2011) A novel cost-effective dynamic data replication strategy for reliability in cloud data centres. In: 2011 IEEE ninth international conference on dependable, autonomic and secure computing (DASC). IEEE, pp 496–502

  • Lin J-W, Chen C-H, Chang JM (2013) QoS-aware data replication for data intensive applications in cloud computing systems. IEEE Trans Cloud Comput 1(1):101–115

    Article  Google Scholar 

  • Lizhen C, Junhua Z, Lingxi Y, Yuliang S, Hui L, Dong Y (2018) A genetic algorithm based data replica placement strategy for scientific applications in clouds. IEEE Trans Serv Comput 11(4):727–739

    Article  Google Scholar 

  • Long SQ, Zhao YL, Chen W (2014) Morm: a multi-objective optimized replication management strategy for cloud storage cluster. J Syst Archit 60(2):234–244

    Article  Google Scholar 

  • Mansouri N, Javidi MM (2018) A new prefetching-aware data replication to decrease access latency in cloud environment. J Syst Softw 144:197–215

    Article  Google Scholar 

  • Milani BA, Navimipour NJ (2016) A comprehensive review of the data replication techniques in the cloud environments: major trends and future directions. J Netw Comput Appl 64:229–238

    Article  Google Scholar 

  • Nachiappan R, Javadi B, Calheiros RN, Matawie KM (2017) Cloud storage reliability for Big Data applications: a state of the art survey. J Netw Comput Appl 97:35–47

    Article  Google Scholar 

  • Niu S, Ong S, Nee AY (2013) An improved intelligent water drops algorithm for solving multi-objective job shop scheduling. Eng Appl Artif Intell 26(10):2431–2442

    Article  Google Scholar 

  • Qu Y, Xiong N (2012) RFH: a resilient fault-tolerant and high-efficient replication algorithm for distributed cloud storage. Presented at the 2012 41st international conference on parallel processing (ICPP)

  • Shah-Hosseini H (2009) Optimization with the nature-inspired intelligent water drops algorithm. In: Santos WPD (ed) Evolutionary computation, Vienna, Austria, pp 297–320

  • Sun DW, Chang GR, Gao S, Jin LZ, Wang XW (2012) Modeling a dynamic data replication strategy to increase system availability in cloud computing environments. J Comput Sci Technol 27(2):256–272

    Article  Google Scholar 

  • Wang L, Luo J, Shen J, Dong F (2013) Cost and time aware ant colony algorithm for data replica in alpha magnetic spectrometer experiment. In: 2013 IEEE international congress on big data (BigData congress). IEEE, pp 247–254

  • Wang P, Dean DJ, Gu X (2015) Understanding real world data corruptions in cloud systems. In: IEEE international conference on cloud engineering (IC2E), pp 116–125

  • Wei Q, Veeravalli B, Gong B, Zeng L, Feng D (2010) CDRM: a cost-effective dynamic replication management scheme for cloud storage cluster. In: Proceedings of the 2010 IEEE international conference on cluster computing, Heraklion, Crete, Greece, Sept. 20–24, 2010, pp 188–196

  • Wu TY, Pan JS, Lin CF (2014) Improving accessing efficiency of cloud storage using de-duplication and feedback schemes. IEEE Syst J 8(1):208–218

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sujaudeen Nannai John.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nannai John, S., Mirnalinee, T.T. A novel dynamic data replication strategy to improve access efficiency of cloud storage. Inf Syst E-Bus Manage 18, 405–426 (2020). https://doi.org/10.1007/s10257-019-00422-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10257-019-00422-x

Keywords

Navigation