Skip to main content

A High-Performance Adaptive Strategy of Container Checkpoint Based on Pre-replication

  • Conference paper
  • First Online:
Security, Privacy, and Anonymity in Computation, Communication, and Storage (SpaCCS 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11342))

Abstract

During the implementation of the container checkpoint strategy, checkpoint downtime is a pivotal performance indicator. Shorter downtime is especially important for systems that provide critical services. To reduce the checkpoint downtime, an adaptive pre-replication checkpoint strategy named APR-CKPOT is proposed in this paper. Through several rounds of pre-replication, the infrequently modified container memory pages are preferentially copied. The dirty pages generated in the previous round of Pre-Replication are saved in each round of pre-replication. The number of pre-replication checkpoints is adaptively determined by the workload of the user’s operating system in the container. The coordination between fault-tolerance service capabilities and performance of the container can be achieved, and the downtime of the checkpoint can be reduced, which is verified by the given experimental results based on Docker container system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. James, T.: The Docker Book: containerization is the new virtualization, pp. 10–20 (2014). http://www.dockerbook.com/. Accessed 22 Apr 2015

  2. Siozios, K., Soudris, D., Hübner, M.: A framework for supporting adaptive fault-tolerant solutions. ACM Trans. Embed. Comput. Syst. 13(5s), 1–22 (2014)

    Article  Google Scholar 

  3. Bernstein, D.: Containers and cloud: from LXC to Docker to Kubernetes. Cloud Comput. 1(3), 81–84 (2015)

    Article  Google Scholar 

  4. Yang, C.T., Liu, J.C., Hsu, C.H., et al.: On improvement of cloud virtual machine availability with virtualization fault tolerance mechanism. In: IEEE Third International Conference on Cloud Computing Technology and Science, pp. 122–129. IEEE (2013)

    Google Scholar 

  5. Lillibridge, M., Kave, E., Deepavali, B.: Improving restore speed for backup systems that use inline chunk-based deduplication. In: Proceedings of the 11th USENIX Conference on File and Storage Technologies, pp. 183–197. USENIX Conference (2013)

    Google Scholar 

  6. Pradhan, S., Gokhale, A., Otte, W.R., et al.: Real-time fault tolerant deployment and configuration framework for cyber physical systems. ACM SIGBED Rev. 10(2), 32 (2013)

    Article  Google Scholar 

  7. LXC-checkpoint [EB/OL]. http://lxc.sourceforge.net/man/lxc-checkpoint.html

  8. Burns, B., Grant, B., Oppenheimer, D., et al.: Borg, Omega, and Kubernetes. Queue 14(1), 10–34 (2016)

    Article  Google Scholar 

  9. LXC-checkpoint. http://lxc.sourceforge.net/man/lxc-checkpoint.html. Accessed 22 Apr 2015

  10. Liu, Q., Jung, C., Lee, D., et al.: Compiler-directed lightweight checkpointing for fine-grained guaranteed soft error recovery. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 228–239 (2017)

    Google Scholar 

  11. Lin, J.C., Leu, F.Y., Chen, Y.P.: Analyzing job completion reliability and job energy consumption for a heterogeneous MapReduce cluster under different intermediate-data replication policies. J. Supercomput. 71(5), 1657–1677 (2015)

    Article  Google Scholar 

  12. Dinh, T., Barkataki, S.: Distributed container: a design pattern for fault tolerance and high-speed data exchange. ACM SIGAda Ada Lett. 29(3), 115–118 (2009)

    Article  Google Scholar 

  13. Shao, Y., Zhu, X., Bao, W., et al.: CHIME: a checkpoint-based approach to improving the performance of shared clusters. In: International Conference on Parallel and Distributed Systems, pp. 1007–1014. IEEE (2017)

    Google Scholar 

  14. Xu, F., Liu, F.M., Liu, L.H., Jin, H., Li, B., Li, B.C.: iAware: making live migration of virtual machines interference-aware in the cloud. IEEE Trans. Comput. 63(12), 3012–3025 (2014)

    Article  MathSciNet  Google Scholar 

  15. Piao, G.Y., Oh, Y.G., Sung, B., Park, C.: Efficient pre-replication live migration with memory compaction and adaptive vm downtime control. In: Proceedings of IEEE 4th International Conference on Big Data and Cloud Computing, pp. 85–90. IEEE (2014)

    Google Scholar 

  16. Louati, T., Abbes, H., Cérin, C., et al.: LXCloud-CR: towards LinuX containers distributed hash table based checkpoint-restart. J. Parallel Distrib. Comput. 12(3), 12–16 (2017)

    Google Scholar 

  17. Beloglazov, A., Buyya, R.: OpenStack Neat: a framework for dynamic and energy-efficient consolidation of virtual machines in OpenStack clouds. Concurr. Comput. Pract. Exp. 27(5), 1310–1333 (2015)

    Article  Google Scholar 

  18. Yamato, Y., Katsuragi, S., Nagao, S., et al.: Software maintenance evaluation of agile software development method based on OpenStack. IEICE Trans. Inf. Syst. E98.D(7), 1377–1380 (2015)

    Article  Google Scholar 

  19. Regola, N., Ducom, J.C.: Recommendations for virtualization technologies in high performance computing. In: Proceedings of 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp. 409–416. IEEE (2010)

    Google Scholar 

  20. Li, C., Xi, S., Lu, C., et al.: Prioritizing soft real-time network traffic in virtualized hosts based on Xen. In: IEEE Real-Time and Embedded Technology and Applications Symposium, pp. 145–156. IEEE (2015)

    Google Scholar 

  21. Chi, X., Liu, B., Niu, Q., et al.: Web load balance and cache optimization design based Nginx under high-concurrency environment. In: Third International Conference on Digital Manufacturing and Automation, pp. 1029–1032. IEEE (2012)

    Google Scholar 

Download references

Acknowledgements

This work is supported by the Natural Science Foundation of China (No. 61762008), the Natural Science Foundation Project of Guangxi (No. 2017GXNSFAA198141), the Key R&D project of Guangxi (No. AB17195014).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ningjiang Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, S., Chen, N., Zhang, H., Xue, Y., Huang, R. (2018). A High-Performance Adaptive Strategy of Container Checkpoint Based on Pre-replication. In: Wang, G., Chen, J., Yang, L. (eds) Security, Privacy, and Anonymity in Computation, Communication, and Storage. SpaCCS 2018. Lecture Notes in Computer Science(), vol 11342. Springer, Cham. https://doi.org/10.1007/978-3-030-05345-1_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-05345-1_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-05344-4

  • Online ISBN: 978-3-030-05345-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics