skip to main content
10.1145/2731186.2731190acmconferencesArticle/Chapter ViewAbstractPublication PagesveeConference Proceedingsconference-collections
research-article

PARS: A Page-Aware Replication System for Efficiently Storing Virtual Machine Snapshots

Published:14 March 2015Publication History

ABSTRACT

Virtual machine (VM) snapshot enhances the system availability by saving the running state into stable storage during failure-free execution and rolling back to the snapshot point upon failures. Unfortunately, the snapshot state may be lost due to disk failures, so that the VM fails to be recovered. The popular distributed file systems employ replication technique to tolerate disk failures by placing redundant copies across disperse disks. However, unless user-specific personalization is provided, these systems consider the data in the file as of same importance and create identical copies of the entire file, leading to non-trivial additional storage overhead.

This paper proposes a page-aware replication system (PARS) to store VM snapshots efficiently. PARS employs VM introspection technique to explore how a page is used by guest, and classifies the pages by their importance to system execution. If a page is critical, PARS replicates it multiple copies to ensure high availability and long-term durability. Otherwise, the loss of this page causes no harm for system to work properly, PARS therefore saves only one copy of the page. Consequently, PARS improves storage efficiency without compromising availability. We have implemented PARS to justify its practicality. The experimental results demonstrate that PARS achieves 53.9% space saving compared to the native replication approach in HDFS which replicates the whole snapshot file fully and identically.

References

  1. National center for biotechnology information. ftp://ftp.ncbi.nih.gov.Google ScholarGoogle Scholar
  2. Elasticsearch. http://www.elasticsearch.org/.Google ScholarGoogle Scholar
  3. Hdfs. http://hadoop.apache.org/.Google ScholarGoogle Scholar
  4. Mummer. http://mummer.sourceforge.net/.Google ScholarGoogle Scholar
  5. Mysql. http://www.mysql.com/.Google ScholarGoogle Scholar
  6. Sysbench. http://sysbench.sourceforge.net/.Google ScholarGoogle Scholar
  7. Weibo. http://weibo.com.Google ScholarGoogle Scholar
  8. libhdfs. http://hadoop.apache.org/docs/r1.2.1/libhdfs.html.Google ScholarGoogle Scholar
  9. Salesforce, 1999. http://www.salesforce.com.Google ScholarGoogle Scholar
  10. Using the snapshot, 2003. https://www.vmware.com/support/ws4/doc/preserve_snapshot_ws.html.Google ScholarGoogle Scholar
  11. readahead, 2005. https://lwn.net/Articles/155510/.Google ScholarGoogle Scholar
  12. Amazon ec2, 2006. http://aws.amazon.com/ec2/.Google ScholarGoogle Scholar
  13. Superfetch, 2007. http://en.wikipedia.org/wiki/Windows_Vista_I/O_technologies.Google ScholarGoogle Scholar
  14. S. Akiyama, T. Hirofuchi, R. Takano, and S. Honiden. Fast wide area live migration with a low overhead through page cache teleportation. In Proceedings of CCGrid, pages 78--82, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. N. Amit, D. Tsafrir, and A. Schuster. Vswapper: A memory swapper for virtualized environments. In Proceedings of ASPLOS, pages 349--366, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. B. Calder, J. Wang, A. Ogus, and N. N. et al. Windows azure storage: A highly available cloud storage service with strong consistency. In Proceedings of SOSP, pages 143--157, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J.-H. Chiang, H.-L. Li, and T. cker Chiueh. Introspectionbased memory de-duplication and migration. In Proceedings of VEE, pages 51--62, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Christodorescu, R. Sailer, D. L. Schales, D. Sgandurra, and D. Zamboni. Cloud security is not (just) virtualization security: A short paper. In Proceedings of the ACM Workshop on Cloud Computing Security, pages 97--102, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. C. Corbett, J. Dean, and M. E. et al. Spanner: Google's globally-distributed database. In Proceedings of OSDI, pages 251--264, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. L. Cui, B. Li, Y. Zhang, and J. Li. Hotsnap: A hot distributed snapshot system for virtual machine cluster. In Proceedings of USENIX LISA, pages 59--73, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. L. Cui, J. Li, B. Li, and et al. Vmscatter: Migrate virtual machines to many hosts. In Proceedings of VEE, pages 63--72, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. U. Deshpande, X. Wang, and K. Gopalan. Live gang migration of virtual machines. In Proceedings of HPDC, pages 135--146, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. C. Dubnicki, L. Gryz, L. Heldt, and M. Kaczmarczyk. Hydrastor: A scalable secondary storage. In Proceedings of FAST, pages 197--210, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. D. Ford, F. Labelle, F. I. Popovici, M. Stokely, V.-A. Truong, L. Barroso, C. Grimes, and S.Quinlan. Availability in globally distributed storage systems. In Proceedings of OSDI, pages 1--14, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. T. Garfinkel and M. Rosenblum. A virtual machine introspection based architecture for intrusion detection. In Proceedings of NDSS, pages 191--206, 2003.Google ScholarGoogle Scholar
  26. S. Ghemawat, H. Gobioff, and S.-T. Leung. The google file system. In Proceedings of SOSP, pages 29--43, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. D. Harnik, O. Margalit, D. Naor, D. Sotnikov, and G. Vernik. Estimation of deduplication ratios in large data sets. In Proceedings of MSST, pages 1--11, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  28. M. Hibler, R. Ricci, L. Stoller, J. Duerig, S. Guruprasad, T. Stack, K. Webb, and J. Lepreau. Large-scale virtualization in the emulab network testbed. In USENIX Annual Technical Conference, pages 113--128, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. X. Jiang and D. Xu. Violin: Virtual internetworking on overlay infrastructure. In Parallel and Distributed Processing and Applications, pages 937--946, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. H. Jin, L. Deng, and S. Wu. Live virtual machine migration with adaptive memory compression. In Proceedings of CLUSTER, pages 1--10, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  31. S. T. Jones, A. C. Arpaci-Dusseau, and R. H. Arpaci-Dusseau. Geiger: Monitoring the buffer cache in a virtual machine environment. In Proceedings of ASPLOS, pages 14--24, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. A. Kangarlou, P. Eugster, and D. Xu. Vnsnap: Taking snapshots of virtual networked environments with minimal downtime. In Proceedings of DSN, pages 87--98, 2011.Google ScholarGoogle Scholar
  33. A. Kivity, Y. Kamay, D. Laor, U. Lublin, and A. Liguori. Kvm: the linux virtual machine monitor. In Proceedings of the Linux Symposium, pages 225--230, 2007.Google ScholarGoogle Scholar
  34. J. F. Kloster, J. Kristensen, and A.Mejlholm. Determining the use of interdomain shareable pages using kernel introspection. Technical report, Aalborg University, 2007.Google ScholarGoogle Scholar
  35. G. Kola, T. Kosar, and M. Livny. Faults in large distributed systems and what we can do about them. In Proceedings of Euro-Par, pages 442--453, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. R. Kotla, L. Alvisi, and M. Dahlin. Safestore: A durable and practical storage system. In Proceedings of ATC, pages 127--142, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. A. Koto, H. Yamada, K. Ohmura, and K. Kono. Towards unobtrusive vm live migration for cloud computing platforms. In Proceedings of APSys, pages 1--6, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. J. Kubiatowicz, D. Bindel, Y. Chen, and S. C. et al. Oceanstore: An architecture for global-scale persistent storage. In Proceedings of ASPLOS, pages 190--201, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. X. Li, M. Lillibridge, and M. Uysal. Reliability analysis of deduplicated and erasure-coded storage. Proceedings of SIGMETRICS Performance Evaluation Review, 38(3):4--9, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. C. Liu, D. Ju, Y. Gu, Y. Zhang, D. Wang, and D. H. Du. Semantic data de-duplication for archival storage systems. In Proceedings of ACSAC, pages 1--9, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  41. C. Marshall. Efficient and safe data backup with arrow. Technical report, 2008.Google ScholarGoogle Scholar
  42. P. Nath, M. A. Kozuch, D. R. OHallaron, and J. Harkes. Design tradeoffs in applying content addressable storage to enterprise-scale systems based on virtual machines. In Proceedings of ATC, pages 71--84, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. E. Park, B. Egger, and J. Lee. Fast and space-efficient virtual machine checkpointing. In Proceedings of VEE, pages 75--85, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. M. R, Hines, and K. Gopalan. Post-copy based live virtual machine migration using adaptive pre-paging and dynamic self-ballooning. In Proceedings of VEE, pages 51--60, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. R. Rodrigues and B. Liskov. High availability in dht erasureerasure code vs replication. In Proceedings of IPTPS, pages 226--239, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. B. Schroeder and G. A. Gibson. Understanding failures in petascale computers. Journal of Physics, 78:1--11, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  47. B. Schroeder and G. A. Gibson. Disk failures in the real world what does an mttf of 1,000,000 hours mean to you. In Proceedings of FAST, pages 1--16, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. P. Stuedi, B.Metzler, and A. Trivedi. jverbs: Ultra-low latency for data center applications. In Proceedings of ACM SoCC, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. M. H. Sun and D. M. Blough. Fast, lightweight virtual machine checkpointing. Technical report, 2010.Google ScholarGoogle Scholar
  50. Y. Xu, M. Bailey, B. Noble, and F. Jahanian. Small is better: Avoiding latency traps in virtualized data. In Proceedings of ACM SoCC, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. L. L. You, K. T. Pollack, and D. D. E. Long. Deep store: An archival storage system architecture. In Proceedings of ICDE, pages 804--815, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. I. Zhang, A. Garthwaite, Y. Baskakov, and K. C. Barr. Fast restore of checkpointed memory using working set estimation. In Proceedings of VEE, pages 534--533, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. I. Zhang, T. Denniston, Y. Baskakov, and A. Garthwaite. Optimizing vm checkpointing for restore performance in VMware esxi. In Proceedings of USENIX ATC, pages 1--12, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. W. Zhang, H. Tang, H. Jiang, T. Yang, X. Li, and Y. Zeng. Multi-level selective deduplication for vm snapshots in cloud storage. In Proceedings of Cloud, pages 550--557, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. B. Zhu, K. Li, and H. Patterson. Avoiding the disk bottleneck in the data domain deduplication file system. In Proceedings of FAST, pages 1--14, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. PARS: A Page-Aware Replication System for Efficiently Storing Virtual Machine Snapshots

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      VEE '15: Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments
      March 2015
      238 pages
      ISBN:9781450334501
      DOI:10.1145/2731186
      • cover image ACM SIGPLAN Notices
        ACM SIGPLAN Notices  Volume 50, Issue 7
        VEE '15
        July 2015
        221 pages
        ISSN:0362-1340
        EISSN:1558-1160
        DOI:10.1145/2817817
        • Editor:
        • Andy Gill
        Issue’s Table of Contents

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 14 March 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      VEE '15 Paper Acceptance Rate16of50submissions,32%Overall Acceptance Rate80of235submissions,34%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader