research-article

PARS: A Page-Aware Replication System for Efficiently Storing Virtual Machine Snapshots

Authors:
Lei Cui

Beihang University, Beijing, China

Beihang University, Beijing, China
View Profile

,
Tianyu Wo

Beihang University, Beijing, China

Beihang University, Beijing, China
View Profile

,
Bo Li

Beihang University, Beijing, China

Beihang University, Beijing, China
View Profile

,
Jianxin Li

Beihang University, Beijing, China

Beihang University, Beijing, China
View Profile

,
Bin Shi

Beihang University, Beijing, China

Beihang University, Beijing, China
View Profile

,
Jinpeng Huai

Beihang University, Beijing, China

Beihang University, Beijing, China
View Profile

VEE '15: Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution EnvironmentsMarch 2015Pages 215–228https://doi.org/10.1145/2731186.2731190

Published:14 March 2015Publication History

VEE '15: Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments

Pages 215–228

ABSTRACT

Virtual machine (VM) snapshot enhances the system availability by saving the running state into stable storage during failure-free execution and rolling back to the snapshot point upon failures. Unfortunately, the snapshot state may be lost due to disk failures, so that the VM fails to be recovered. The popular distributed file systems employ replication technique to tolerate disk failures by placing redundant copies across disperse disks. However, unless user-specific personalization is provided, these systems consider the data in the file as of same importance and create identical copies of the entire file, leading to non-trivial additional storage overhead.

This paper proposes a page-aware replication system (PARS) to store VM snapshots efficiently. PARS employs VM introspection technique to explore how a page is used by guest, and classifies the pages by their importance to system execution. If a page is critical, PARS replicates it multiple copies to ensure high availability and long-term durability. Otherwise, the loss of this page causes no harm for system to work properly, PARS therefore saves only one copy of the page. Consequently, PARS improves storage efficiency without compromising availability. We have implemented PARS to justify its practicality. The experimental results demonstrate that PARS achieves 53.9% space saving compared to the native replication approach in HDFS which replicates the whole snapshot file fully and identically.

References

National center for biotechnology information. ftp://ftp.ncbi.nih.gov.Google Scholar
Elasticsearch. http://www.elasticsearch.org/.Google Scholar
Hdfs. http://hadoop.apache.org/.Google Scholar
Mummer. http://mummer.sourceforge.net/.Google Scholar
Mysql. http://www.mysql.com/.Google Scholar
Sysbench. http://sysbench.sourceforge.net/.Google Scholar
Weibo. http://weibo.com.Google Scholar
libhdfs. http://hadoop.apache.org/docs/r1.2.1/libhdfs.html.Google Scholar
Salesforce, 1999. http://www.salesforce.com.Google Scholar
Using the snapshot, 2003. https://www.vmware.com/support/ws4/doc/preserve_snapshot_ws.html.Google Scholar
readahead, 2005. https://lwn.net/Articles/155510/.Google Scholar
Amazon ec2, 2006. http://aws.amazon.com/ec2/.Google Scholar
Superfetch, 2007. http://en.wikipedia.org/wiki/Windows_Vista_I/O_technologies.Google Scholar
S. Akiyama, T. Hirofuchi, R. Takano, and S. Honiden. Fast wide area live migration with a low overhead through page cache teleportation. In Proceedings of CCGrid, pages 78--82, 2013.Google ScholarDigital Library
N. Amit, D. Tsafrir, and A. Schuster. Vswapper: A memory swapper for virtualized environments. In Proceedings of ASPLOS, pages 349--366, 2014. Google ScholarDigital Library
B. Calder, J. Wang, A. Ogus, and N. N. et al. Windows azure storage: A highly available cloud storage service with strong consistency. In Proceedings of SOSP, pages 143--157, 2011. Google ScholarDigital Library
J.-H. Chiang, H.-L. Li, and T. cker Chiueh. Introspectionbased memory de-duplication and migration. In Proceedings of VEE, pages 51--62, 2013. Google ScholarDigital Library
M. Christodorescu, R. Sailer, D. L. Schales, D. Sgandurra, and D. Zamboni. Cloud security is not (just) virtualization security: A short paper. In Proceedings of the ACM Workshop on Cloud Computing Security, pages 97--102, 2009. Google ScholarDigital Library
J. C. Corbett, J. Dean, and M. E. et al. Spanner: Google's globally-distributed database. In Proceedings of OSDI, pages 251--264, 2012. Google ScholarDigital Library
L. Cui, B. Li, Y. Zhang, and J. Li. Hotsnap: A hot distributed snapshot system for virtual machine cluster. In Proceedings of USENIX LISA, pages 59--73, 2013. Google ScholarDigital Library
L. Cui, J. Li, B. Li, and et al. Vmscatter: Migrate virtual machines to many hosts. In Proceedings of VEE, pages 63--72, 2013. Google ScholarDigital Library
U. Deshpande, X. Wang, and K. Gopalan. Live gang migration of virtual machines. In Proceedings of HPDC, pages 135--146, 2011. Google ScholarDigital Library
C. Dubnicki, L. Gryz, L. Heldt, and M. Kaczmarczyk. Hydrastor: A scalable secondary storage. In Proceedings of FAST, pages 197--210, 2009. Google ScholarDigital Library
D. Ford, F. Labelle, F. I. Popovici, M. Stokely, V.-A. Truong, L. Barroso, C. Grimes, and S.Quinlan. Availability in globally distributed storage systems. In Proceedings of OSDI, pages 1--14, 2010. Google ScholarDigital Library
T. Garfinkel and M. Rosenblum. A virtual machine introspection based architecture for intrusion detection. In Proceedings of NDSS, pages 191--206, 2003.Google Scholar
S. Ghemawat, H. Gobioff, and S.-T. Leung. The google file system. In Proceedings of SOSP, pages 29--43, 2003. Google ScholarDigital Library
D. Harnik, O. Margalit, D. Naor, D. Sotnikov, and G. Vernik. Estimation of deduplication ratios in large data sets. In Proceedings of MSST, pages 1--11, 2012.Google ScholarCross Ref
M. Hibler, R. Ricci, L. Stoller, J. Duerig, S. Guruprasad, T. Stack, K. Webb, and J. Lepreau. Large-scale virtualization in the emulab network testbed. In USENIX Annual Technical Conference, pages 113--128, 2008. Google ScholarDigital Library
X. Jiang and D. Xu. Violin: Virtual internetworking on overlay infrastructure. In Parallel and Distributed Processing and Applications, pages 937--946, 2005. Google ScholarDigital Library
H. Jin, L. Deng, and S. Wu. Live virtual machine migration with adaptive memory compression. In Proceedings of CLUSTER, pages 1--10, 2009.Google ScholarCross Ref
S. T. Jones, A. C. Arpaci-Dusseau, and R. H. Arpaci-Dusseau. Geiger: Monitoring the buffer cache in a virtual machine environment. In Proceedings of ASPLOS, pages 14--24, 2006. Google ScholarDigital Library
A. Kangarlou, P. Eugster, and D. Xu. Vnsnap: Taking snapshots of virtual networked environments with minimal downtime. In Proceedings of DSN, pages 87--98, 2011.Google Scholar
A. Kivity, Y. Kamay, D. Laor, U. Lublin, and A. Liguori. Kvm: the linux virtual machine monitor. In Proceedings of the Linux Symposium, pages 225--230, 2007.Google Scholar
J. F. Kloster, J. Kristensen, and A.Mejlholm. Determining the use of interdomain shareable pages using kernel introspection. Technical report, Aalborg University, 2007.Google Scholar
G. Kola, T. Kosar, and M. Livny. Faults in large distributed systems and what we can do about them. In Proceedings of Euro-Par, pages 442--453, 2005. Google ScholarDigital Library
R. Kotla, L. Alvisi, and M. Dahlin. Safestore: A durable and practical storage system. In Proceedings of ATC, pages 127--142, 2007. Google ScholarDigital Library
A. Koto, H. Yamada, K. Ohmura, and K. Kono. Towards unobtrusive vm live migration for cloud computing platforms. In Proceedings of APSys, pages 1--6, 2012. Google ScholarDigital Library
J. Kubiatowicz, D. Bindel, Y. Chen, and S. C. et al. Oceanstore: An architecture for global-scale persistent storage. In Proceedings of ASPLOS, pages 190--201, 2000. Google ScholarDigital Library
X. Li, M. Lillibridge, and M. Uysal. Reliability analysis of deduplicated and erasure-coded storage. Proceedings of SIGMETRICS Performance Evaluation Review, 38(3):4--9, 2010. Google ScholarDigital Library
C. Liu, D. Ju, Y. Gu, Y. Zhang, D. Wang, and D. H. Du. Semantic data de-duplication for archival storage systems. In Proceedings of ACSAC, pages 1--9, 2008.Google ScholarCross Ref
C. Marshall. Efficient and safe data backup with arrow. Technical report, 2008.Google Scholar
P. Nath, M. A. Kozuch, D. R. OHallaron, and J. Harkes. Design tradeoffs in applying content addressable storage to enterprise-scale systems based on virtual machines. In Proceedings of ATC, pages 71--84, 2006. Google ScholarDigital Library
E. Park, B. Egger, and J. Lee. Fast and space-efficient virtual machine checkpointing. In Proceedings of VEE, pages 75--85, 2011. Google ScholarDigital Library
M. R, Hines, and K. Gopalan. Post-copy based live virtual machine migration using adaptive pre-paging and dynamic self-ballooning. In Proceedings of VEE, pages 51--60, 2009. Google ScholarDigital Library
R. Rodrigues and B. Liskov. High availability in dht erasureerasure code vs replication. In Proceedings of IPTPS, pages 226--239, 2005. Google ScholarDigital Library
B. Schroeder and G. A. Gibson. Understanding failures in petascale computers. Journal of Physics, 78:1--11, 2007.Google ScholarCross Ref
B. Schroeder and G. A. Gibson. Disk failures in the real world what does an mttf of 1,000,000 hours mean to you. In Proceedings of FAST, pages 1--16, 2007. Google ScholarDigital Library
P. Stuedi, B.Metzler, and A. Trivedi. jverbs: Ultra-low latency for data center applications. In Proceedings of ACM SoCC, 2013. Google ScholarDigital Library
M. H. Sun and D. M. Blough. Fast, lightweight virtual machine checkpointing. Technical report, 2010.Google Scholar
Y. Xu, M. Bailey, B. Noble, and F. Jahanian. Small is better: Avoiding latency traps in virtualized data. In Proceedings of ACM SoCC, 2013. Google ScholarDigital Library
L. L. You, K. T. Pollack, and D. D. E. Long. Deep store: An archival storage system architecture. In Proceedings of ICDE, pages 804--815, 2005. Google ScholarDigital Library
I. Zhang, A. Garthwaite, Y. Baskakov, and K. C. Barr. Fast restore of checkpointed memory using working set estimation. In Proceedings of VEE, pages 534--533, 2009. Google ScholarDigital Library
I. Zhang, T. Denniston, Y. Baskakov, and A. Garthwaite. Optimizing vm checkpointing for restore performance in VMware esxi. In Proceedings of USENIX ATC, pages 1--12, 2013. Google ScholarDigital Library
W. Zhang, H. Tang, H. Jiang, T. Yang, X. Li, and Y. Zeng. Multi-level selective deduplication for vm snapshots in cloud storage. In Proceedings of Cloud, pages 550--557, 2012. Google ScholarDigital Library
B. Zhu, K. Li, and H. Patterson. Avoiding the disk bottleneck in the data domain deduplication file system. In Proceedings of FAST, pages 1--14, 2008. Google ScholarDigital Library

Index Terms

PARS: A Page-Aware Replication System for Efficiently Storing Virtual Machine Snapshots
1. Software and its engineering
  1. Software organization and properties
    1. Extra-functional properties
      1. Software fault tolerance
        Checkpoint / restart

Recommendations

PARS: A Page-Aware Replication System for Efficiently Storing Virtual Machine Snapshots
VEE '15

Virtual machine (VM) snapshot enhances the system availability by saving the running state into stable storage during failure-free execution and rolling back to the snapshot point upon failures. Unfortunately, the snapshot state may be lost due to disk ...
Read More
Introspection-Based Memory Pruning for Live VM Migration

Virtual Machine (VM) migration is an appealing technique on nowadays cloud platforms to achieve high availability, load balancing and power saving. Unfortunately, migration of VM involves transferring a large amount of data, thereby imposing high ...
Read More
Consistent and automatic replica regeneration

Reducing management costs and improving the availability of large-scale distributed systems require automatic replica regeneration, that is, creating new replicas in response to replica failures. A major challenge to regeneration is maintaining ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
VEE '15: Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments
March 2015
238 pages
ISBN:9781450334501
DOI:10.1145/2731186
General Chair:
Ada Gavrilovska
Georgia Tech
,
Program Chairs:
Angela Demke Brown
University of Toronto
,
Bjarne Steensgaard
Microsoft
ACM SIGPLAN Notices Volume 50, Issue 7
VEE '15
July 2015
221 pages
ISSN:0362-1340
EISSN:1558-1160
DOI:10.1145/2817817
Editor:
Andy Gill
University of Kansas, Lawrence, KS
Issue’s Table of Contents
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 14 March 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
availability
introspection
replication
storage space saving
virtual machine snapshot
Qualifiers
- research-article
Conference

Acceptance Rates
VEE '15 Paper Acceptance Rate16of50submissions,32%Overall Acceptance Rate80of235submissions,34%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 5
  Total Citations
  View Citations
- 402
  Total Downloads
- Downloads (Last 12 months)7
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

PARS: A Page-Aware Replication System for Efficiently Storing Virtual Machine Snapshots

VEE '15: Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments

ABSTRACT

References

Cited By

Index Terms

Recommendations

PARS: A Page-Aware Replication System for Efficiently Storing Virtual Machine Snapshots

Introspection-Based Memory Pruning for Live VM Migration

Consistent and automatic replica regeneration