ABSTRACT
Software partitioning is a technology to partition a machine running a single instance of operating system image into multiple virtual machines called partitions. Each partition emulates an independent machine running a single instance of the operating system on dedicated hardware. All partitions are isolated form each other by the operating system. Software partitioning is very useful in server consolidation. A single powerful machine can be used to host many different servers each using a single software partition. This increases hardware resource utilization, gives flexibility to the administrator and can reduce system administration costs. The advantages offered by software partitioning are greatly enhanced with the capability to checkpoint a running software partition and restart it on a different machine. It helps in load balancing over hardware resources, load balancing over time and fault tolerance.
Workload Partition [WPAR] is IBM's implementation that provides software partitioning capability on the AIX operating system. It is possible to do the live migration of the WPARs in and across AIX systems. The live migration is achieved by the checkpoint/restart mechanism. It is possible to checkpoint and restart WPARs running most existing AIX applications without any modification to the applications. Also the checkpoint and restart process is transparent to the application running inside the WPAR (Partition).
This paper discusses the issues faced in implementing software partition checkpoint and restart in the AIX operating system. These issues will be typical to any standard UNIX operating system. To successfully checkpoint and restart a software partition, it is necessary not only to checkpoint all the user processes in the partition but also to checkpoint global data pertaining the partition itself and data shared between processes of the Partition like IPC data, Streams, timers, file handles, memory mapped regions, shared memory, System services, Virtual devices etc. The WPAR implementation handles both; the checkpoint of individual processes as well as checkpoint of partition wide data.
- Dejan S. Milojičić, Fred Douglis, Yves Paindaveine, Richard Wheeler, Songnian Zhou Process MigrationGoogle Scholar
- Eric Roman, A Survey of Checkpoint/Restart Implementations.Google Scholar
- Erik Hendriks. VMADump. http://bproc.sourceforge.net. 2002Google Scholar
- Hua Zhong and Jason Nieh. CRAK: Linux Checkpoint / Restart As a Kernel Module. Technical Report CUCS-014-01. Department of Computer Science. Columbia University, November 2002Google Scholar
- José Carlos Sancho Fabrizio Petrini Kei Davis Roberto Gioiosa Song Jiang Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault ToleranceGoogle Scholar
- Michael Litzkow, Todd Tannenbaum, Jim Basney, and Miron Livny. Checkpoint and migration of UNIX processes in the Condor distributed processing system. Technical Report CS-TR-199701346, University of Wisconsin, Madison, 1997.Google Scholar
- William R. Dieter, and James E. Lumpp, Jr., User-level Checkpointing for LinuxThreads Programs. FREENIX Track: USENIX 2001 Annual Technical. Conference. pp. 81--92. June, 2001 Google ScholarDigital Library
- Y. Wang, Y. Huang, K. Vo, P. Chung, and C. Kintala. Checkpointing and Its Applications. In Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing, Pasadena, California, June 27--30, 1995 Google ScholarDigital Library
Index Terms
- Migration of software partition in UNIX system
Recommendations
Modeling and analysis of software rejuvenation in a server virtualized system with live VM migration
As server virtualization is used in a number of IT systems, the unavailability of virtual machines (VM) on server virtualized systems becomes a significant concern. Software rejuvenation is a promising technique for improving the availability of server ...
Improving Total Migration Time in Live Virtual Machine Migration
ICCCT '15: Proceedings of the Sixth International Conference on Computer and Communication Technology 2015Virtualization is the key underlying technology enabling cloud providers to host services for a large number of customers. Live migration is an essential feature of virtualization that allows transfer of virtual machines from one physical server to ...
Zero-copy Migration for Lightweight Software Rejuvenation of Virtualized Systems
APSys '15: Proceedings of the 6th Asia-Pacific Workshop on SystemsVirtualized systems tend to suffer from software aging, which is the phenomenon that the state of a running system degrades with time. Software aging is restored by a technique called software rejuvenation, e.g., a system reboot. To reduce the downtime ...
Comments