Skip to main content
Log in

Distributed File System Virtualization Techniques Supporting On-Demand Virtual Machine Environments for Grid Computing

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

This paper presents a data management solution which allows fast Virtual Machine (VM) instantiation and efficient run-time execution to support VMs as execution environments in Grid computing. It is based on novel distributed file system virtualization techniques and is unique in that: (1) it provides on-demand cross-domain access to VM state for unmodified VM monitors; (2) it enables private file system channels for VM instantiation by secure tunneling and session-key based authentication; (3) it supports user-level and write-back disk caches, per-application caching policies and middleware-driven consistency models; and (4) it leverages application-specific meta-data associated with files to expedite data transfers. The paper reports on its performance in wide-area setups using VMware-based VMs. Results show that the solution delivers performance over 30% better than native NFS and with warm caches it can bring the application-perceived overheads below 10% compared to a local-disk setup. The solution also allows a VM with 1.6 GB virtual disk and 320 MB virtual memory to be cloned within 160 seconds for the first clone and within 25 seconds for subsequent clones.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. I. Foster, C. Kesselman and S. Tuecke, The anatomy of the grid: Enabling scalable virtual organizations, International Journal of Supercomputer Applications 15(3) (2001).

  2. A. Butt, S. Adabala, N. Kapadia, R. Figueiredo and J. Fortes, Grid-computing portals and security issues, Journal of Parallel and Distributed Computing 63(10) (2003) 1006–1014.

    Article  Google Scholar 

  3. R.P. Goldberg, Survey of virtual machine research, IEEE Computer Magazine 7(6) (1974) 34–45.

    Google Scholar 

  4. R.J. Figueiredo, P.A. Dinda and J.A.B. Fortes, A case for grid computing on virtual machines, in: Proc. International Conference on Distributed Computing Systems (May 2003).

  5. R.J. Figueiredo, N. Kapadia and J.A.B. Fortes, Seamless access to decentralized storage services in computational grids via a virtual file system, Cluster Computing Journal 7(2) (2004) 113–122.

    Google Scholar 

  6. B. Pawlowski, C Juszczak, P. Staubach, C. Smith, D. Lebel and D. Hitz, NFS version 3 design and implementation, in: Proc. USENIX Summer Technical Conference (1994).

  7. J. Sugerman, G. Venkitachalan and B-H. Lim, Virtualizing I/O devices on vmware workstation's hosted virtual machine monitor, in: Proc. USENIX Annual Technical Conference (June 2001).

  8. J. Dike, A user-mode port of the linux kernel, in: Proc. the 4th Annual Linux Showcase and Conference, USENIX Association, Atlanta, GA (October 2000).

  9. P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt and A. Warfield, Xen and the art of virtualization, in: Proc. ACM Symposium on Operating Systems Principles (October 2003).

  10. B. Allcock, J. Bester, J. Bresnahan, A. Chervenak, I. Foster, C. Kesselman, S. Meder, V. Nefedova, D. Quesnel and S. Tuecke, Secure, efficient data transport and replica management for high-performance data-intensive computing, in: Proc. IEEE Mass Storage Conference (2001).

  11. J. Bester, I. Foster, C. Kesselman, J. Tedesco and S. Tuecke, GASS: A data movement and access service for wide area computing systems, in: Proc. the 6th Workshop on I/O in Parallel and Distributed Systems (May 1999).

  12. N. Kapadia, R.J. Figueiredo and J.A.B. Fortes, Enhancing the scalability and usability of computational grids via logical user accounts and virtual file systems, in: Proc. Heterogeneous Computing Workshop at the International Parallel and Distributed Processing Symposium (April 2001).

  13. S. Adabala, A. Matsunaga, M. Tsugawa, R.J. Figueiredo and J.A.B. Fortes, Single sign-on in in-vigo: Role-based access via delegation mechanisms using short-lived user identities, in: Proc. International Parallel and Distributed Processing Symposium (April 2004).

  14. S. Adabala, V. Chadha, P. Chawla, R.J. Figueiredo, J.A.B. Fortes, I. Krsul, A. Matsunaga, M. Tsugawa, J. Zhang, M. Zhao, L. Zhu, and X. Zhu, From virtualized resources to virtual computing grids: The in-vigo system, Future Generation Computing Systems, special issue on Complex Problem-Solving Environments for Grid Computing, Vol 21/6, pp. 896–909.

  15. J.C. Bowman, Secure NFS via SSH tunnel, http://www.math. ualberta.ca/imaging/snfs/

  16. B. Callaghan, NFS Illustrated (Addison-Wesley, 2002).

  17. J. Hennessy and D. Patterson, Computer Architecture: A Quantitative Approach, 3rd edition (Morgan Kaufmann, 2002).

  18. J. Morris, M. Satyanarayanan, M. Conner, J. Howard, D. Rosenthal and F. Smith, Andrew: A distributed personal computing environment, Communications of the ACM 29(3) (March 1986) 184–201.

    Article  Google Scholar 

  19. M. Zhao, V. Chadha and R.J. Figueiredo, Supporting application-tailored grid file system sessions with wsrf-based services, in: Proc. the 14th IEEE International Symposium on High Performance Distributed Computing (July 2005).

  20. I.V. Krsul, A. Ganguly, J. Zhang, J.A.B. Fortes and R.J. Figueiredo, VMPlants: Providing and managing virtual machine execution environments for grid computing, in: Proc. the 2004 ACM/IEEE conference on Supercomputing, (July 2004).

  21. VMware Inc., VMware VirtualCenter user's manual, http://www.vmware.com/pdf/VC_Users_Manual_11.pdf, 13th Apr. 2004.

  22. C. Sapuntzakis, R. Chandra, B. Pfaff, J. Chow, M. Lam and M. Rosenblum, Optimizing the migration of virtual computers, in: Proc. the 5th Symposium on Operating Systems Design and Implementation (2002).

  23. A. Zeitoun, Z. Wang and S. Jamin, RTTometer: Measuring path minimum rtt with confidence, IEEE Workshop on IP Operations and Management (2003).

  24. J.H. Howard, M.L. Kazar, S.G. Menees, D.A. Nichols, M. Satyanarayanan, R.N. Sidebotham and M.J. West, Scale and performance of a distributed file system, ACM Transactions on Computer Systems 6(1) (1988) 51–81.

    Article  Google Scholar 

  25. VMware Inc., GSX Server 2.5.1 user's manual, http://www.vmware.com/pdf/gsx251vm_manual.pdf.

  26. B. White, A. Grimshaw and A. Nguyen-Tuong, Grid-based file access: The legion I/O model, in: Proc. the 9th IEEE International Symposium on High Performance Distributed Computing (Aug. 2000) pp. 165–173.

  27. M. Litzkow, M. Livny and M.W. Mutka, Condor: A hunter of idle workstations, in: Proc. the 8th International Conference on Distributed Computing Systems (June 1988) pp. 104–111.

  28. D. Thain, J. Basney, S-C. Son and M. Livny, The kangaroo approach to data movement on the grid, in: Proc. the 10th IEEE International Symposium on High Performance Distributed Computing (Aug 2001) pp. 325–333.

  29. J. Bent, V. Venkataramani, N. LeRoy, A. Roy, J. Stanley, A. Arpaci-Dusseau, R. Arpaci-Dusseau and M. Livny, Flexibility, manageability and performance in a grid storage appliance, in: Proc. the Eleventh IEEE Symposium on High Performance Distributed Computing, Edinburgh, Scotland (July 2002).

  30. A. Alexandrov, M. Ibel, K. Schauser and C. Scheiman, UFO: A personal global file system based on user-level extensions to the operating system, ACM Transactions on Computer Systems (Aug. 1998) pp. 207–233.

  31. D. Thain and M. Livny, Parrot: Transparent user-level middleware for data-intensive computing, in: Proc. Workshop on Adaptive Grid Middleware, New Orleans, Louisiana (September 2003).

  32. D. Mazieres, M. Kaminsky, M. Kaashoek and E. Witchel, Separating key management from file system security, in: Proc. the 17th ACM Symposium on Operating System Principles, (Dec. 1999).

  33. B. Pawlowski, S. Shepler, C. Beame, B. Callaghan, M. Eisler, D. Noveck, D. Robinson and R. Thurlow, The NFS version 4 protocol, in: Proc. the 2nd International System Administration and Networking Conference (May 2000).

  34. R. Chandra, N. Zeldovich, C. Sapuntzakis and M.S. Lam, The collective: A cache-based system management architecture, in: Proc. the 2nd Symposium on Networked Systems Design and Implementation (May 2005) pp. 259–272,.

  35. C. Clark, K. Fraser, S. Hand, J.G. Hansen, E. Jul, C. Limpach, I. Pratt and A. Warfield, Live migration of virtual machines, in: Proc. the 2nd Symposium on Networked Systems Design and Implementation (May 2005).

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ming Zhao.

Additional information

Ming Zhao is a PhD candidate in the department of Electrical and Computer Engineering and a member of the Advance Computing and Information Systems Laboratory, at University of Florida. He received the degrees of BE and ME from Tsinghua University. His research interests are in the areas of computer architecture, operating systems and distributed computing.

Jian Zhang is a PhD student in the Department of Electrical and Computer Engineering at University of Florida and a member of the Advance Computing and Information Systems Laboratory (ACIS). Her research interest is in virtual machines and Grid computing. She is a member of the IEEE and the ACM.

Renato J. Figueiredo received the B.S. and M.S. degrees in Electrical Engineering from the Universidade de Campinas in 1994 and 1995, respectively, and the Ph.D. degree in Electrical and Computer Engineering from Purdue University in 2001. From 2001 until 2002 he was on the faculty of the School of Electrical and Computer Engineering of Northwestern University at Evanston, Illinois. In 2002 he joined the Department of Electrical and Computer Engineering of the University of Florida as an Assistant Professor. His research interests are in the areas of computer architecture, operating systems, and distributed systems.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhao, M., Zhang, J. & Figueiredo, R.J. Distributed File System Virtualization Techniques Supporting On-Demand Virtual Machine Environments for Grid Computing. Cluster Comput 9, 45–56 (2006). https://doi.org/10.1007/s10586-006-4896-x

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-006-4896-x

Keywords

Navigation