Transparent Network Connectivity in Dynamic Cluster Environments

  • Conference paper
Network-Based Parallel Computing. Communication, Architecture, and Applications (CANPC 2000)

Abstract

This paper presents the design, implementation, and performance of a transparent network connectivity layer for dynamic cluster environments. Our design uses the techniques of API interception and virtualization to construct a transparent layer in user space; use of the layer requires no modification either to the application or the underlying operating system and messaging layers. Our layer enables the migration of application components without breaking network connections, and additionally permits adaptation to the characteristics of the underlying networking substrate. Experiments with supporting a persistent socket interface in two environments—an Ethernet LAN on top of TCP/IP, and a Myrinet LAN on top of Fast Messages—show that our approach incurs minimal overheads and can effectively select the best substrate for implementing application communication requirements.
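The virtualization idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the paper's layer intercepts an existing socket API, whereas the sketch below only shows the indirection that lets a connection survive a change of transport. The names `LoopbackTransport`, `VirtualSocket`, `suspend`, and `rebind` are hypothetical, and the in-memory transport merely stands in for a real substrate such as TCP/IP or Fast Messages.

```python
class LoopbackTransport:
    """Hypothetical in-memory stand-in for a real messaging substrate."""

    def __init__(self, name):
        self.name = name
        self.delivered = []  # records every payload handed to this transport

    def send(self, data: bytes) -> int:
        self.delivered.append(data)
        return len(data)


class VirtualSocket:
    """Stable handle held by the application (assumed design, for illustration).

    The application only ever calls send() on this object; the layer may
    swap the real transport underneath, for example around a migration.
    """

    def __init__(self, transport):
        self._transport = transport
        self._suspended = False
        self._pending = []  # payloads buffered while the endpoint is moving

    def send(self, data: bytes) -> int:
        if self._suspended:
            # Buffer instead of failing, so the caller never sees a break.
            self._pending.append(data)
            return len(data)
        return self._transport.send(data)

    def suspend(self):
        """Stop using the current transport (e.g. just before migration)."""
        self._suspended = True

    def rebind(self, new_transport):
        """Attach a new transport and flush buffered payloads in order."""
        self._transport = new_transport
        self._suspended = False
        pending, self._pending = self._pending, []
        for data in pending:
            self._transport.send(data)


tcp = LoopbackTransport("tcp")
fm = LoopbackTransport("fm")
sock = VirtualSocket(tcp)
sock.send(b"before")
sock.suspend()
sock.send(b"during")   # buffered, not lost
sock.rebind(fm)        # same handle, different substrate
sock.send(b"after")
```

In this sketch the application keeps a single `sock` object throughout, while the payloads sent during and after the switch end up on the second transport. A real layer would also have to coordinate with the remote peer and preserve receive-side state, which is omitted here.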




Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fu, X., Wang, H., Karamcheti, V. (2000). Transparent Network Connectivity in Dynamic Cluster Environments. In: Falsafi, B., Lauria, M. (eds) Network-Based Parallel Computing. Communication, Architecture, and Applications. CANPC 2000. Lecture Notes in Computer Science, vol 1797. Springer, Berlin, Heidelberg. https://doi.org/10.1007/10720115_3

  • DOI: https://doi.org/10.1007/10720115_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67879-3

  • Online ISBN: 978-3-540-44655-2

  • eBook Packages: Springer Book Archive
