Failure detectors for large-scale distributed systems


Abstract:

This paper discusses the problem of implementing a scalable failure detection service for grid systems. More specifically, traditional implementations of failure detectors are often tuned for running over local networks and fail to address important problems found in wide-area distributed systems, such as grid systems. We identify some of the most important problems raised in the context of grids. We then survey recent proposals that can help solve some of these problems.
Date of Conference: 13-16 October 2002
Date Added to IEEE Xplore: 25 February 2003
Print ISBN: 0-7695-1659-9
Print ISSN: 1060-9857
Conference Location: Suita, Japan
