Extending algorithm-based fault tolerance to tolerate fail-stop failures in high performance distributed environments | IEEE Conference Publication | IEEE Xplore