A proactive fault-detection mechanism in large-scale cluster systems | IEEE Conference Publication | IEEE Xplore