Improving availability of software subsystems through on-line error detection | IBM Journals & Magazine | IEEE Xplore