Abstract:
Reliability is widely identified as an increasingly relevant issue in heterogeneous service-oriented systems because processor failure affects the quality of service to u...Show MoreNotes: IEEE Xplore Notice to Reader "Minimizing Redundancy to Satisfy Reliability Requirement for a Parallel Application on Heterogeneous Service-oriented Systems" by Guoqi Xie, Gang Zeng, Yuekun Chen, Yang Bai, Zhili Zhou, Renfa Li, and Keqin Li published in IEEE Transactions on Services Computing Early Access Digital Object Identifier: 10.1109/TSC.2017.2665552 This article includes an author who was prohibited from publishing with IEEE prior to publication of the article. Due to this prohibition, reasonable effort should be made to remove all past references to this article, and refrain from future references to this article. We regret any inconvenience this may have caused.
Metadata
Abstract:
Reliability is widely identified as an increasingly relevant issue in heterogeneous service-oriented systems because processor failure affects the quality of service to users. Replication-based fault-tolerance is a common approach to satisfy application's reliability requirement. This study solves the problem of minimizing redundancy to satisfy reliability requirement for a directed acyclic graph (DAG)-based parallel application on heterogeneous service-oriented systems. We first propose the enough replication for redundancy minimization (ERRM) algorithm to satisfy application's reliability requirement, and then propose heuristic replication for redundancy minimization (HRRM) to satisfy application's reliability requirement with low time complexity. Experimental results on real and randomly generated parallel applications at different scales, parallelism, and heterogeneity verify that ERRM can generate least redundancy followed by HRRM, and the state-of-the-art MaxRe and RR algorithm. In addition, HRRM implements approximate minimum redundancy with a short computation time.
Notes: IEEE Xplore Notice to Reader "Minimizing Redundancy to Satisfy Reliability Requirement for a Parallel Application on Heterogeneous Service-oriented Systems" by Guoqi Xie, Gang Zeng, Yuekun Chen, Yang Bai, Zhili Zhou, Renfa Li, and Keqin Li published in IEEE Transactions on Services Computing Early Access Digital Object Identifier: 10.1109/TSC.2017.2665552 This article includes an author who was prohibited from publishing with IEEE prior to publication of the article. Due to this prohibition, reasonable effort should be made to remove all past references to this article, and refrain from future references to this article. We regret any inconvenience this may have caused.
Published in: IEEE Transactions on Services Computing ( Volume: 13, Issue: 5, 01 Sept.-Oct. 2020)