Replication and Checkpoint Schemes for Task-Fault Tolerance in Campus-Wide Mobile Grid

Choi, SookKyong; Lee, JongHyuk; Yu, HeonChang; Lee, Hwamin

doi:10.1007/978-3-642-27180-9_56

SookKyong Choi⁸,
JongHyuk Lee⁸,
HeonChang Yu⁸ &
…
Hwamin Lee⁹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 261))

Included in the following conference series:

International Conference on Grid and Distributed Computing

1626 Accesses
2 Citations

Abstract

Mobile grid computing is a computing environment that incorporates mobile devices to an existing grid environment and supports users’ mobility. But this environment is not stable, so methodologies to cope with the reliability issue are needed. Fault tolerance approaches for task execution in grid computing can be categorized into replication and checkpoint. We apply these techniques to a SimGrid simulator to provide a fault tolerance for a mobile environment and show the results in this paper. The results demonstrate that the best solution for fault tolerance in mobile grid computing depends on the situations of the network. The contribution of this paper is the use of real-life trace data to simulate fault tolerance in a mobile grid computing.

This work was supported by National Research Foundation of Korea Grant funded by the Korean Government (2009-0070138)

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A Hybrid Fault Tolerant Scheduler for Computational Grid Environment

Adaptive fault-tolerant scheduling strategies for mobile cloud computing

Article 10 January 2019

Reliability-Aware Distributed Computing Scheduling Policy

References

Kang, W., Huang, H.H., Grimshaw, A.: A highly available job execution service in computational service market. In: 8th IEEE/ACM International Conference on Grid Computing, September 19-21, pp. 275–282 (2007)
Google Scholar
Katsaros, K., Polyzos, G.C.: Evaluation of scheduling policies in a Mobile Grid architecture. In: Proc. International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS 2008), Edinburgh, UK (June 2008)
Google Scholar
Silva, D., Cirne, W., Brasileiro, F.: Trading Cycles for Information: Using Replication to Schedule Bag-of-Tasks Applications on Computational Grids. In: Kosch, H., Böszörményi, L., Hellwagner, H. (eds.) Euro-Par 2003. LNCS, vol. 2790, pp. 169–180. Springer, Heidelberg (2003)
Chapter Google Scholar
Dobber, M., van der Mei, R., Koole, G.: Dynamic Load Balancing and Job Replication in a Global-Scale Grid Environment: A Comparison. IEEE Transactions on Parallel and Distributed Systems 20(2), 207–218 (2009)
Article Google Scholar
Limaye, K., Leangsuksun, C.B., et al.: Job-Site Level Fault Tolerance for Cluster and Grid environments. In: The 2005 IEEE Cluster Computing, Boston, MA, September 27-30 (2005)
Google Scholar
Baghavathi Priya, S., Prakash, M., Dhawan, K.K.: Fault Tolerance-Genetic Algorithm for Grid Task Scheduling using Check Point. In: Sixth International Conference on Grid and Cooperative Computing, GCC 2007 (2007)
Google Scholar
Katsaros, P., Angelis, L., Lazos, C.: "Performance and Effectiveness Trade-Off for Checkpointing in Fault-Tolerant Distributed Systems. Concurrency and Computation: Practice and Experience 19(1), 37–63 (2007)
Article Google Scholar
Darby III, P.J., Tzeng, N.-F.: Decentralized QoS-Aware Checkpointing Arrangement in Mobile Grid Computing. IEEE Transactions On Mobile Computing 9(8), 1173–1186 (2010)
Article Google Scholar
Wu, C.-C., Lai, K.-C., Sun, R.-Y.: GA-Based Job Scheduling Strategies for Fault Tolerant Grid Systems. In: IEEE Asia-Pacific Services Computing Conference (2008)
Google Scholar
Chtepen, M., Claeys, F.H.A., Dhoedt, B., De Turck, F., Demeester, P., Vanrolleghem, P.A.: Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids. IEEE Transactions on Parallel and Distributed Systems 20(2), 180–190 (2009)
Article Google Scholar
Katsaros, K., Polyzos, G.C.: Optimizing Operation of a Hierarchical Campus-wide Mobile Grid for Intermittent Wireless Connectivity. In: 15th IEEE Workshop on Local & Metropolitan Area Networks, LANMAN 2007, June 10-13, pp. 111–116 (2007)
Google Scholar
Balazinska, M., Castro, P.: Characterizing Mobility and Network Usage in a CorporateWireless Local-Area Network. In: Proceedings of the First International Conference on Mobile Systems, Applications, and Services (2003)
Google Scholar
Henderson, T., Kotz, D.: CRAWDAD trace dartmouth/campus/syslog/05_06 (February 8, 2007), http://crawdad.cs.dartmouth.edu
Lee, J.H., Choi, S.J., Suh, T., Yu, H.C.: Mobility-aware Balanced Scheduling Algorithm in Mobile Grid Based on Mobile Agent. The Knowledge Engineering Review (2010) (accepted for publication)
Google Scholar
Buyya, R., Murshed, M.: GridSim: A Toolkit for the Modeling and Simulation of Distributed Resource Management and Scheduling for Grid Computing. J. Concurrency and Computation: Practice and Experience 14, 13–15 (2002)
MATH Google Scholar
Sulistio, A., Cibej, U., Venugopal, S., Robic, B., Buyya, R.: A toolkit for modelling and simulating data Grids: an extension to GridSim. Concurrency and Computation: Practice & Experience 20(13), 1591–1609 (2008)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science Education, Korea University, Anam-Dong, Seongbuk-Gu, Seoul, Korea
SookKyong Choi, JongHyuk Lee & HeonChang Yu
Dept. of Computer Software Engineering, Soonchunhyang University, 336-745, Asan-si, Korea
Hwamin Lee

Authors

SookKyong Choi
View author publications
You can also search for this author in PubMed Google Scholar
JongHyuk Lee
View author publications
You can also search for this author in PubMed Google Scholar
HeonChang Yu
View author publications
You can also search for this author in PubMed Google Scholar
Hwamin Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Multimedia Engineering Department, Hannam University, 133 Ojeong-dong, Daeduk-gu, Daejeon, Korea
Tai-hoon Kim
The Ohio State University, 470 Hitchcock Hall, 2070 Neil Avenue, 43210-1275, Columbus, OH, USA
Hojjat Adeli
Chungwoon University, 350-701, Chungnam, Korea
Hyun-seob Cho
Department of Mathematics and Computer Science, University of Perugia, Via Vanvitelli, 1, 06123, Perugia, Italy
Osvaldo Gervasi
Department of Computer Science and Engineering, Arizona State University, 85281, Mesa, AZ, USA
Stephen S. Yau
School of Computing and Information Systems, University of Tasmania, Hobart, TAS, Australia
Byeong-Ho Kang
Universidad Complutense de Madrid, 28040, Madrid, Spain
Javier García Villalba

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Choi, S., Lee, J., Yu, H., Lee, H. (2011). Replication and Checkpoint Schemes for Task-Fault Tolerance in Campus-Wide Mobile Grid. In: Kim, Th., et al. Grid and Distributed Computing. GDC 2011. Communications in Computer and Information Science, vol 261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27180-9_56

Download citation

DOI: https://doi.org/10.1007/978-3-642-27180-9_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27179-3
Online ISBN: 978-3-642-27180-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics