Skip to main content

Replication and Checkpoint Schemes for Task-Fault Tolerance in Campus-Wide Mobile Grid

  • Conference paper
Grid and Distributed Computing (GDC 2011)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 261))

Included in the following conference series:

Abstract

Mobile grid computing is a computing environment that incorporates mobile devices to an existing grid environment and supports users’ mobility. But this environment is not stable, so methodologies to cope with the reliability issue are needed. Fault tolerance approaches for task execution in grid computing can be categorized into replication and checkpoint. We apply these techniques to a SimGrid simulator to provide a fault tolerance for a mobile environment and show the results in this paper. The results demonstrate that the best solution for fault tolerance in mobile grid computing depends on the situations of the network. The contribution of this paper is the use of real-life trace data to simulate fault tolerance in a mobile grid computing.

This work was supported by National Research Foundation of Korea Grant funded by the Korean Government (2009-0070138)

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kang, W., Huang, H.H., Grimshaw, A.: A highly available job execution service in computational service market. In: 8th IEEE/ACM International Conference on Grid Computing, September 19-21, pp. 275–282 (2007)

    Google Scholar 

  2. Katsaros, K., Polyzos, G.C.: Evaluation of scheduling policies in a Mobile Grid architecture. In: Proc. International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS 2008), Edinburgh, UK (June 2008)

    Google Scholar 

  3. Silva, D., Cirne, W., Brasileiro, F.: Trading Cycles for Information: Using Replication to Schedule Bag-of-Tasks Applications on Computational Grids. In: Kosch, H., Böszörményi, L., Hellwagner, H. (eds.) Euro-Par 2003. LNCS, vol. 2790, pp. 169–180. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  4. Dobber, M., van der Mei, R., Koole, G.: Dynamic Load Balancing and Job Replication in a Global-Scale Grid Environment: A Comparison. IEEE Transactions on Parallel and Distributed Systems 20(2), 207–218 (2009)

    Article  Google Scholar 

  5. Limaye, K., Leangsuksun, C.B., et al.: Job-Site Level Fault Tolerance for Cluster and Grid environments. In: The 2005 IEEE Cluster Computing, Boston, MA, September 27-30 (2005)

    Google Scholar 

  6. Baghavathi Priya, S., Prakash, M., Dhawan, K.K.: Fault Tolerance-Genetic Algorithm for Grid Task Scheduling using Check Point. In: Sixth International Conference on Grid and Cooperative Computing, GCC 2007 (2007)

    Google Scholar 

  7. Katsaros, P., Angelis, L., Lazos, C.: "Performance and Effectiveness Trade-Off for Checkpointing in Fault-Tolerant Distributed Systems. Concurrency and Computation: Practice and Experience 19(1), 37–63 (2007)

    Article  Google Scholar 

  8. Darby III, P.J., Tzeng, N.-F.: Decentralized QoS-Aware Checkpointing Arrangement in Mobile Grid Computing. IEEE Transactions On Mobile Computing 9(8), 1173–1186 (2010)

    Article  Google Scholar 

  9. Wu, C.-C., Lai, K.-C., Sun, R.-Y.: GA-Based Job Scheduling Strategies for Fault Tolerant Grid Systems. In: IEEE Asia-Pacific Services Computing Conference (2008)

    Google Scholar 

  10. Chtepen, M., Claeys, F.H.A., Dhoedt, B., De Turck, F., Demeester, P., Vanrolleghem, P.A.: Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids. IEEE Transactions on Parallel and Distributed Systems 20(2), 180–190 (2009)

    Article  Google Scholar 

  11. Katsaros, K., Polyzos, G.C.: Optimizing Operation of a Hierarchical Campus-wide Mobile Grid for Intermittent Wireless Connectivity. In: 15th IEEE Workshop on Local & Metropolitan Area Networks, LANMAN 2007, June 10-13, pp. 111–116 (2007)

    Google Scholar 

  12. Balazinska, M., Castro, P.: Characterizing Mobility and Network Usage in a CorporateWireless Local-Area Network. In: Proceedings of the First International Conference on Mobile Systems, Applications, and Services (2003)

    Google Scholar 

  13. Henderson, T., Kotz, D.: CRAWDAD trace dartmouth/campus/syslog/05_06 (February 8, 2007), http://crawdad.cs.dartmouth.edu

  14. Lee, J.H., Choi, S.J., Suh, T., Yu, H.C.: Mobility-aware Balanced Scheduling Algorithm in Mobile Grid Based on Mobile Agent. The Knowledge Engineering Review (2010) (accepted for publication)

    Google Scholar 

  15. Buyya, R., Murshed, M.: GridSim: A Toolkit for the Modeling and Simulation of Distributed Resource Management and Scheduling for Grid Computing. J. Concurrency and Computation: Practice and Experience 14, 13–15 (2002)

    MATH  Google Scholar 

  16. Sulistio, A., Cibej, U., Venugopal, S., Robic, B., Buyya, R.: A toolkit for modelling and simulating data Grids: an extension to GridSim. Concurrency and Computation: Practice & Experience 20(13), 1591–1609 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Choi, S., Lee, J., Yu, H., Lee, H. (2011). Replication and Checkpoint Schemes for Task-Fault Tolerance in Campus-Wide Mobile Grid. In: Kim, Th., et al. Grid and Distributed Computing. GDC 2011. Communications in Computer and Information Science, vol 261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27180-9_56

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27180-9_56

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27179-3

  • Online ISBN: 978-3-642-27180-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics