Skip to main content
Log in

A Dynamic Shadow Approach to Fault-Tolerant Mobile Agents in an Autonomic Environment

  • Published:
Real-Time Systems Aims and scope Submit manuscript

Abstract

Large-scale distributed applications such as online information retrieval and collaboration over computational elements demand an approach to self-managed computing systems with a minimum of human interference. However, large scales and full distribution often lead to poor system dependability and security, and increase the difficulty in managing and controlling redundancy for fault tolerance. In particular, fault tolerance schemes for mobile agents to survive agent server crash failures in an autonomie environment are complex since developers normally have no control over remote agent servers. Some solutions inject a replica into stable storage upon its arrival at an agent server. But in the event of an agent server crash the replica is unavailable until the agent server recovers. In this paper we present a failure model and an exception handling framework for mobile agent systems. An exception handling scheme is developed for mobile agents to survive agent server crash failures. A replica mobile agent operates at the agent server visited prior to its master's current location. If a master crashes its replica is available as a replacement. The proposed scheme is examined in comparison with a simple time-out scheme. Experimental evaluation is performed, and performance results show that the scheme leads to some overhead in the round trip time when fault tolerance measures are exercised. However the scheme offers the advantage that fault tolerance is provided during the mobile agent trip, i.e. in the event of an agent server crash all agent servers are not revisited.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Coulouris, G., Dollimore, J., and Kindberg, T. 2001. Distributed Systems Concepts and Design, 3rd edition, Addison Wesley.

  • Fuggetta, A., Picco, G. P., and Vigna, G. 1998. Understanding code mobility. IEEE Transactions on Software Engineering, 24(5):342–361.

    Article  Google Scholar 

  • Macedo, R. J. A., and Silva, F. M. 2001. Integrating mobility into groups. In European Research Seminar on Advances in Distributed Systems (ERSADS), Bertinoro.

  • Marsden, E., Fabre, J., and Arlat, J. 2002. Dependability of CORBA systems: Service characterization by fault injection. In Proc. 21st IEEE Symposium on Reliable Distributed Systems, Suita, pp. 276–285.

  • Mohindra, A., Purakayastha, A., and Tahiti, P. 2000. Exploiting non-determinism for reliability of mobile agent systems, In Proc. International Conference on Dependable Systems and Networks, New York, pp. 144–153.

  • Moser, L. E., Melliar Smith, P. M., Agarwal, D. A., Budhia, R. K., and Lingley-Papadopoulos, C. A. 1996. Totem: A fault tolerant multicast group communication system. Communications of the ACM, 39(4):54–63.

    Article  Google Scholar 

  • Nagamuta, V., and Endler, M. 2001. Coordinating mobile agents through the broadcast channel. Anais do Simp$osio Brasileiro de Redes de Computadores (SBRC 2001), Florianopolis.

  • Oshima, M., Karjoth, G., and Ono, K. 1998. Aglets specification 1.1 draft, http://www.trl.ibm.co.jp/aglets/specll.httml.

  • Park, T., Byun, I., Kim, H., and Yeom, H. 2002. The performance of checkpointing and replication schemes for fault tolerant mobile agent systems. In Proc. 21st IEEE Symposium on Reliable Distributed Systems, Suita, pp. 256–261.

  • Pears, S., Xu J., and Boldyreff, C. 2003. Mobile agent fault tolerance for information retrieval applications: An exception handling approach. In Proc. 6th International Symposium on Autonomous Decentralized Systems, Pisa.

  • Pleisch, S., and Schiper, A. 2000. Modeling fault-tolerant mobile agents as a sequence of agreement problems. In Proc. 19th Symposium on Reliable Distributed Systems (SRDS), Nuremberg, pp. 11–20.

  • Renesse, R., Birman, K. P., and Maffeis, S. 1996. Horus: A flexible group communication system. Communications of the ACM, 39(4):76–83.

    Google Scholar 

  • Schneider, F. 1997. Towards fault-tolerant and secure agentry. In Proc. 11th International Workshop on Distributed Algorithms, Saarbrucken, pp. 1–14.

  • Silva, L. M., Batista, V., and Silva, J. G. 2000. Fault-tolerant execution of mobile agents. In Proc. International Conference on Dependable Systems and Networks, New York, pp. 144–153.

  • Silva, F. M., and Popescu-Zeletin, R. 2000. Mobile agent-based transactions in open environments. IEICE Transactions on Communications, Vol. E83-B, No. 5, pp. 973–987.

    Google Scholar 

  • Strasser, M., Rothermel, K., and Maihofer, C. 1998. Providing reliable agents for electronic commerce. In Trends in Distributed Systems for Electronic Commerce (TREC'98), LNCS 1402, Springer-Verlag, pp. 241–253.

  • Tripathi, A., and Karnik, N. 1998. Protected resource access for mobile agent-based distributed computing. In Proc. ICPP workshop on Wireless Networking and Mobile Computing, Minneapolis, pp. 144–153.

  • Tripathi, A., and Milner, R. 2001. Exception handling in agent-oriented systems. In Advances in Exception Handling Techniques, LNCS-2022, Springer-Verlag, pp. 128–146.

  • Vogler, H., Hunklemann, T., and Moschgath, M. 1997. An approach for mobile agent security and fault tolerance using distributed transactions. In Proc. International Conference on Parallel and Distributed Systems (ICPADS'97), Seoul, pp. 268–274.

  • Waldo, J. 2001. Mobile code, distributed computing and agents. IEEE Intelligent Systems, 16(2):10–12.

    Article  Google Scholar 

  • Xu, J., and Randell, B. 2000. Tutorial: Exception handling and software fault tolerance. In Proc. International Conference on Dependable Systems and Networks, New York.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xu, J., Pears, S. A Dynamic Shadow Approach to Fault-Tolerant Mobile Agents in an Autonomic Environment. Real-Time Syst 32, 235–252 (2006). https://doi.org/10.1007/s11241-005-4682-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11241-005-4682-5

Keywords

Navigation