Abstract
This paper presents DARX, our framework for building failure- resilient applications through adaptive fault tolerance. It relies on the fact that multi-agent platforms constitute a very strong basis for decentralized software that is both flexible and scalable, and makes the assumption that the relative importance of each agent varies during the course of the computation. DARX regroups solutions which facilitate the creation of multi-agent applications in a large-scale context. Its most important feature is adaptive replication: replication strategies are applied on a per-agent basis with respect to transient environment characteristics such as the importance of the agent for the computation, the network load or the mean time between failures.
Firstly, the interwoven concerns of multi-agent systems and fault-tolerant solutions are put forward. An overview of the DARX architecture follows, as well as an evaluation of its performances. We conclude, after outlining the promising outcomes, by presenting prospective work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Boukachour, H., Duvallet, C., Cardon, A.: Multiagent systems to prevent technological risks. In: Logananthara, R., Palm, G., Ali, M. (eds.) IEA/AIE 2000. LNCS (LNAI), vol. 1821, Springer, Heidelberg (2000)
Briot, J.-P., Guessoum, Z., Charpentier, S., Aknine, S., Marin, O., Sens, P.: Dynamic Adaptation of Replication Strategies for Reliable Agents. In: Proc. 2nd Symposium on Adaptive Agents and Multi-Agent Systems (AAMAS-2), London, UK, April (2002)
Bertier, M., Marin, O., Sens, P.: Implementation and performance evaluation of an adaptable failure detector. In: Proc. of the International Conference on Dependable Systems and Networks, Washington, DC, USA (2002)
Bertier, M., Marin, O., Sens, P.: Performance analysis of hierarchical failure detector. To be published in Proc. of the International Conference on Dependable Systems and Networks, San Francisco, CA, USA, June (2003)
Chandra, T.D., Toueg, S.: Unreliable Failure Detectors for Reliable Distributed Systems. Journal of the ACM 43(2), 225–267 (1996)
Fedoruk, A., Deters, R.: Improving Fault-Tolerance by Replicating Agents. In: Proceedings of 1st International Joint Conference on Autonomous Agents and Multi-Agent Systems, Bologna, Italy, July (2002)
Guessoum, Z., Briot, J.-P.: From active objects to autonomous agents. Special Series on Actors and Agents, edited by Dennis Kafura and Jean-Pierre Briot, IEEE Concurrency 7(3), 68–76 (1999)
Guessoum, Z., Faci, N., Briot, J-P.: Adaptive Replication of Large-Scale Multi-Agent Systems - Towards a Fault-Tolerant Multi-Agent Platform. In: Garcia, A., Choren, R., Lucena, C., Giorgini, P., Holvoet, T., Romanovsky, A. (eds.) Software Engineering for Multi-Agent Systems IV. LNCS, vol. 3914, Springer, Heidelberg (2006)
Gutknecht, O., Ferber, J.: The MadKit agent platform architecture. In: Wagner, T.A., Rana, O.F. (eds.) Infrastructure for Agents, Multi-Agent Systems, and Scalable Multi-Agent Systems. LNCS (LNAI), vol. 1887, Springer, Heidelberg (2001)
Guerraoui, R., Schiper, A.: Software-Based Replication For Fault Tolerance. IEEE Computer 30(4), 68–74 (1997)
Hägg, S.: A Sentinel Approach to Fault Handling in Multi-Agent Systems. In: Dickson, L., Zhang, C. (eds.) Multi-Agent Systems Methodologies and Applications. LNCS, vol. 1286, Springer, Heidelberg (1997)
Hoare, C.A.R.: Communicating Sequential Processes. Prentice-Hall, Englewood Cliffs (1985)
Kumar, S., Cohen, P.R., Levesque, H.J.: The Adaptive AgentArchitecture: Achieving Fault-Tolerance Using Persistent Broker Teams. In: 4th International Conference on Multi-Agent Systems (ICMAS 2000), Boston MA, USA, July (2000)
Kalbarczyk, Z., Iyer, R.K., Bagchi, S., Whisnant, K.: Chameleon: A Software Infrastructure for Adaptive Fault Tolerance. IEEE Transactions on Parallel and Distributed Systems 10(6), 560–579 (1999)
Milojicic, D., et al.: MASIF: The OMG Mobile Agent System Interoperability Facility. In: Rothermel, K., Hohl, F. (eds.) MA 1998. LNCS, vol. 1477, pp. 50–67. Springer, Heidelberg (1998)
Martin, D., Cheyer, A., Moran, D.: The Open Agent Architecture: A Framework for Building Distributed Software Systems. Applied Artificial Intelligence 13(1-2), 91–128 (1999)
Marin, O., Sens, P., Briot, J.-P., Guessoum, Z.: Towards Adaptive Fault-Tolerance for Distributed Multi-Agent Systems. In: Proceedings of ERSADS’2001, Bertinoro, Italy, May, pp. 195–201 (2001)
Suri, N., et al.: An Overview of the NOMADS Mobile Agent System. In: Bertino, E. (ed.) ECOOP 2000. LNCS, vol. 1850, Springer, Heidelberg (2000)
Pleisch, S., Schiper, A.: Fatomas - a fault-tolerant mobile agent system based on the agent-dependent approach. In: Proc. of the IEEE Int. Conf. on Dependable Systems and Networks, Goteborg, Sweden, July 2001, IEEE Computer Society Press, Los Alamitos (2001)
van Renesse, R., Birman, K., Maffeis, S.: Horus: A flexible group communication system. Communications of the ACM 39(4), 76–83 (1996)
Strasser, M., Baumann, J., Schwehm, M.: An Agent-based Framework for the Transparent Distribution of Computations. In: Arabnia, H. (ed.) Proc. of PDPTA’1999, vol. I, Las Vegas, USA, pp. 376–382 (1999)
Silva, L., Batista, V., Silva, J.: Fault-tolerant execution of mobile agents. In: Proc. of the International Conference on Dependable Systems and Networks, New York, June, pp. 135–143 (2000)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Marin, O., Bertier, M., Sens, P., Guessoum, Z., Briot, JP. (2007). DARX - A Self-healing Framework for Agents. In: Kordon, F., Sztipanovits, J. (eds) Reliable Systems on Unreliable Networked Platforms. Monterey Workshop 2005. Lecture Notes in Computer Science, vol 4322. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71156-8_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-71156-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71155-1
Online ISBN: 978-3-540-71156-8
eBook Packages: Computer ScienceComputer Science (R0)