Abstract
Self-healing, one of the four key properties characterizing Autonomic Systems, aims to enable large-scale software systems delivering complex services on a 24/7 basis to meet their goals without any human intervention. Achieving self-healing requires the elicitation and maintenance of domain knowledge in the form of 〈service failure diagnosis, remediation strategy〉 patterns, a task which can be overwhelming. Case-Based Reasoning (CBR) is a lazy learning paradigm that largely reduces this kind of knowledge acquisition bottleneck. Moreover, the application of CBR for failure diagnosis and remediation in software systems appears to be very suitable, as in this domain most errors are re-occurrences of known problems. In this paper, we describe a CBR approach for providing large-scale, distributed software systems with self-healing capabilities, and demonstrate the practical applicability of our methodology by means of some experimental results on a real world application.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
The Moodle project web site (accessed on January15, 2006), http://www.moodle.org
Avizienis, A., Laprie, J., Randell, B., Landwehr, C.: Basic Concepts and Taxonomy of Dependable and Secure Computing. IEEE Transactions on Dependable and Secure Computing 1(1) (January-March 2004)
Aamodt, A., Plaza, E.: Case-based reasoning: foundational issues, methodological variations and systems approaches. AI Communications 7, 39–59 (1994)
Aghassi, D.S.: Evaluating case-based reasoning for heart failure diagnosis. Technical report, September of EECS. MIT, Cambridge, MA (1990)
Aha, D., Daniels, J. (eds.): Proc. AAAI Workshop on CBR Integrations. AAAI Press, Menlo Park (1998)
Anglano, C., Montani, S.: Achieving self-healing in autonomic software systems: a case-based reasoning approach. In: Czap, H., Unland, R., Branki, C., Tianfield, H. (eds.) Proc. International Conference on Self-Organization and Adaptation of Multi-agent and Grid Systems (SOAS), Glasgow, pp. 267–281. IOS Press, Amsterdam (2005)
Anglano, C., Montani, S.: Cavy: a tool for the deployment and operation of Self-Healing testbeds (January 2006) (submitted for publication)
Arshad, N., Heimbigner, D., Wolf, A.: A Planning Based Approach to Failure Recovery in Distributed Systems. In: Proc. of 2nd ACM Workshop on Self-Healing Systems (WOSS 2004), Newport Beach, CA, USA. ACM Press, New York (2004)
Bichindaritz, I., Kansu, E., Sullivan, K.M.: Case-based reasoning in CARE-PARTNER: Gathering evidence for evidence-based medical practice. In: Smyth, B., Cunningham, P. (eds.) EWCBR 1998. LNCS, vol. 1488, pp. 334–345. Springer, Heidelberg (1998)
Bonissone, P.P., Dutta, S.: Integrating case-based and rule-based reasoning: the possibilistic connection. In: Proc. of 6th Conference on Uncertainty in Artificial Intelligence, Cambridge, MA, USA (July 1990)
Branting, L.K., Porter, B.W.: Rules and precedents as complementary warrants. In: Proc. of 9th National Conference on Artificial Intelligence, Anaheim, CA, USA. AAAI Press, Menlo Park (1991)
Brewer, E.: Lessons from giant-scale services. IEEE Internet Computing 5(4) (2001)
Brodie, M., Ma, S., Lohman, G., Syeda-Mahmood, T., Mignet, L., Modani, N., Champlin, J., Sohn, P.: Quickly finding known software problems via automated symptom matching. In: Proc. of the 2nd International Conference on Autonomic Computing, Seattle, WA, USA (June 2005)
Freuder, E. (ed.): Proc. AAAI Spring Symposium on Multi-modal Reasoning. AAAI Press, Menlo Park (1998)
Ganek, A.G., Corbi, T.A.: The dawning of the autonomic computing era. IBM Systems Journal 42(1), 5–18 (2003)
Garlan, D., Schmerl, B.: Model-based Adaptation for Self-Healing Systems. In: Proc. of 1st ACM Workshop on Self-Healing Systems (WOSS 2002), Charleston, SC, USA. ACM Press, New York (2002)
Hammond, K.J.: Case-Based Planning: viewing planning as a memory task. Academic Press, London (1989)
Joshi, K.R., Hiltunen, M.A., Sanders, W.H., Schlichting, R.D.: Automatic Model- Driver Recovery in Distributed Systems. In: Proc. of 24th IEEE Symposium on Reliable Distributed Systems (SRDS 2005). IEEE Press, Los Alamitos (2005)
Kaiser, G., Parekh, J., Gross, P., Valetto, G.: Kenesthetics eXtreme: An External Infrastructure for Monitoring Distributed Legacy Systems. In: Proc. of 5th IEEE International Active Middleware Workshop, Seattle, WA, USA. IEEE CS Press, Los Alamitos (2003)
Kaiser, G., Parekh, J., Gross, P., Valetto, G.: Retrofitting Autonomic Capabilities onto Legacy Systems. Technical Report TR CUCS-026-03, Department of Computer Science, Columbia University (2003)
Kephart, J.O., Chess, D.M.: The vision of autonomic computing. IEEE Computer (January 2003)
Kolodner, J.L.: Case-Based Reasoning. Morgan Kaufmann, San Mateo (1993)
Littman, M., Nguyen, T., Hirsh, H.: Cost-Sensitive Fault Remediation for Autonomic Computing. In: Proc. of IJCAI Workshop on AI and Autonomic Computing: Developing a Research Agenda for Self-Managing Computer Systems, Acapulco, Mexico (August. 2003)
Macchion, D., Vo, D.: A hybrid knowledge-based system for technical diagnosis learning and assistance. In: Wess, S., Richter, M., Althoff, K.-D. (eds.) EWCBR 1993. LNCS, vol. 837, pp. 301–312. Springer, Heidelberg (1994)
Montani, S., Portinale, L.: Accounting for the temporal dimension in case-based retrieval: a framework for medical applications. Computational Intelligence (to appear)
Oppenheimer, D., Ganapathi, A., Patterson, D.: Why do Internet services fail, and what can be done about it? In: Proc. of 4th Usenix Symposium on Internet Technologies and Systems (USITS 2003), Seattle, WA, USA (March 2003)
Oppenheimer, D., Patterson, D.: Architecture and Dependability of Large-Scale Internet Services. IEEE Internet Computing, (September-October, 2002)
Portinale, L., Torasso, P., Magro, D.: Selecting most adaptable diagnostic solutions through pivoting-based retrieval. In: Leake, D.B., Plaza, E. (eds.) ICCBR 1997. LNCS, vol. 1266, pp. 393–402. Springer, Heidelberg (1997)
Rissland, E., Skalak, D.: Combining case-based and rule-based reasoning: A heuristic approach. In: Sridharan, N.S. (ed.) Proc. of 11th International Joint Conference on Artificial Intelligence, pp. 524–530 (1989)
Schaaf, J.W.: Fish and shrink. a next step towards efficient case retrieval in large-scale case bases. In: Smith, I., Faltings, B.V. (eds.) EWCBR 1996. LNCS, vol. 1168, pp. 362–376. Springer, Heidelberg (1996)
Schmidt, R., Montani, S., Bellazzi, R., Portinale, L., Gierl, L.: Case-based reasoning for medical knowledge-based systems. International Journal of Medical Informatics 64(2-3), 355–367 (2001)
Sterrit, R.: Autonomic networks: engineering the self-healing property. Engineering Applications of Artificial Intelligence 17, 727–739 (2004)
Surma, J., Vanhoof, K.: Integration rules and cases for the classification task. In: Veloso, M., Aamodt, A. (eds.) ICCBR 1995. LNCS, vol. 1010, pp. 325–334. Springer, Heidelberg (1995)
Wilson, D.R., Martinez, T.R.: Improved heterogeneous distance functions. Journal of Artificial Intelligence Research 6, 1–34 (1997)
Xu, L.D.: An integrated rule- and case-based approach to AIDS initial assessment. International Journal of Biomedical Computing 40, 197–207 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Montani, S., Anglano, C. (2006). Case-Based Reasoning for Autonomous Service Failure Diagnosis and Remediation in Software Systems. In: Roth-Berghofer, T.R., Göker, M.H., Güvenir, H.A. (eds) Advances in Case-Based Reasoning. ECCBR 2006. Lecture Notes in Computer Science(), vol 4106. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11805816_36
Download citation
DOI: https://doi.org/10.1007/11805816_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36843-4
Online ISBN: 978-3-540-36846-5
eBook Packages: Computer ScienceComputer Science (R0)