Abstract
The paper reviews current research results integrating machine learning and agent technologies. Although complementary solutions from both fields are discussed the focus is on using agent technology in the field of machine learning with a particular interest on applying agent-based solutions to supervised learning. The paper contains a short review of applications, in which machine learning methods have been used to support agent learning capabilities. This is followed by a corresponding review of machine learning methods and tools in which agent technology plays an important role. Final part gives a more detailed description of some example machine learning models and solutions where the asynchronous team of agents paradigm has been implemented to support the machine learning methods and which have been developed by the author and his research group.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Abraham, A., Jain, R., Thomas, J., Han, S.Y.: D-SCIDS: Distributed soft computing intrusion detection system. J. Net.Comp. Appl. 30, 81–98 (2007)
Albashiri, K.A., Coenen, F., Leng, P.: EMADS: An Extendible Multi-agent Data Miner. Knowl. Bas. Syst. 22, 523–528 (2009)
Arevian, G., Wermter, S., Panchev, C.: Symbolic state transducers and recurrent neural preference machines for text mining. Int. J. Approx. Reason. 32, 237–258 (2003)
Bacardit, J., Butz, M.V.: Data mining in learning classifier systems: comparing xcs with gassist. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003. LNCS (LNAI), vol. 4399, pp. 282–290. Springer, Heidelberg (2007)
Bacardit, J., Garrell, J.M.: Bloat Control and Generalization Pressure Using the Minimum Description Length Principle for a Pittsburgh Approach Learning Classifier System. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003. LNCS (LNAI), vol. 4399, pp. 59–79. Springer, Heidelberg (2007)
Bacardit, J., Krasnogor, N.: Empirical Evaluation of Ensemble Techniques for a Pittsburgh Learning Classifier System. In: Bacardit, J., Bernadó-Mansilla, E., Butz, M.V., Kovacs, T., Llorà, X., Takadama, K. (eds.) IWLCS 2006 and IWLCS 2007. LNCS (LNAI), vol. 4998, pp. 255–268. Springer, Heidelberg (2008)
Barbucha, D., Czarnowski, I., Jedrzejowicz, P., Ratajczak-Ropel, E., Wierzbowska, I.: An Implementation of the JADE-base A-Team Environment. Int. Trans.Syst. Sc.Appl. 3(4), 319–328 (2008)
Boylu, F., Aytug, H., Koehler, G.J.: Principal–Agent Learning. Dec. Supp. Syst. 47, 75–81 (2009)
Bull, L., Kovacs, T.: Foundations of Learning Classifier Systems: An Introduction. Stud. Fuzz. Soft Comp. 183, 1–17 (2005)
Busniu, L., Babuska, R., Schutter, B.: A comprehensive survey of multiagent reinforcement learning. IEEE Trans. Syst. Man Cyb. 38, 156–171 (2008)
Czarnowski, I.: Distributed data reduction through agent collaboration. In: Håkansson, A., Nguyen, N.T., Hartung, R.L., Howlett, R.J., Jain, L.C. (eds.) KES-AMSTA 2009. LNCS, vol. 5559, pp. 724–733. Springer, Heidelberg (2009)
Czarnowski, I.: Prototype Selection Algorithms for Distributed Learning. Pat. Recogn. 43, 2292–2300 (2010)
Czarnowski, I.: Distributed learning with data reduction. LNCS Transactions on Collective Computational Intelligence IV. Springer, Heidelberg (to appear, 2011)
Czarnowski, I., Jedrzejowicz, P.: An Approach to Instance Reduction in Supervised Learning. In: Coenen, F., Preece, A., Macintosh, A. (eds.) Research and Development in Intelligent Systems XX, pp. 267–282. Springer, London (2004)
Czarnowski, I., Jędrzejowicz, P.: An Agent-Based PLA for the Cascade Correlation Learning Architecture. In: Duch, W., Kacprzyk, J., Oja, E., Zadrożny, S. (eds.) ICANN 2005. LNCS, vol. 3697, pp. 197–202. Springer, Heidelberg (2005)
Czarnowski, I., Jedrzejowicz, P.: An Agent-based Approach to ANN Training. Knowl.-Based Syst. 19, 304–308 (2006)
Czarnowski, I., Jędrzejowicz, P.: An Agent-Based Approach to the Multiple-Objective Selection of Reference Vectors. In: Perner, P. (ed.) MLDM 2007. LNCS (LNAI), vol. 4571, pp. 117–130. Springer, Heidelberg (2007)
Czarnowski, I., Jedrzejowicz, P.: An agent-based algorithm for data Reduction. In: Bramer, M., Coenen, F., Petridis, M. (eds.) Research and Development of Intelligent Systems XXIV, pp. 351–356. Springer, London (2007)
Czarnowski, I., Jędrzejowicz, P.: A Comparison Study of Strategies for Combining Classifiers from Distributed Data Sources. In: Kolehmainen, M., Toivanen, P., Beliczynski, B. (eds.) ICANNGA 2009. LNCS, vol. 5495, pp. 609–618. Springer, Heidelberg (2009)
Czarnowski, I., Jedrzejowicz, P.: An Approach to Data Reduction and Integrated Machine Classification. New Gen. Comp. 28, 21–40 (2010)
Czarnowski, I., Jedrzejowicz, P.: An agent-based framework for distributed Learning. Eng. Appl. Art. Intel. 24, 93–102 (2011)
Czarnowski, I., Jedrzejowicz, P., Wierzbowska, I.: An A-Team Approach to Learning Classifiers from Distributed Data Sources. Int. J. Intel. Inf. Db. Syst. 4(3), 245–263 (2010)
Fan, W., Gordon, M., Pathak, P.: An integrated two-stage model for intelligent information routing. Dec. Sup. Syst. 42(1), 362–374 (2006)
Gifford, C.M.: Collective Machine Learning: Team Learning and Classification in Multi-Agent Systems. Ph.D. dissertation, University of Kansas (2009)
Gifford, C.M., Agah, A.: Collaborative multi-agent rock facies classification from wireline well log data. Eng. Appl. Art. Intel. 23, 1158–1172 (2010)
Hofmann, T., Basilico, J.: Collaborative Machine Learning. In: Hemmje, M., Niederée, C., Risse, T. (eds.) From Integrated Publication and Information Systems to Information and Knowledge Environments. LNCS, vol. 3379, pp. 173–182. Springer, Heidelberg (2005)
Hoenl, P.J., Tuyls, K.: Analyzing Multi-agent Reinforcement Learning Using Evolutionary Dynamics. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 168–179. Springer, Heidelberg (2004)
Holland, J.H.: Escaping Brittleness: The possibilities of General-Purpose Learning Algorithms Applied to Parallel Rule-Based Systems. In: Michalski, R.S., Carbonell, J.G., Mitchell, T.M. (eds.) Machine Learning, an Artificial Intelligence Approach, vol. II, pp. 593–623. Morgan Kaufmann, Palo Alto (1986)
Ishiwaka, Y., Sato, T., Kakazu, Y.: An approach to the pursuit problem on a heterogeneous multiagent system using reinforcement learning. Rob. Autonom. Syst. 43, 245–256 (2003)
Jansen, W.A.: Intrusion detection with mobile agents. Comp. Comm. 25, 1392–1401 (2002)
Jędrzejowicz, P.: A-Teams and Their Applications. In: Nguyen, N.T., Kowalczyk, R., Chen, S.-M. (eds.) ICCCI 2009. LNCS, vol. 5796, pp. 36–50. Springer, Heidelberg (2009)
Jędrzejowicz, J., Jędrzejowicz, P.: A Family of GEP-Induced Ensemble Classifiers. In: Nguyen, N.T., Kowalczyk, R., Chen, S.-M. (eds.) ICCCI 2009. LNCS, vol. 5796, pp. 641–652. Springer, Heidelberg (2009)
Jędrzejowicz, J., Jędrzejowicz, P.: Two Ensemble Classifiers Constructed from GEP-Induced Expression Trees. In: Jędrzejowicz, P., Nguyen, N.T., Howlet, R.J., Jain, L.C. (eds.) KES-AMSTA 2010. LNCS, vol. 6071, pp. 200–209. Springer, Heidelberg (2010)
Jennings, N., Sycara, K., Wooldridge, M.: A roadmap of agent research and development. Aut. Ag. Multi-Ag. Syst. 1, 7–38 (1998)
Jiang, C., Sheng, Z.: Case-based reinforcement learning for dynamic inventory control in a multi-agent supply-chain system. Exp. Syst. Appl. 36, 6520–6526 (2009)
Kitakoshi, D., Shioya, H., Nakano, R.: Empirical analysis of an on-line adaptive system using a mixture of Bayesian networks. Inf. Sc. 180, 2856–2874 (2010)
Klusch, M., Lodi, S., Moro, G.L.: Agent-Based Distributed Data Mining: The KDEC Scheme. In: Klusch, M., Bergamaschi, S., Edwards, P., Petta, P. (eds.) Intelligent Information Agents. LNCS (LNAI), vol. 2586, pp. 104–122. Springer, Heidelberg (2003)
Liau, C.J.: Belief, information acquisition, and trust in multi-agent systems - A modal logic formulation. Artif. Int. 149(1), 31–60 (2003)
Loizos, M.: Partial observability and learnability. Artif. Int. 174, 639–669 (2010)
Luo, J., Wang, M., Hu, J., Shi, Z.: Distributed data mining on Agent Grid: Issues, platform and development toolkit. Fut. Gen. Comp. Syst. 23, 61–68 (2007)
Mannor, S., Shamma, J.S.: Multi-agent learning for engineers. Artif. Int. 171, 417–422 (2007)
Masoumi, B., Meybodi, M.R.: Speeding up learning automata based multi agent systems using the concepts of stigmergy and entropy. Exp. Syst. Appl. (to appear, 2011)
Moskovitch, R., Elovici, Y., Rokach, L.: Detection of unknown computer worms based on behavioral classification of the host. Comp. Stat. Data Anal. 52, 4544–4566 (2008)
Negatu, A., D’Mello, S.K., Franklin, S.: Cognitively Inspired Anticipatory Adaptation and Associated Learning Mechanisms for Autonomous Agents. In: Butz, M.V., Sigaud, O., Pezzulo, G., Baldassarre, G. (eds.) ABiALS 2006. LNCS (LNAI), vol. 4520, pp. 108–127. Springer, Heidelberg (2007)
Nowé, A., Verbeeck, K., Peeters, M.: Learning automata as a basis for multi agent reinforcement learning. In: Tuyls, K., Hoen, P.J., Verbeeck, K., Sen, S. (eds.) LAMAS 2005. LNCS (LNAI), vol. 3898, pp. 71–85. Springer, Heidelberg (2006)
Pazzani, M., Billsus, D.: Learning and revising user profiles: the identification of interesting web sites. Mach. Learn. 27(3), 313–331 (1997)
Preux, P., Delepoulle, S., Darcheville, J.-C.: A generic architecture for adaptive agents based on reinforcement learning. Inf. Sc. 161, 37–55 (2004)
Prodromidis, A., Chan, P.K., Stolfos, S.J.: Meta-learning in Distributed Data Mining Systems: Issues and Approaches. In: Kargupta, H., Chan, P. (eds.) Advances in Distributed and Parallel Knowledge Discovery, vol. 3, AAAI/MIT Press, Menlo Park (2000)
Quteishat, A., Lim, C.P., Tweedale, J., Jain, L.C.: A neural network-based multi-agent classifier system. Neurocomp. 72, 1639–1647 (2009)
Raicevic, P.: Parallel reinforcement learning using multiple reward signals. Neurocomp. 69, 2171–2179 (2006)
Rosaci, D.: CILIOS: Connectionist inductive learning and inter-ontology similarities for recommending information agents. Inf. Sys. 32, 793–825 (2007)
Sardinha, J.A.R.P., Garcia, A., de Lucena, C.J.P., Milidiú, R.L.: A Systematic Approach for Including Machine Learning in Multi-agent Systems. In: Bresciani, P., Giorgini, P., Henderson-Sellers, B., Low, G., Winikoff, M. (eds.) AOIS 2004. LNCS (LNAI), vol. 3508, pp. 198–211. Springer, Heidelberg (2005)
Shoham, Y., Powers, R., Grenager, T.: If multi-agent learning is the answer, what is the question? Artif. Int. 171(7), 365–377 (2007)
Sian, S.: Extending Learning to Multiple Agents: Issues and a Model for Multi-Agent Machine Learning (Ma-Ml). In: Kodratoff, Y. (ed.) EWSL 1991. LNCS, vol. 482, pp. 440–456. Springer, Heidelberg (1991)
Smith, R.E., Jiang, M.K., Bacardit, J., Stout, M., Krasnogor, N., Hirst, J.D.: A learning classifier system with mutual-information-based fitness. Evol. Int. 1(3), 31–50 (2010)
Stolfo, S., Prodromidis, A.L., Tselepis, S., Lee, W., Fan, D.W.: JAM: Java Agents for Meta-learning over Distributed Databases. In: 3rd International Conference on Knowledge Discovery and Data Mining, pp. 74–81. AAAI Press, Newport Beach (1997)
Sutton, R.S., Barto, A.G.: Reinforcement Learning. An Introduction. MIT Press, Cambridge (1998)
Symeonidis, A.L., Chatzidimitriou, K.C., Athanasiadis, I.N., Mitkas, P.A.: Data mining for agent reasoning: A synergy for training intelligent agents. Eng. Appl. Artif. Int. 20, 1097–1111 (2007)
Takadama, K., Inoue, H., Shimohara, K., Okada, M., Katai, O.: Agent architecture based on an interactive self-reflection classifier system. Artif. Life Rob. 5, 103–108 (2001)
Talukdar, S., Baerentzen, L., Gove, A., De Souza, P.: Asynchronous Teams: Cooperation Schemes for Autonomous Agents. J. Heur. 4(4), 295–321 (1998)
Tozicka, J., Rovatsos, M., Pechoucek, M., Urban, U.: MALEF: Framework for Distributed Machine Learning and Data Mining. Int. J. Int. Inf. Db. Sys. 2(1), 6–24 (2008)
Tweedale, J., Ichalkaranje, N., Sioutis, C., Jarvis, B., Consoli, A., Phillips-Wren, G.E.: Innovations in multi-agent systems. J. Net. Comp. Appl. 30(3), 1089–1115 (2007)
Wang, Y.-C., Usher, J.M.: Application of reinforcement learning for agent-based production scheduling. Eng. Appl. Artif. Int 18, 73–82 (2005)
Wilson, S.W.: State of XCS Classifier System Research. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 1999. LNCS (LNAI), vol. 1813, pp. 63–81. Springer, Heidelberg (2000)
Yu, L., Yue, W., Wang, S., Lai, K.K.: Support vector machine based multiagent ensemble learning for credit risk evaluation. Exp. Sys. Appl. 37, 1351–1360 (2010)
Zhang, W.-R., Zhang, L.: A multiagent data warehousing (MADWH) and multiagent data mining (MADM) approach to brain modeling and neurofuzzy control. Inf. Sc 167, 109–127 (2004)
Zhang, S., Wu, X., Zhang, C.: Multi-Database Mining. IEEE Computational Intelligence Bulletin 2(1), 5–13 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jędrzejowicz, P. (2011). Machine Learning and Agents. In: O’Shea, J., Nguyen, N.T., Crockett, K., Howlett, R.J., Jain, L.C. (eds) Agent and Multi-Agent Systems: Technologies and Applications. KES-AMSTA 2011. Lecture Notes in Computer Science(), vol 6682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22000-5_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-22000-5_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21999-3
Online ISBN: 978-3-642-22000-5
eBook Packages: Computer ScienceComputer Science (R0)