Abstract
Learning Automata are stochastic decision-making machines that have been widely used in classification, control, and network routing, between others. Despite their versatility, one of the main drawbacks of these models is the low convergence rate of the learning rules used for the training. Estimator algorithms such as Pursuit schemes help to overcome this limitation, although they require a high computer memory cost for their operation. This fact becomes a serious inconvenient when a large set of learning automata collaborate in a team to solve a concrete task, since the memory requirements of these algorithms increases exponentially. In these cases, Pursuit algorithms are ineffective due to memory overflow.
In this work, we address this problem and we propose an estimator algorithm that can be used to train large teams of Learning Automata. The approach uses a similar strategy to Tabu Search algorithms to manage long and short term memory, in order to reduce the memory requirements. The method is applied in classic permutation problems as a test-bed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agache, M., Oommen, B.J.: Generalized pursuit learning schemes: new families of continuous and discretized learning automata. IEEE Transactions on Systems, Man, and Cybernetics, Part B 32(6), 738–749 (2002)
Baba, N., Mogami, Y.: A new learning algorithm for the hierarchical structure learning automata operating in the nonstationary s-model random environment. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 32(6), 750–758 (2002)
Baba, N., Mogami, Y.: A relative reward-strength algorithm for the hierarchical structure learning automata operating in the general nonstationary multiteacher environment. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 36(4), 781–794 (2006)
Beigy, H., Meybodi, M.R.: Utilizing distributed learning automata to solve stochastic shortest path problems. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 14(5), 591–615 (2006)
Glover, F., Laguna, M.: Tabu search, pp. 70–150 (1993)
Lukac, K., Lukac, Z., Tkalic, M.: Behaviour of f learning automata as multicriteria routing agents in connection oriented networks. In: The 12th IEEE International Conference on Fuzzy Systems, FUZZ 2003, vol. 1, pp. 296–301 (2003)
Narendra, K.S., Thathachar, M.A.L.: Learning automata: an introduction. Prentice-Hall, Inc., Upper Saddle River (1989)
Nowé, A., Verbeeck, K., Peeters, M.: Learning automata as a basis for multi agent reinforcement learning. In: Tuyls, K., ’t Hoen, P.J., Verbeeck, K., Sen, S. (eds.) LAMAS 2005. LNCS (LNAI), vol. 3898, pp. 71–85. Springer, Heidelberg (2006)
Oommen, B., Agache, M.: A comparison of continuous and discretized pursuit learning schemes. In: Proceedings of 1999 IEEE International Conference on Systems, Man, and Cybernetics, SMC 1999, vol. 4, pp. 1061–1067 (1999)
Oommen, B., Lanctot, J.: Discretized pursuit learning automata. IEEE Transactions on Systems, Man and Cybernetics 20(4), 931–938 (1990)
Oommen, B.J., Hashem, M.K.: Modeling a student-classroom interaction in a tutorial-like system using learning automata. Trans. Sys. Man Cyber. Part B 40(1), 29–42 (2010), http://dx.doi.org/10.1109/TSMCB.2009.2032414
Peeters, M., Verbeeck, K., Nowé, A.: Solving multi-stage games with hierarchical learning automata that bootstrap. In: Tuyls, K., Nowe, A., Guessoum, Z., Kudenko, D. (eds.) ALAMAS 2005, ALAMAS 2006, and ALAMAS 2007. LNCS (LNAI), vol. 4865, pp. 169–187. Springer, Heidelberg (2008)
Poznyak, A.S., Najim, K.: Learning Automata and Stochastic Optimization. Springer-Verlag New York, Inc., Secaucus (1997)
Sastry, P., Thathachar, M.: Learning automata algorithms for pattern classification. Sadhana 24(4-5), 261–292 (1999)
Thathachar, M.A.L., Arvind, M.T.: Parallel algorithms for modules of learning automata. IEEE Transactions on Systems, Man, and Cybernetics, Part B 28(1), 24–33 (1998)
Thathachar, M.A.L., Ramakrishnan, K.R.: A cooperative game of a pair of learning automata. Automatica 20(6), 797–801 (1984)
Thathachar, M.A.L., Sastry, P.S.: Varieties of learning automata: an overview. IEEE Transactions on Systems, Man, and Cybernetics, Part B 32(6), 711–722 (2002)
Torkestani, J., Meybodi, M.: Graph coloring problem based on learning automata. In: International Conference on Information Management and Engineering, ICIME 2009, pp. 718–722 (2009)
Torkestani, J., Meybodi, M.: Solving the minimum spanning tree problem in stochastic graphs using learning automata. In: International Conference on Information Management and Engineering, ICIME 2009, pp. 643–647 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cuéllar, M.P., Ros, M., Delgado, M., Vila, A. (2011). An Estimator Update Scheme for Large Teams of Learning Automata. In: Abraham, A., Corchado, J.M., González, S.R., De Paz Santana, J.F. (eds) International Symposium on Distributed Computing and Artificial Intelligence. Advances in Intelligent and Soft Computing, vol 91. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19934-9_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-19934-9_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19933-2
Online ISBN: 978-3-642-19934-9
eBook Packages: EngineeringEngineering (R0)