ART-Based Neuro-fuzzy Modelling Applied to Reinforcement Learning

Zikidis, Konstantinos C.; Tzafestas, Spyros G.

doi:10.1007/978-3-540-45226-3_4

Konstantinos C. Zikidis⁹ &
Spyros G. Tzafestas⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2774))

Included in the following conference series:

International Conference on Knowledge-Based and Intelligent Information and Engineering Systems

962 Accesses

Abstract

The mountain car problem is a well-known task, often used for testing reinforcement learning algorithms. It is a problem with real valued state variables, which means that some kind of function approximation is required. In this paper, three reinforcement learning architectures are compared on the mountain car problem. Comparison results are presented, indicating the potentials of the actor-only approach. The function approximation modules used are based on NeuroFAST ( Neuro- Fuzzy ART-Based Structure and Parameter Learning TSK Model). NeuroFAST is a neuro-fuzzy modelling algorithm, with well-proven function approximation capabilities, and features the functional reasoning method (the Takagi-Sugeno-Kang fuzzy model), Fuzzy ART concepts and specific techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baird, L.C.: Reinforcement learning in continuous time: Advantage updating. In: Proc. of IEEE Intl. Conf. on Neural Networks (ICNN 1994), Orlando, Florida (1994)
Google Scholar
Baird, L.C.: Residual algorithms: Reinforcement learning with function approximation. In: Prieditis, A., Russell, S. (eds.) Proc. of 12th Intl. Conf. on Mach. Learn., pp. 30–37. Morgan Kaufmann, San Francisco (1995)
Google Scholar
Baxter, J., Bartlett, P.L.: Reinforcement learning in POMDP’s via direct gradient ascent. In: Proc. of 17th Intl. Conf. on Machine Learning, Stanford, CA (2000)
Google Scholar
Box, G.E.P., Jenkins, G.M.: Time Series Analysis, Forecasting, and Control. Holden Day, San Francisco (1970)
Google Scholar
Boyan, J.A., Moore, A.W.: Generalization in reinforcement learning: Safely approximating the value functions. In: Tesauro, G., Touretzky, D. (eds.) Advances in Neural Information Processing Systems (Proc. of 1994 Conf.), San Mateo, CA, pp. 369–376. Morgan Kaufmann, San Mateo (1995)
Google Scholar
Carpenter, G.A., Grossberg, S., Rosen, G.A.: Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system. Neural Networks 4, 759–771 (1991)
Article Google Scholar
Carpenter, G.A., Grossberg, S., Markuzon, N., Reynolds, J.H., Rosen, D.B.: Fuzzy ARTMAP: A neural architecture for incremental supervised learning of analog multidimensional maps. IEEE Trans. on Neural Networks 3(5), 698–712 (1992)
Article Google Scholar
Gordon, G.: Stable function approximation in dynamic programming. In: Proc. of 14th Intl. Conf. on Machine Learning, Nashville, TN (1997)
Google Scholar
Kimura, H., Kobayashi, S.: An analysis of actor/critic algorithms using eligibility traces: reinforcement learning with imperfect value functions. In: Proc. of 15th Intl. Conf. on Mach. Learn., Madison, Wisconsin, pp. 278–286 (1998)
Google Scholar
Konda, V.R., Tsitsiklis, J.N.: Actor critic algorithms. In: Advances in Neural Information Processing Systems (Proc. of the 1999 conference), Cambridge, MA, vol. 12. MIT Press, Cambridge (1999)
Google Scholar
Lin, C.-J., Lee, C.S.G.: Neural Fuzzy Systems: a Neuro-Fuzzy Synergism to Intelligent Systems. Prentice Hall, Englewood Cliffs (1996)
Google Scholar
Mackey, M.C., Glass, L.: Oscillation and chaos in physiological control systems. Science 197, 287–289 (1977)
Article Google Scholar
Mamdani, E.H., Assilian, S.: Applications of fuzzy algorithms for control of simple dynamic plant. Proc. Inst. Elec. Eng. 121, 1585–1588 (1974)
Article Google Scholar
Moore, A.W., Atkeson, C.G.: The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces. Machine Learning 21, 1–36 (1995)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement learning: An introduction. MIT Press, Cambridge (1998)
Google Scholar
Sutton, R.S., McAllester, D., Singh, S., Mansour, Y.: Policy gradient methods for reinforcement learning with function approximation. In: Advances in Neural Information Processing Systems, Cambridge, MA, vol. 12, pp. 1057–1063. MIT Press, Cambridge (1999)
Google Scholar
Takagi, T., Sugeno, M.: Fuzzy identification of systems and its application to modeling and control. IEEE Trans on Syst., Man, Cybern. 15, 116–132 (1985)
MATH Google Scholar
Tzafestas, S.G., Zikidis, K.C.: NeuroFAST: On-line neuro-fuzzy ART-based structure and parameter learning TSK model. IEEE Trans on Syst., Man, Cybern. 31(5), 797–802 (2001)
Article Google Scholar
Tzafestas, S.G., Zikidis, K.C.: High Accuracy Neuro – Fuzzy Modeling. In: IEEE Int. Conf. on Artificial Intelligence Systems (IEEE ICAIS 2002), Divnomorskoe, Russia (2002)
Google Scholar
Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8, 229–256 (1992)
MATH Google Scholar
Zikidis, K.C., Tzafestas, S.G.: Adaptive neuro-fuzzy modeling applied to policy gradient reinforcement learning. In: 5th Hellenic European Conference on Computer Mathematics & its Applications (HERCMA 2001), Athens, Greece (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Intelligent Robotics and Automation Laboratory, Nat. Tech. University of Athens, Zografou Campus, Zografou, 15773, Athens, Greece
Konstantinos C. Zikidis & Spyros G. Tzafestas

Authors

Konstantinos C. Zikidis
View author publications
You can also search for this author in PubMed Google Scholar
Spyros G. Tzafestas
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computing Laboratory, Oxford University, Parks Road, OXI 3QD, Oxford, United Kingdom
Vasile Palade
Centre for SMART Systems, School of Environment and Technology, University of Brighton, BN2 4GJ, Brighton, UK
Robert J. Howlett
Knowledge-Based Intelligent Engineering Systems Centre, University of South Australia, Mawson Lakes, SA 5095, Adelaide, Australia
Lakhmi Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zikidis, K.C., Tzafestas, S.G. (2003). ART-Based Neuro-fuzzy Modelling Applied to Reinforcement Learning. In: Palade, V., Howlett, R.J., Jain, L. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2003. Lecture Notes in Computer Science(), vol 2774. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45226-3_4

Download citation

DOI: https://doi.org/10.1007/978-3-540-45226-3_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40804-8
Online ISBN: 978-3-540-45226-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics