
An Extremely Simple Reinforcement Learning Rule for Neural Networks

  • Conference paper

Advances in Neural Networks – ISNN 2007 (ISNN 2007)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4491)

Abstract

In this paper we derive a simple reinforcement learning rule based on a more general form of the REINFORCE formulation. We test the new rule on both classification and reinforcement learning problems. The results show that although this simple learning rule has a high probability of becoming stuck in local optima on classification tasks, it can solve some global reinforcement learning problems (e.g., the cart-pole balancing problem) directly in continuous space.
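The paper's specific simplified rule is not reproduced on this page, but the REINFORCE family it builds on can be illustrated with Williams' classic update for a single stochastic (Bernoulli-logistic) unit, Δw = η · r · (a − p) · x. The sketch below is an assumption-laden toy example, not the authors' rule: a lone unit learns, from a scalar reward alone, to fire when its input is positive.

```python
import numpy as np

# Hedged sketch: a Williams-style REINFORCE update for one
# Bernoulli-logistic unit, w += eta * r * (a - p) * x.
# This illustrates the general REINFORCE family the paper extends,
# NOT the paper's own simplified rule. The toy task and all
# hyperparameters (eta, step count) are illustrative assumptions.

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

w = np.zeros(2)   # one input weight + one bias weight
eta = 0.5         # learning rate (assumed)

for step in range(2000):
    x = np.array([rng.choice([-1.0, 1.0]), 1.0])  # input + bias term
    p = sigmoid(w @ x)            # firing probability P(a = 1 | x)
    a = float(rng.random() < p)   # stochastic binary action
    r = 1.0 if a == (x[0] > 0) else 0.0  # reward: fire iff x[0] > 0
    w += eta * r * (a - p) * x    # REINFORCE gradient step

# After training, the unit fires mostly for positive inputs:
p_pos = sigmoid(w @ np.array([1.0, 1.0]))
p_neg = sigmoid(w @ np.array([-1.0, 1.0]))
```

Note that the weight update uses only the scalar reward r, never a supervised target; this is what lets REINFORCE-style rules tackle global reinforcement problems such as cart-pole balancing, where no per-step correct action is available.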




Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ma, X. (2007). An Extremely Simple Reinforcement Learning Rule for Neural Networks. In: Liu, D., Fei, S., Hou, Z.-G., Zhang, H., Sun, C. (eds.) Advances in Neural Networks – ISNN 2007. Lecture Notes in Computer Science, vol. 4491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72383-7_51


  • DOI: https://doi.org/10.1007/978-3-540-72383-7_51

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-72382-0

  • Online ISBN: 978-3-540-72383-7

