An Empirical Study of Actor-Critic Methods for Feedback Controllers of Ball-Screw Drivers

Fernandez-Gauna, Borja; Ansoategui, Igor; Etxeberria-Agiriano, Ismael; Graña, Manuel

doi:10.1007/978-3-642-38622-0_46

Borja Fernandez-Gauna¹⁸,
Igor Ansoategui¹⁸,
Ismael Etxeberria-Agiriano¹⁸ &
…
Manuel Graña¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7931))

Included in the following conference series:

International Work-Conference on the Interplay Between Natural and Artificial Computation

1627 Accesses
2 Citations

Abstract

In this paper we study the use of Reinforcement Learning Actor-Critic methods to learn the control of a ball-screw feed drive. We have tested three different actors: Q-value based, Policy Gradient and CACLA actors. We have paid special attention to the sensibility to suboptimal learning gain tuning. As a benchmark, we have used randomly-initialized PID controllers. CACLA provides an stable control comparable to the best heuristically tuned PID controller, despite its lack of knowledge of the actual error value.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Koren, Y., Lo, C.C.: Advanced controllers for feed drives. In: Annals of the CIRP, vol. 41 (1992)
Google Scholar
Srinivasan, K., Tsao, T.C.: Machine feed drives and their control - a survey of the state of the art. Journal of Manufacturing Science and Engineering 119, 743–748 (1997)
Article Google Scholar
Sutton, R., Barto, A.G.: Reinforcement Learning I: Introduction. MIT Press (1998)
Google Scholar
Chen, J.-S., Huang, Y.-K., Cheng, C.C.: Mechanical model and contouring analysis of high-speed ball-screw drive systems with compliance effect. Int. J. Adv. Manuf. Technol. 24, 241–250 (2004)
Google Scholar
Hasselt, H.: Reinforcement Learning in Continuous State and Action Spaces. In: Reinforcement Learning: State of the art. Adaptation, Learning, and Optimization, pp. 207–251. Springer (2012)
Google Scholar
van Hasselt, H., Wiering, M.A.: Reinforcement learning in continuous action spaces. In: Proceedings of the 2007 IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning (2007)
Google Scholar
Busoniu, L., Babuska, R., De Schutter, B., Ernst, D.: Reinforcement Learning and Dynamic Programming using Function Approximation. CRC Press (2010)
Google Scholar
Hafner, R., Riedmiller, M.: Reinforcement learning in feedback control: Challenges and benchmarks from technical process control. Machine Learning 84(1-2), 137–169 (2011)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Grupo de Inteligencia Computacional (GIC), Universidad del País Vasco (UPV/EHU), San Sebastian, Spain
Borja Fernandez-Gauna, Igor Ansoategui, Ismael Etxeberria-Agiriano & Manuel Graña

Authors

Borja Fernandez-Gauna
View author publications
You can also search for this author in PubMed Google Scholar
Igor Ansoategui
View author publications
You can also search for this author in PubMed Google Scholar
Ismael Etxeberria-Agiriano
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Graña
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departamento de Electrónica, Tecnología de Computadoras y Proyectos, Universidad Politécnica de Cartagena, Pl. Hospital, 1, 30201, Cartagena, Spain
José Manuel Ferrández Vicente & Fco. Javier Toledo Moreo &
Departamento de Inteligencia Artificial, Universidad Nacional de Educación a Distancia, E.T.S. de Ingenería Informática, Juan del Rosal, 16, 28040, Madrid, Spain
José Ramón Álvarez Sánchez & Félix de la Paz López &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fernandez-Gauna, B., Ansoategui, I., Etxeberria-Agiriano, I., Graña, M. (2013). An Empirical Study of Actor-Critic Methods for Feedback Controllers of Ball-Screw Drivers. In: Ferrández Vicente, J.M., Álvarez Sánchez, J.R., de la Paz López, F., Toledo Moreo, F.J. (eds) Natural and Artificial Computation in Engineering and Medical Applications. IWINAC 2013. Lecture Notes in Computer Science, vol 7931. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38622-0_46

Download citation

DOI: https://doi.org/10.1007/978-3-642-38622-0_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38621-3
Online ISBN: 978-3-642-38622-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics