Reinforcement Learning in Single Robot Hose Transport Task: A Physical Proof of Concept

Lopez-Guede, Jose Manuel; Estévez, Julián; Graña, Manuel

doi:10.1007/978-3-319-19719-7_26

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 368)

Abstract

This paper addresses the physical realization of proof-of-concept experiments that demonstrate the suitability of controllers learned by means of Reinforcement Learning (RL) techniques for tasks involving Linked Multi-Component Robotic Systems (LMCRS). We deal with the task of transporting a hose by a single robot as a prototypical example of LMCRS, which can be extended to much more complex tasks. We describe how the complete system has been designed and built, explaining its main components: the RL controller, the communications, and the monitoring system. A previously learned RL controller has been tested on a concrete problem with a specific state space model and discretization step. This physical realization validates our previously published works, which were carried out through computer simulations, and gives a strong argument in favor of the suitability of RL techniques for dealing with real LMCRS.
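
To make the idea of executing a previously learned controller over a discretized state space more concrete, the following is a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the state variables, the action set, or the learning algorithm, so the grid-based `discretize` function, the four-direction `ACTIONS` tuple, the tabular `q_table`, and all numeric values are assumptions chosen purely for illustration.

```python
import math

# Hypothetical action set; the actual actions are not given in the abstract.
ACTIONS = ("north", "south", "east", "west")


def discretize(x, y, step):
    """Map a continuous position to integer grid-cell indices.

    `step` plays the role of the discretization step: a smaller value
    gives a finer grid and therefore a larger state space.
    """
    return (math.floor(x / step), math.floor(y / step))


def greedy_action(q_table, state):
    """Return the action with the highest learned value for `state`.

    State-action pairs missing from the table default to 0.0.
    """
    return max(ACTIONS, key=lambda a: q_table.get((state, a), 0.0))


# Toy table with made-up values, only to exercise the lookup.
q_table = {
    ((2, 1), "east"): 0.9,
    ((2, 1), "north"): 0.4,
}

state = discretize(1.3, 0.7, step=0.5)   # falls in grid cell (2, 1)
action = greedy_action(q_table, state)   # "east" has the highest value here
```

In a physical setup of the kind the abstract describes, the position would presumably come from the monitoring system and the chosen action would be sent to the robot over the communications link; both of those steps are outside the scope of this sketch.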

Acknowledgments

The research was supported by the Computational Intelligence Group of the Basque Country University (UPV/EHU) through Grant IT874-13 of Research Groups Call 2013-2017 (Basque Country Government).

Author information

Authors and Affiliations

Computational Intelligence Group of the Basque Country University (UPV/EHU), San Sebastian, Spain
Jose Manuel Lopez-Guede, Julián Estévez & Manuel Graña

Corresponding author

Correspondence to Jose Manuel Lopez-Guede.

Editor information

Editors and Affiliations

Department of Civil Engineering, University of Burgos, Burgos, Spain
Álvaro Herrero
C/ López Bravo 70, Pol. Ind. Villalonquejar, Technological Institute of Castilla y León, Burgos, Spain
Javier Sedano
Department of Civil Engineering, University of Burgos, Burgos, Spain
Bruno Baruque
University of Salamanca, Salamanca, Spain
Héctor Quintián
University of Salamanca, Salamanca, Spain
Emilio Corchado

About this paper

Cite this paper

Lopez-Guede, J.M., Estévez, J., Graña, M. (2015). Reinforcement Learning in Single Robot Hose Transport Task: A Physical Proof of Concept. In: Herrero, Á., Sedano, J., Baruque, B., Quintián, H., Corchado, E. (eds) 10th International Conference on Soft Computing Models in Industrial and Environmental Applications. Advances in Intelligent Systems and Computing, vol 368. Springer, Cham. https://doi.org/10.1007/978-3-319-19719-7_26

DOI: https://doi.org/10.1007/978-3-319-19719-7_26
Published: 27 May 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19718-0
Online ISBN: 978-3-319-19719-7
eBook Packages: Engineering, Engineering (R0)
