Abstract
Applying conventional Q-Learning to Multi-Component Robotic Systems (MCRS) increasing the number of components produces an exponential growth of state storage requirements. Modular approaches limit the state size growth to be polynomial on the number of components, allowing more manageable state representation and manipulation. In this article, we advance on previous works on a modular Q-learning approach to learn the distributed control of a Linked MCRS. We have chosen a paradigmatic application of this kind of systems using only local rewards: a set of robots carrying a hose from some initial configuration to a desired goal. The hose dynamics are simplified to be a distance constraint on the robots positions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Busoniu, L., Babuska, R., Schutter, B.D.: A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and ReviewsĀ 38(2), 156ā172 (2008)
Duro, R.J., GraƱa, M., de Lope, J.: On the potential contributions of hybrid intelligent approaches to multicomponen robotic system development. Information SciencesĀ 180(14), 2635ā2648 (2010)
Echegoyen, Z., Villaverde, I., Moreno, R., GraƱa, M., dāAnjou, A.: Linked multi-component mobile robots: modeling, simulation and control. Robotics and Autonomous SystemsĀ 58(12), 1292ā1305 (2010)
Fernandez-Gauna, B., GraƱa, M., Lopez-Guede, J.M.: Towards concurrent q-learning on linked multi-component robotic systems. In: HAIS 2011. LNCS (LNAI), Springer, Heidelberg (2011) (in press)
Fernandez-Gauna, B., Lopez-Guede, J.M., Zulueta, E., GraƱa, M.: Learning hose transport control with Q-learning. Neural Network WorldĀ 20(7), 913ā923 (2010)
Fernandez-Gauna, B., Lopez-Guede, J.M., Zulueta, E.: Linked multicomponent robotic systems: Basic assessment of linking element dynamical effect. In: Corchado, E., GraƱa, M., Savio, A. (eds.) Hybrid Artificial Intelligence Systems, Part I, vol.Ā 6076, pp. 73ā79. Springer, Heidelberg (2010)
Fernandez-Gauna, B., Lopez-Guede, J.M., Zulueta, E., Echegoyen, Z., GraƱa, M.: Basic results and experiments on robotic multi-agent system for hose deployment and transportation. International Journal of Artificial IntelligenceĀ 6(S11), 183ā202 (2011)
GraƱa, M., Torrealdea, F.J.: Hierarchically structured systems. European Journal of Operational ResearchĀ 25, 20ā26 (1986)
Maravall, D., de Lope, J., Martin, J.A.: Hybridizing evolutionary computation and reinforcement learning for the design of almost universal controllers for autonomous robots. NeurocomputingĀ 72(4-6), 887ā894 (2009)
Panait, L., Luke, S.: Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent SystemsĀ 11(3), 387ā434 (2005)
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Whitehead, S., Karlsson, J., Tenenberg, J.: Learning multiple goal behavior via task decomposition and dynamic policy merging. In: Robot Learning, pp. 45ā78. Kluwer Academic Publisher, Dordrecht (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fernandez-Gauna, B., Lopez-Guede, J.M., GraƱa, M. (2011). Concurrent Modular Q-Learning with Local Rewards on Linked Multi-Component Robotic Systems. In: FerrĆ”ndez, J.M., Ćlvarez SĆ”nchez, J.R., de la Paz, F., Toledo, F.J. (eds) Foundations on Natural and Artificial Computation. IWINAC 2011. Lecture Notes in Computer Science, vol 6686. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21344-1_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-21344-1_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21343-4
Online ISBN: 978-3-642-21344-1
eBook Packages: Computer ScienceComputer Science (R0)