Skip to main content

Concurrent Modular Q-Learning with Local Rewards on Linked Multi-Component Robotic Systems

  • Conference paper
Foundations on Natural and Artificial Computation (IWINAC 2011)

Abstract

Applying conventional Q-Learning to Multi-Component Robotic Systems (MCRS) increasing the number of components produces an exponential growth of state storage requirements. Modular approaches limit the state size growth to be polynomial on the number of components, allowing more manageable state representation and manipulation. In this article, we advance on previous works on a modular Q-learning approach to learn the distributed control of a Linked MCRS. We have chosen a paradigmatic application of this kind of systems using only local rewards: a set of robots carrying a hose from some initial configuration to a desired goal. The hose dynamics are simplified to be a distance constraint on the robots positions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Busoniu, L., Babuska, R., Schutter, B.D.: A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and ReviewsĀ 38(2), 156ā€“172 (2008)

    ArticleĀ  Google ScholarĀ 

  2. Duro, R.J., GraƱa, M., de Lope, J.: On the potential contributions of hybrid intelligent approaches to multicomponen robotic system development. Information SciencesĀ 180(14), 2635ā€“2648 (2010)

    ArticleĀ  Google ScholarĀ 

  3. Echegoyen, Z., Villaverde, I., Moreno, R., GraƱa, M., dā€™Anjou, A.: Linked multi-component mobile robots: modeling, simulation and control. Robotics and Autonomous SystemsĀ 58(12), 1292ā€“1305 (2010)

    ArticleĀ  Google ScholarĀ 

  4. Fernandez-Gauna, B., GraƱa, M., Lopez-Guede, J.M.: Towards concurrent q-learning on linked multi-component robotic systems. In: HAIS 2011. LNCS (LNAI), Springer, Heidelberg (2011) (in press)

    Google ScholarĀ 

  5. Fernandez-Gauna, B., Lopez-Guede, J.M., Zulueta, E., GraƱa, M.: Learning hose transport control with Q-learning. Neural Network WorldĀ 20(7), 913ā€“923 (2010)

    Google ScholarĀ 

  6. Fernandez-Gauna, B., Lopez-Guede, J.M., Zulueta, E.: Linked multicomponent robotic systems: Basic assessment of linking element dynamical effect. In: Corchado, E., GraƱa, M., Savio, A. (eds.) Hybrid Artificial Intelligence Systems, Part I, vol.Ā 6076, pp. 73ā€“79. Springer, Heidelberg (2010)

    ChapterĀ  Google ScholarĀ 

  7. Fernandez-Gauna, B., Lopez-Guede, J.M., Zulueta, E., Echegoyen, Z., GraƱa, M.: Basic results and experiments on robotic multi-agent system for hose deployment and transportation. International Journal of Artificial IntelligenceĀ 6(S11), 183ā€“202 (2011)

    Google ScholarĀ 

  8. GraƱa, M., Torrealdea, F.J.: Hierarchically structured systems. European Journal of Operational ResearchĀ 25, 20ā€“26 (1986)

    ArticleĀ  Google ScholarĀ 

  9. Maravall, D., de Lope, J., Martin, J.A.: Hybridizing evolutionary computation and reinforcement learning for the design of almost universal controllers for autonomous robots. NeurocomputingĀ 72(4-6), 887ā€“894 (2009)

    ArticleĀ  Google ScholarĀ 

  10. Panait, L., Luke, S.: Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent SystemsĀ 11(3), 387ā€“434 (2005)

    ArticleĀ  Google ScholarĀ 

  11. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)

    Google ScholarĀ 

  12. Whitehead, S., Karlsson, J., Tenenberg, J.: Learning multiple goal behavior via task decomposition and dynamic policy merging. In: Robot Learning, pp. 45ā€“78. Kluwer Academic Publisher, Dordrecht (1993)

    ChapterĀ  Google ScholarĀ 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fernandez-Gauna, B., Lopez-Guede, J.M., GraƱa, M. (2011). Concurrent Modular Q-Learning with Local Rewards on Linked Multi-Component Robotic Systems. In: FerrƔndez, J.M., Ɓlvarez SƔnchez, J.R., de la Paz, F., Toledo, F.J. (eds) Foundations on Natural and Artificial Computation. IWINAC 2011. Lecture Notes in Computer Science, vol 6686. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21344-1_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-21344-1_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-21343-4

  • Online ISBN: 978-3-642-21344-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics