Skip to main content

Knowledge Modeling by ELM in RL for SRHT Problem

  • Conference paper
  • First Online:
Hybrid Artificial Intelligent Systems (HAIS 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9648))

Included in the following conference series:

  • 2090 Accesses

Abstract

Single Robot Hose Transport (SRHT) is a limit case of Linked Multicomponent Robotic Systems (L-MCRS), when one robot moves the tip of a hose to a desired position, while the other hose extreme is attached to a source position. Reinforcement Learning (RL) algorithms have been applied to learn autonomously the robot control with success. However, RL algorithms produce large and intractable data structures. This paper addresses the problem by learning an Extreme Learning Machine (ELM) from the state-action value Q-table, obtaining very relevant data reduction. In this paper we evaluate empirically a classification strategy to formulate ELM learning to provide approximations to the Q-table, obtaining very promising results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Duro, R., GraƱa, M., de Lope, J.: On the potential contributions of hybrid intelligent approaches to multicomponen robotic system development. Inf. Sci. 180(14), 2635ā€“2648 (2010)

    ArticleĀ  Google ScholarĀ 

  2. LĆ³pez-Guede, J.M., GraƱa, M., Zulueta, E.: On distributed cooperative control for the manipulation of a hose by a multirobot system. In: Corchado, E., Abraham, A., Pedrycz, W. (eds.) HAIS 2008. LNCS (LNAI), vol. 5271, pp. 673ā€“679. Springer, Heidelberg (2008)

    ChapterĀ  Google ScholarĀ 

  3. Fernandez-Gauna, B., Lopez-Guede, J., Zulueta, E., GraƱa, M.: Learning hose transport control with Q-Learning. Neural Netw. World 20(7), 913ā€“923 (2010)

    Google ScholarĀ 

  4. Fernandez-Gauna, B., Lopez-Guede, J.M., Zulueta, E., Echegoyen, Z., GraƱa, M.: Basic results and experiments on robotic multi-agent system for hose deployment and transportation. Int. J. Artif. Intell. 6(S11), 183ā€“202 (2011)

    Google ScholarĀ 

  5. Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)

    Google ScholarĀ 

  6. Watkins, C., Dayan, P.: Q-Learning. Mach. Learn. 8(3ā€“4), 279ā€“292 (1992)

    MATHĀ  Google ScholarĀ 

  7. Lopez-Guede, J.M., Fernandez-Gauna, B., GraƱa, M., Zulueta, E.: Improving the control of single robot hose transport. Cybern. Syst. 43(4), 261ā€“275 (2012)

    ArticleĀ  Google ScholarĀ 

  8. Bellman, R.: A Markovian decision process. Indiana Univ. Math. J. 6, 679ā€“684 (1957)

    ArticleĀ  MathSciNetĀ  MATHĀ  Google ScholarĀ 

  9. Tijms, H.: Discrete Time Markov Decision Processes, pp. 233ā€“277. Wiley, Hoboken (2004)

    Google ScholarĀ 

  10. Watkins, C.: Learning from delayed rewards. In: Ph.D. dissertation, University of Cambridge, England (1989)

    Google ScholarĀ 

  11. Watkins, C., Dayan, P.: Technical note: Q-Learning. Mach. Learn. 8, 279ā€“292 (1992)

    MATHĀ  Google ScholarĀ 

  12. Huang, D.H.W.G.-B., Lan, Y.: Extreme learning machines: a survey. Int. J. Mach. Learn. Cybern. 2(2), 107ā€“122 (2011)

    ArticleĀ  MathSciNetĀ  Google ScholarĀ 

  13. Huang, G.-B., Zhu, Q.-Y., Siew, C.-K.: Extreme learning machine: theory and applications. Neurocomputing 70(1ā€“3), 489ā€“501 (2006)

    ArticleĀ  Google ScholarĀ 

  14. Widrow, B., Lehr, M.: 30 years of adaptive neural networks: perceptron, madaline, and backpropagation. Proc. IEEE 78(9), 1415ā€“1442 (1990)

    ArticleĀ  Google ScholarĀ 

  15. Lopez-Guede, J.M., GraƱa, M., Zulueta, E., Barambones, O.: Economical implementation of control loops for multi-robot systems. In: Kƶppen, M., Kasabov, N., Coghill, G. (eds.) ICONIP 2008, Part I. LNCS, vol. 5506, pp. 1053ā€“1059. Springer, Heidelberg (2009). Revised Selected Papers, Part I

    ChapterĀ  Google ScholarĀ 

  16. Lopez-Guede, J.M., Fernandez-Gauna, B., GraƱa, M., Zulueta, E.: Multi-robot systems control implementation. In: Robot Learning, pp. 137ā€“150. SCIYO (2010)

    Google ScholarĀ 

Download references

Acknowledgements

The research was supported by the Computational Intelligence Group, funded by the Basque Government with grant IT874-13.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jose Manuel Lopez-Guede .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Lopez-Guede, J.M., Garmendia, A., GraƱa, M. (2016). Knowledge Modeling by ELM in RL for SRHT Problem. In: Martƭnez-Ɓlvarez, F., Troncoso, A., QuintiƔn, H., Corchado, E. (eds) Hybrid Artificial Intelligent Systems. HAIS 2016. Lecture Notes in Computer Science(), vol 9648. Springer, Cham. https://doi.org/10.1007/978-3-319-32034-2_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-32034-2_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-32033-5

  • Online ISBN: 978-3-319-32034-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics