Abstract
In this paper, walking motion learning of quadrupedal walking robot is realized by the Profit Sharing that can learn deterministic policy for POMDPs environments. In this research, we used the Profit Sharing that can learn deterministic policy for POMDPs environments which can obtain the deterministic policy by using the history of observations. We carried out a series of experiments using quadrupedal walking robot, and confirmed that walking motion learning can be realized by the Profit Sharing that can learn deterministic policy for POMDPs environments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. The MIT Press (1998)
Takamori, Y., Osana, Y.: Profit sharing that can learn deterministic policy for POMDPs environments. In: Proceedings of IEEE International Conference on System, Man and Cybernetics, Anchorage (2011)
Saito, K., Masuda, S.: Profit Sharing Introducing the Judgement of Incomplete Perception. Transactions of the Japanese Society for Artificial Intelligence 19(5), 379–388 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Morino, Y., Osana, Y. (2014). Walking Motion Learning of Quadrupedal Walking Robot by Profit Sharing That Can Learn Deterministic Policy for POMDPs Environments. In: Dick, G., et al. Simulated Evolution and Learning. SEAL 2014. Lecture Notes in Computer Science, vol 8886. Springer, Cham. https://doi.org/10.1007/978-3-319-13563-2_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-13563-2_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13562-5
Online ISBN: 978-3-319-13563-2
eBook Packages: Computer ScienceComputer Science (R0)