Interactive Q-Learning on heterogeneous agents system for autonomous adaptive interface

Ishiwaka, Yuko; Yokoi, Hiroshi; Kakazu, Yukinori

doi:10.1007/978-4-431-65941-9_47

Interactive Q-Learning on heterogeneous agents system for autonomous adaptive interface

Yuko Ishiwaka⁵,
Hiroshi Yokoi⁶ &
Yukinori Kakazu⁶

Conference paper

591 Accesses

Abstract

Purpose of this system is to adapt the bedridden people who cannot move their body easily, so the simple reinforcement signals are applied. The application is to control the behaviors of Khepera robot, which is a small mobile robot. For the simple reinforcement signals the on-off signals are employed when the operators as the training agent feels discomfort for the behaviors of the learning agent Khepera robot. We proposed the new reinforcement learning method called Interactive Q-learning and the heterogeneous multi agent system. Our multi agent system has three kinds of heterogeneous single agent: Learning agent, Training agent and Interface Agent. The system is hierarchic. There are also three hierarchies. It is impossible to iterate the many episodes and steps to converge the learning which is adopted in general reinforcement learning in simulation world. We show the results of experiments using the Khepera robot for 3 examinees, and discuss how to give the rewards according to each operator and the significance of heterogeneous multi agent system. We confirmed the effectiveness through the some experiments which are to control the behavior of Khepera robot in real world. The convergences of our learning system are quite quick. Furthermore the importance of the interface agent is indicated. The individual differences for the timing to give the penalties are happened even though all operators are young.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Watkins C.J.C.H. (1992). Q-Learning, Machine Learning 8 p279
MATH Google Scholar
Sutton, R.S. and A.G. Barto (1998). Reinforcement Learning An Introduction MIT
Google Scholar
Lee, C and Xu, Y (1996) Online, Interactive Learning of Gestures for Human/Robot Interfaces, In Proceedings, IEEE international Conference on Robotics and Automation, vol.4, pp.2982–2987, MN
Google Scholar
Yu, W, H. Yokoi and D. Nishikawa (1998). Adaptive Electromyograohic (EMG) Prosthetic and Control Using Reinforcement Learning, IAS-5JOS Press, pp.266–271
Google Scholar
Ishiwaka, Y. H. Yokoi, and Kakazu, Y.(2000) Adaptive Learning Interface Used Physiological signals, Proceedings SMC 2000 Conference. Nashville, USA, pp. 32–38
Google Scholar
Bradtke, S.J and Duff, M.O.(1994) Reinforcement Learning Method for Continuous Time Markov Decision Problems, Advances in Neural Information Processing Systems 7,pp.393–400
Google Scholar
Parr, R. and Russell, S.(1995) Approximating Optimal Policies for Partially Observable Stochastic Domains, In Proceedings of the International Conference on Artificial Intelligence,pp. 1088–1094,Morgan Kaufmann
Google Scholar
Nehmzow U. and McGonigle B.: “Achieving Rapid Adaptations in Robots by Means of External Tuition”, SAB, 1994
Google Scholar
A. Cesta and D. D’Aloisi.:”Building Interfaces as Personal Agents”, Sigchi Bulletin, vol.3, 1996
Google Scholar
Wiering M. and Schmidhuber J.: “HQ-Learning”, Adaptive Behavior vol 6. No.2, 1997
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. Information Engineering, Hakodate National College of Technology, 14-l, Tokuracho, Hakodate, 042-8501, Japan
Yuko Ishiwaka
Complex System Engineering Dept., Hokkaido University, North-13, West-8, Sapporo, 060-8628, Japan
Hiroshi Yokoi & Yukinori Kakazu

Authors

Yuko Ishiwaka
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Yokoi
View author publications
You can also search for this author in PubMed Google Scholar
Yukinori Kakazu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Advanced Engineering Center, RIKEN (The Institute of Physical and Chemical Research), 2-1 Hirosawa, 351-0198, Wako-shi, Saitama, Japan
Hajime Asama (Head of Instrumentation Project Promotion Division) (Head of Instrumentation Project Promotion Division)
Department of Precision Machinery Engineering, The Unviersity of Tokyo, 7-3-1 Hongo, 113-8656, Bunkyo-ku, Tokyo, Japan
Tamio Arai (Professor) (Professor)
Department of Micro System Engineering, Nagoya University, 464-8603, Furo-cho, Chikusa-ku, Nagoya, Japan
Toshio Fukuda (Professor) (Professor)
Department of Intelligent Systems Graduate School of Information and Electrical Engineering, Kyushu University, 6-10-1 Hakozaki, 812-8581, Higaschi-ku, Fukuoka, Japan
Tsutomu Hasegawa (Professor) (Professor)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ishiwaka, Y., Yokoi, H., Kakazu, Y. (2002). Interactive Q-Learning on heterogeneous agents system for autonomous adaptive interface. In: Asama, H., Arai, T., Fukuda, T., Hasegawa, T. (eds) Distributed Autonomous Robotic Systems 5. Springer, Tokyo. https://doi.org/10.1007/978-4-431-65941-9_47

Download citation

DOI: https://doi.org/10.1007/978-4-431-65941-9_47
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-65943-3
Online ISBN: 978-4-431-65941-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics