Parallel Reinforcement Learning for Weighted Multi-criteria Model with Adaptive Margin

Hiraoka, Kazuyuki; Yoshida, Manabu; Mishima, Taketoshi

doi:10.1007/978-3-540-69158-7_51

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4984)

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

This paper studies reinforcement learning (RL) for a linear family of tasks. The key point of our discussion is that the optimal solution is nonlinear even though the task family is linear, so the optimal policy cannot be obtained by a naive approach. Although an algorithm exists that computes, for all tasks at once, a result equivalent to running Q-learning on each task separately, it suffers from an explosion of set sizes. We introduce adaptive margins to overcome this difficulty.
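
The nonlinearity claim can be illustrated with a toy case. The sketch below is not taken from the paper: it assumes that "linear family of tasks" means the scalar reward is a weighted sum of several criteria, and it uses a hypothetical one-state MDP with two actions whose reward vectors are (1, 0) and (0, 1). The function name `optimal_value`, the discount factor, and the number of sweeps are all illustrative choices.

```python
def optimal_value(weight, rewards, gamma=0.5, sweeps=200):
    """Value iteration on a hypothetical one-state MDP.

    Every action loops back to the same state, so the Bellman
    optimality update reduces to a single scalar recursion.
    `weight` turns each reward vector into a scalar reward.
    """
    v = 0.0
    for _ in range(sweeps):
        v = max(
            sum(w * r for w, r in zip(weight, r_vec)) + gamma * v
            for r_vec in rewards
        )
    return v


# Assumed toy task: two criteria, one action favouring each.
REWARDS = [(1.0, 0.0), (0.0, 1.0)]

v_first = optimal_value((1.0, 0.0), REWARDS)   # only criterion 1 counts
v_second = optimal_value((0.0, 1.0), REWARDS)  # only criterion 2 counts
v_mixed = optimal_value((0.5, 0.5), REWARDS)   # equal weights

# Naive guess: interpolate the two extreme solutions linearly.
interpolated = 0.5 * v_first + 0.5 * v_second

print(v_first, v_second, v_mixed, interpolated)
```

In this toy case the two extreme tasks each have an optimal value of about 2, while the equally weighted task has an optimal value of about 1, so linear interpolation between the extreme solutions overestimates it. The rewards depend linearly on the weight, but the optimal value is a maximum over actions of linear functions and is therefore only piecewise linear.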

Author information

Authors and Affiliations

Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Japan
Kazuyuki Hiraoka, Manabu Yoshida & Taketoshi Mishima

Editor information

Masumi Ishikawa, Kenji Doya, Hiroyuki Miyamoto, Takeshi Yamakawa

About this paper

Cite this paper

Hiraoka, K., Yoshida, M., Mishima, T. (2008). Parallel Reinforcement Learning for Weighted Multi-criteria Model with Adaptive Margin. In: Ishikawa, M., Doya, K., Miyamoto, H., Yamakawa, T. (eds) Neural Information Processing. ICONIP 2007. Lecture Notes in Computer Science, vol 4984. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69158-7_51

DOI: https://doi.org/10.1007/978-3-540-69158-7_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69154-9
Online ISBN: 978-3-540-69158-7
eBook Packages: Computer Science, Computer Science (R0)
