Abstract
We generalise the problem of inverse reinforcement learning to multiple tasks, learned from multiple demonstrations. Each demonstration may represent one expert trying to solve a different task, or different experts trying to solve the same task. Our main contribution is to formalise the problem as statistical preference elicitation via a number of structured priors, whose form captures our biases about the relatedness of different tasks or expert policies. In doing so, we introduce a prior on policy optimality, which is more natural to specify. We show that our framework allows us not only to learn efficiently from multiple experts, but also to effectively differentiate between the goals of each. Possible applications include analysing the intrinsic motivations of subjects in behavioural experiments and learning from multiple teachers.
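To make the setup concrete, the following is a minimal sketch of Bayesian multitask inverse reinforcement learning, not the authors' implementation. It assumes a small finite MDP, a Boltzmann (softmax) policy as a stand-in for the paper's prior on policy optimality, and a simple hierarchical Gaussian prior over per-task rewards as a stand-in for the structured priors; inference is by random-walk Metropolis-Hastings. All names and parameters (`beta`, `gamma`, `posterior_samples`, etc.) are illustrative assumptions.

```python
# Sketch: hierarchical Bayesian multitask IRL on a random finite MDP.
# Each task m has reward r_m ~ N(mu, I); the shared mean mu ~ N(0, I)
# couples the tasks. Demonstrations are scored under a softmax policy.
import numpy as np

rng = np.random.default_rng(0)

n_states, n_actions = 5, 2
gamma = 0.9   # discount factor (illustrative)
beta = 5.0    # softmax temperature: larger = closer to optimal behaviour

# Fixed random transition kernel: P[s, a] is a distribution over next states.
P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))

def q_values(reward):
    """Value iteration for state-only rewards; returns Q[s, a]."""
    V = np.zeros(n_states)
    for _ in range(200):
        Q = reward[:, None] + gamma * P @ V
        V = Q.max(axis=1)
    return Q

def log_likelihood(demo, reward):
    """Log-probability of observed (state, action) pairs under a
    Boltzmann-optimal policy for the given reward."""
    logits = beta * q_values(reward)
    logp = logits - np.logaddexp.reduce(logits, axis=1, keepdims=True)
    return sum(logp[s, a] for s, a in demo)

def posterior_samples(demos, n_iter=2000, step=0.1):
    """Metropolis-Hastings over (mu, r_1, ..., r_M).
    demos: one list of (state, action) pairs per expert/task."""
    M = len(demos)
    mu = np.zeros(n_states)
    r = [np.zeros(n_states) for _ in range(M)]

    def log_post(mu, r):
        lp = -0.5 * mu @ mu                          # hyperprior on mu
        for m in range(M):
            lp += -0.5 * (r[m] - mu) @ (r[m] - mu)   # task-level prior
            lp += log_likelihood(demos[m], r[m])     # demonstration data
        return lp

    lp, samples = log_post(mu, r), []
    for _ in range(n_iter):
        mu_new = mu + step * rng.standard_normal(n_states)
        r_new = [rm + step * rng.standard_normal(n_states) for rm in r]
        lp_new = log_post(mu_new, r_new)
        if np.log(rng.random()) < lp_new - lp:       # symmetric proposal
            mu, r, lp = mu_new, r_new, lp_new
        samples.append([rm.copy() for rm in r])
    return samples

# Usage: two experts' trajectories; compare their posterior mean rewards
# to differentiate their goals.
demos = [[(0, 1), (2, 0), (4, 1)], [(1, 0), (3, 1)]]
samples = posterior_samples(demos, n_iter=500)
r_hat = np.mean(samples[250:], axis=0)  # shape (n_tasks, n_states)
```

The hierarchical prior is the key design choice: the shared mean `mu` lets evidence from one expert inform the reward inferred for another, while the per-task rewards `r_m` remain free to diverge when the demonstrations genuinely pursue different goals.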
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
Cite this paper
Dimitrakakis, C., Rothkopf, C.A. (2012). Bayesian Multitask Inverse Reinforcement Learning. In: Sanner, S., Hutter, M. (eds.) Recent Advances in Reinforcement Learning. EWRL 2011. Lecture Notes in Computer Science, vol. 7188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29946-9_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29945-2
Online ISBN: 978-3-642-29946-9