Abstract
Reinforcement learning applications are hampered by the tabula rasa approach taken by existing techniques. Transfer for reinforcement learning tackles this problem by enabling the reuse of previously learned behaviours. To be fully autonomous, a transfer agent has to: (1) automatically choose relevant source task(s) for a given target task, (2) learn how the tasks are related, and (3) transfer knowledge between the tasks effectively and efficiently. Currently, most transfer frameworks require substantial human intervention in at least one of these three steps. This discussion paper aims at: (1) positioning various knowledge-reuse algorithms as forms of transfer, and (2) arguing for the validity and feasibility of autonomous transfer by detailing potential solutions to the above three steps.
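To make the three steps concrete, the following is a minimal, hypothetical sketch of such a pipeline in generic Python; every name in it (autonomous_transfer, similarity, learn_mapping, transfer, learn, source_library) is an illustrative placeholder and not an interface taken from the paper.

# Hypothetical sketch of a fully autonomous transfer pipeline.
# All names below are illustrative placeholders, not the paper's API.
from typing import Any, Callable, List


def autonomous_transfer(
    source_library: List[Any],                 # previously solved tasks
    target_task: Any,                          # new task to be learned
    similarity: Callable[[Any, Any], float],   # step 1: task-similarity measure
    learn_mapping: Callable[[Any, Any], Any],  # step 2: inter-task mapping learner
    transfer: Callable[[Any, Any], Any],       # step 3: knowledge-transfer routine
    learn: Callable[[Any, Any], Any],          # standard RL on the target task
) -> Any:
    """Three-step transfer: select a source, relate it to the target, reuse it."""
    # (1) automatically choose the most relevant source task
    source_task = max(source_library, key=lambda s: similarity(s, target_task))

    # (2) learn the relation (inter-task mapping) between source and target
    mapping = learn_mapping(source_task, target_task)

    # (3) transfer the source knowledge and continue learning on the target
    initial_knowledge = transfer(source_task, mapping)
    return learn(target_task, initial_knowledge)

Step (1) is deliberately reduced to a single similarity maximisation here; in practice the selection criterion is itself one of the things that must be automated, which is exactly the issue the three steps above raise.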



Notes
In policy iteration algorithms, for example, the policy space is the space of all policies that can be learnt; it can be described by a combination of basis functions and parameterisations spanning different policies.
Such a setting is typical in continuous reinforcement learning. The reasons relate to: (1) the representation of the Q-function, and (2) the representations of the state and action spaces.
A typical criterion is to maximise the expected value of the total discounted pay-off signal; a standard formalisation is sketched after these notes.
Typically, n₂ ≪ n₁, since only a few transitions are available from the target task.
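The first and third notes above admit a standard textbook formalisation; the display below is a generic statement (with one common choice of parameterisation, a Gibbs/softmax policy over linear features φ with parameters θ, rewards r_{t+1} and discount factor γ), not an equation quoted from the paper:

\[
  \Pi \;=\; \bigl\{ \pi_{\theta} \;:\; \pi_{\theta}(a \mid s) \propto \exp\bigl(\theta^{\top}\phi(s,a)\bigr),\ \theta \in \mathbb{R}^{d} \bigr\},
  \qquad
  \theta^{\star} \in \arg\max_{\theta}\ \mathbb{E}\!\left[\,\sum_{t=0}^{\infty} \gamma^{t}\, r_{t+1} \;\middle|\; \pi_{\theta}\right],
  \quad 0 \le \gamma < 1 .
\]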
Cite this article
Bou Ammar, H., Chen, S., Tuyls, K. et al. Automated Transfer for Reinforcement Learning Tasks. Künstl Intell 28, 7–14 (2014). https://doi.org/10.1007/s13218-013-0286-8