Article

Cross-domain transfer for reinforcement learning

Authors:

Matthew E. Taylor,

Peter StoneAuthors Info & Claims

ICML '07: Proceedings of the 24th international conference on Machine learning

Pages 879 - 886

https://doi.org/10.1145/1273496.1273607

Published: 20 June 2007 Publication History

Get Access

Abstract

A typical goal for transfer learning algorithms is to utilize knowledge gained in a source task to learn a target task faster. Recently introduced transfer methods in reinforcement learning settings have shown considerable promise, but they typically transfer between pairs of very similar tasks. This work introduces Rule Transfer, a transfer algorithm that first learns rules to summarize a source task policy and then leverages those rules to learn faster in a target task. This paper demonstrates that Rule Transfer can effectively speed up learning in Keepaway, a benchmark RL problem in the robot soccer domain, based on experience from source tasks in the gridworld domain. We empirically show, through the use of three distinct transfer metrics, that Rule Transfer is effective across these domains.

References

[1]

Cohen, W. W. (1995). Fast effective rule induction. International Conf. on Machine Learning (pp. 115--123).

Crossref

Google Scholar

[2]

Kuhlmann, G., Stone, P., Mooney, R., & Shavlik, J. (2004). Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer. The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems.

Google Scholar

[3]

Liu, Y., & Stone, P. (2006). Value-function-based transfer for reinforcement learning using structure mapping. Proc. of the 21st National Conference on Artificial Intelligence.

Digital Library

Google Scholar

[4]

Madden, M. G., & Howley, T. (2004). Transfer of experience between reinforcement learning environments with progressive difficulty. Artif. Intell. Rev., 21, 375--398.

Digital Library

Google Scholar

[5]

Rummery, G. A., & Niranjan, M. (1994). On-line Q-learning using connectionist systems Technical Report CUED/F-INFENGRT 116). Engineering Dept., Cambridge University.

Google Scholar

[6]

Singh, S. P., & Sutton, R. S. (1996). Reinforcement learning with replaceing eligibility traces. Machine Learning, 22, 123--158.

Digital Library

Google Scholar

[7]

Soni, V., & Singh, S. (2006). Using homomorphisms to transfer options across continuous reinforcement learning domains. Proc. of the 21st National Conference on Artificial Intelligence.

Digital Library

Google Scholar

[8]

Srinivasan, A. (2001). The aleph manual.

Google Scholar

[9]

Stone, P., Kuhlmann, G., Taylor, M. E., & Liu, Y. (2006). Keepaway soccer: From machine learning testbed to benchmark. In I. Noda, A. Jacoff, A. Bredenfeld and Y. Takahashi (Eds.), RoboCup-2005: Robot soccer world cup IX, vol. 4020, 93--105. Berlin: Springer Verlag.

Digital Library

Google Scholar

[10]

Sutton, R. S., & Barto, A. G. (1998). Introduction to reinforcement learning. MIT Press.

Digital Library

Google Scholar

[11]

Torrey, L., Shavlik, J. W., Walker, T., & Maclin, R. (2006). Skill acquisition via transfer learning and advice taking. ECML (pp. 425--436). Springer.

Digital Library

Google Scholar

[12]

Witten, I. H., & Frank, E. (2005). Data mining: Practical machine learning tools and techniques. Morgan Kaufmann.

Digital Library

Google Scholar

Cited By

View all

Askarizadeh MMorsali ANguyen K(2025)Resource-Constrained Multisource Instance-Based Transfer LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.332724836:1(1029-1043)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3327248
Damiani ALopez GManganini GMetelli ARestelli M(2024)Transfer Learning for Dynamical Systems Models via Autoencoders and GANs2024 American Control Conference (ACC)10.23919/ACC60939.2024.10644658(8-14)Online publication date: 10-Jul-2024
https://doi.org/10.23919/ACC60939.2024.10644658
Tappler MPferscher AAichernig BKönighofer BRoychoudhury APaiva AAbreu RStorey M(2024)Learning and Repair of Deep Reinforcement Learning Policies from Fuzz-Testing DataProceedings of the IEEE/ACM 46th International Conference on Software Engineering10.1145/3597503.3623311(1-13)Online publication date: 20-May-2024
https://dl.acm.org/doi/10.1145/3597503.3623311
Show More Cited By

Cross-domain transfer for reinforcement learning
1. Computing methodologies

Recommendations

Autonomous cross-domain knowledge transfer in lifelong policy gradient reinforcement learning
IJCAI'15: Proceedings of the 24th International Conference on Artificial Intelligence

Online multi-task learning is an important capability for lifelong learning agents, enabling them to acquire models for diverse tasks over time and rapidly learn new tasks by building upon prior experience. However, recent progress toward lifelong ...
Autonomous inter-task transfer in reinforcement learning domains
Relational transfer in reinforcement learning

Comments

Information & Contributors

Information

Published In

ICML '07: Proceedings of the 24th international conference on Machine learning

June 2007

1233 pages

ISBN:9781595937933

DOI:10.1145/1273496

Editor:
Zoubin Ghahramani
University of Cambridge, United Kingdom

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 June 2007

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Article

Conference

ICML '07 & ILP '07

Sponsor:

ICML '07 & ILP '07: The 24th Annual International Conference on Machine Learning held in conjunction with the 2007 International Conference on Inductive Logic Programming

June 20 - 24, 2007

Oregon, Corvalis, USA

Acceptance Rates

Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

107
Total Citations
View Citations
1,043
Total Downloads

Downloads (Last 12 months)69
Downloads (Last 6 weeks)5

Reflects downloads up to 28 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Askarizadeh MMorsali ANguyen K(2025)Resource-Constrained Multisource Instance-Based Transfer LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.332724836:1(1029-1043)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3327248
Damiani ALopez GManganini GMetelli ARestelli M(2024)Transfer Learning for Dynamical Systems Models via Autoencoders and GANs2024 American Control Conference (ACC)10.23919/ACC60939.2024.10644658(8-14)Online publication date: 10-Jul-2024
https://doi.org/10.23919/ACC60939.2024.10644658
Tappler MPferscher AAichernig BKönighofer BRoychoudhury APaiva AAbreu RStorey M(2024)Learning and Repair of Deep Reinforcement Learning Policies from Fuzz-Testing DataProceedings of the IEEE/ACM 46th International Conference on Software Engineering10.1145/3597503.3623311(1-13)Online publication date: 20-May-2024
https://dl.acm.org/doi/10.1145/3597503.3623311
Zhang GFeng LWang YLi MXie HTan K(2024)Reinforcement Learning With Adaptive Policy Gradient Transfer Across Heterogeneous ProblemsIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2024.33618608:3(2213-2227)Online publication date: Jun-2024
https://doi.org/10.1109/TETCI.2024.3361860
Serrano SMartinez-Carranza JSucar L(2024)Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic ReviewIEEE Access10.1109/ACCESS.2024.343555812(114552-114572)Online publication date: 2024
https://doi.org/10.1109/ACCESS.2024.3435558
Ghandi AShouraki SGholampour IKamranian ARiazati M(2024)Ex-RL: Experience-Based Reinforcement LearningInformation Sciences10.1016/j.ins.2024.121479(121479)Online publication date: Sep-2024
https://doi.org/10.1016/j.ins.2024.121479
Verginis CKoprulu CChinchali STopcu U(2024)Joint Learning of Reward Machines and Policies in Environments with Partially Known SemanticsArtificial Intelligence10.1016/j.artint.2024.104146(104146)Online publication date: May-2024
https://doi.org/10.1016/j.artint.2024.104146
Ni DSchwartz H(2024)Enhancing Learning Efficiency in FACL: A Novel Fuzzy Rule Transfer Method for Transfer LearningInternational Journal of Fuzzy Systems10.1007/s40815-023-01662-326:4(1215-1232)Online publication date: 17-Feb-2024
https://doi.org/10.1007/s40815-023-01662-3
Okudo TYamada S(2023)Learning Potential in Subgoal-Based Reward ShapingIEEE Access10.1109/ACCESS.2023.324626711(17116-17137)Online publication date: 2023
https://doi.org/10.1109/ACCESS.2023.3246267
Baker MNew AAguilar-Simon MAl-Halah ZArnold SBen-Iwhiwhu EBrna ABrooks EBrown RDaniels ZDaram ADelattre FDellana REaton EFu HGrauman KHostetler JIqbal SKent CKetz NKolouri SKonidaris GKudithipudi DLearned-Miller ELee SLittman MMadireddy SMendez JNguyen EPiatko CPilly PRaghavan ARahman ARamakrishnan SRatzlaff NSoltoggio AStone PSur ITang ZTiwari SVedder KWang FXu ZYanguas-Gil AYedidsion HYu SVallabha G(2023)A domain-agnostic approach for characterization of lifelong learning systemsNeural Networks10.1016/j.neunet.2023.01.007160:C(274-296)Online publication date: 1-Mar-2023
https://dl.acm.org/doi/10.1016/j.neunet.2023.01.007
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Recommendations

Autonomous cross-domain knowledge transfer in lifelong policy gradient reinforcement learning

Autonomous inter-task transfer in reinforcement learning domains

Relational transfer in reinforcement learning

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations