Abstract
Task decomposition and state abstraction are crucial techniques in reinforcement learning. They allow an agent to ignore aspects of its current state that are irrelevant to its current decision, thereby speeding up dynamic programming and learning. This paper presents the SVI algorithm, which uses a dynamic Bayesian network model to construct an influence graph that indicates relationships between state variables. SVI performs state abstraction for each subtask by ignoring irrelevant state variables and lower-level subtasks. Experimental results show that the task decomposition introduced by SVI can significantly accelerate the construction of a near-optimal policy. This general framework can be applied to a broad spectrum of complex real-world problems such as robotics, industrial manufacturing, and games.
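To make the idea concrete, below is a minimal sketch (not the authors' implementation) of the two steps the abstract describes: deriving an influence graph from a DBN-style factored transition model, and computing the set of state variables relevant to a subtask. The example domain, variable names, and the backward-closure rule for relevance are illustrative assumptions.

```python
# Sketch: influence graph from a DBN transition structure, and per-subtask
# state abstraction by keeping only variables the subtask can depend on.
# All names and the toy domain are hypothetical.

from collections import defaultdict, deque

# DBN structure of a toy factored MDP: for each state variable, the set of
# state variables its next value depends on (its parents in the 2-slice DBN).
dbn_parents = {
    "taxi_pos":      {"taxi_pos"},
    "passenger_loc": {"passenger_loc", "taxi_pos"},
    "destination":   {"destination"},
}

def influence_graph(parents):
    """Add an edge u -> v whenever variable u influences the transition of v."""
    graph = defaultdict(set)
    for var, deps in parents.items():
        for dep in deps:
            if dep != var:
                graph[dep].add(var)
    return graph

def relevant_variables(parents, subtask_vars):
    """Variables a subtask may need: everything its reward/termination
    variables depend on, found by backward closure over DBN parents."""
    relevant, frontier = set(subtask_vars), deque(subtask_vars)
    while frontier:
        var = frontier.popleft()
        for dep in parents.get(var, ()):
            if dep not in relevant:
                relevant.add(dep)
                frontier.append(dep)
    return relevant

if __name__ == "__main__":
    print(dict(influence_graph(dbn_parents)))
    # A "pick up passenger" subtask whose termination depends only on
    # passenger_loc can safely ignore the destination variable.
    print(relevant_variables(dbn_parents, {"passenger_loc"}))
```

In this sketch the abstraction for a subtask is simply the projection of the state onto its relevant-variable set, which is the kind of per-subtask state reduction the paper attributes to SVI.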
Cite this article
Lasheng, Y., Zhongbin, J. & Kang, L. Research on task decomposition and state abstraction in reinforcement learning. Artif Intell Rev 38, 119–127 (2012). https://doi.org/10.1007/s10462-011-9243-9