The hierarchical task network planning method based on Monte Carlo Tree Search

doi:10.1016/j.knosys.2021.107067

Knowledge-Based Systems

Volume 225, 5 August 2021, 107067

https://doi.org/10.1016/j.knosys.2021.107067 Get rights and content

Highlights

•
A hierarchical task network planning model based on MCTS is proposed.
•
The planning tree provides a way for planners to choose decomposition methods.
•
The model can solve the planning problem with uncertain operation executions.

Abstract

Since the hierarchical task network (HTN) planning depends on the domain knowledge of the problem, the planning result relies on the writing order of the decomposition method. Besides, the solution obtained by planning is usually a general feasible solution, which means there are shortcomings in the ability of finding the optimal solution. In order to reduce the dependence of HTN planning on domain knowledge and obtain a better planning solution, Pyhop-m, an HTN planning algorithm based on Monte Carlo Tree Search(MCTS) is proposed. In the planning process, a planning tree is built by MCTS to guide the HTN planner to choose the best decomposition method. Experiments illustrates that whether in the static or dynamic environment, Pyhop-m is superior to the existing Pyhop and heuristic-based Pyhop-h in plan length, planning success rate and optimal solution rate. Under the 95% confidence level, the confidence intervals of Pyhop-m algorithm to achieve the planning success rate and the optimal solution rate in the dynamic environment are [75.82%,89.18%] and [88.67%,93.95%], which are significantly higher than those of Pyhop-h with [58.19%,77.81%] and [69.91%,80.69%], respectively. Moreover, it can solve the planning problem with uncertain action executions by repeatedly simulating and evaluating the leaf nodes of the planning tree. It can be concluded that Pyhop-m can not only make the planning result independent of the writing order of the decomposition methods, but also search out the global optimal solution.

Section snippets

Introductions

As a kind of intelligent planning technology, the HTN has been widely used in practical planning problems such as emergency plan formulation [1], real-time strategy game [2], robot control [3] and automatic combination of web services [4] because the planning process of HTN is similar to the thinking process of human problem solving. The HTN seeks a feasible solution to complete the task by decomposing the task and dissolving the conflict. The basic idea is to extract the special knowledge in

Related works

The HTN planning model can be summarized as three spaces: state space, task space and knowledge space and two systems: planning system and execution system [10]. The initial state, initial task network and domain knowledge generated by the above three spaces are utilized as the inputs of HTN planning. By means of the planning in the planning system, the COA composed of primitive tasks is outputted, and it is finally executed by the execution system in the specific environment. The planner in

The HTN planning based on MCTS

This section first reveals the definitions of HTN and extended Monte Carlo tree nodes. Secondly, the MCTS-HTN algorithm used by MCTS in HTN planning is proposed. Finally, combining MCTS-HTN with the classic SHOP planning algorithm, a new HTN planning algorithm—SHOP-m algorithm is constructed. SHOP-m can utilize the method of Monte Carlo simulation to select the global optimal decomposition method for the tasks that need to be decomposed at the moment. Whether in a static environment or a

Experimental evaluation

According to the Python version of SHOP—Pyhop [32], SHOP-m is implemented in Python, which is called Python-m, and it is compared with Pyhop and Pyhop-h. A classical path planning problem is utilized the same as [15]: given a starting point and an ending point in an n $\times$ m grid and plan the optimal path between the two points, as shown in Fig. 4. We compare the performance of three algorithms in four experiments—Experiment one makes a preliminary comparison of three algorithms; Experiment two

Conclusions and future work

In this paper, we use MCTS to guide how to choose the best decomposition method for compound tasks in HTN planning, thereby improving the planning performance of HTN. We first expand the formal description of HTN and add “planning node” to link MCTS and HTN planning. Then MCTS-HTN is proposed as the entry function for calling MCTS in HTN planning, and the calculation formula of CalValue function is given for MCTS leaf node evaluation. The leaf node evaluation function can be modified according

CRediT authorship contribution statement

Tianhao Shao: Conceptualization, Methodology, Writing - original draft. Hongjun Zhang: Supervision. Kai Cheng: Data curation, Software. Ke Zhang: Writing - review & editing. Lin Bie: Validation.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

We thank Dana Nau for implementing Pyhop and providing open download. This work was supported by the National Natural Science Foundation of China (No. 61806221).

References (34)

GeorgievskiI. et al.
HTN planning: overview, comparison, and beyond
Artificial Intelligence
(2015)
ChengK. et al.
Improving hierarchical task network planning performance by the use of domain-independent heuristic search
Knowl.-Based Syst.
(2018)
WangH.W. et al.
Hierarchical Task Network Planning Based Emergency Response Decision Making Theory and Method
(2015)
RobertsonG. et al.
A review of real-time strategy game AI
AI Mag.
(2014)
T. Gateau, C. Lesire, M. Barbier, Hidden: Cooperative plan execution and repair for heterogeneous robots in dynamic...
A. Hristoskova, B. Volckaert, F.D. Turck, Dynamic composition of semantically annotated web services through QoS-aware...
E.D. Sacerdoti, The nonlinear nature of plans, in: Proceedings of the Fourth International Joint Conference on...
NauD. et al.
SHOP2: an HTN planning system
J. Artificial Intelligence Res.
(2003)
NauD. et al.
Applications of SHOP and SHOP2
Intell. Syst. IEEE
(2005)
IlghamiO.
Documentation for JSHOP2
(2006)

ShaoT.H. et al.

A review of replanning in hierarchical task network

Syst. Eng. Electron.

(2020)

WangH.W. et al.

Review on hierarchical task network planning under uncertainty

Acta Automat. Sinica

(2016)

A. Gerevini, U. Kute, D. Nau, A. Saetti, N. Waisbrot, Combining domain-independent planning and HTN planning: the duet...

GereviniA. et al.

Planning through stochastic local search and temporal action graphs

J. Artificial Intelligence Res.

(2003)

N. Waisbrot, U. Kuter, T. Könik, Combining heuristic search with hierarchical task-network planning: a preliminary...

R. Krogt, M. Weerdt, Plan repair as an extension of planning, in: Proceedings of the Fifteenth International Conference...

M. Elkawkagy, P. Bercher, B. Schattenberg, S. Biundo, Improving hierarchical planning performance by the use of...

Cited by (0)

View full text

The hierarchical task network planning method based on Monte Carlo Tree Search

Highlights

Abstract

Section snippets

Introductions

Related works

The HTN planning based on MCTS

Experimental evaluation

Conclusions and future work

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Artificial Intelligence

Knowl.-Based Syst.

Hierarchical Task Network Planning Based Emergency Response Decision Making Theory and Method

A review of real-time strategy game AI

AI Mag.

SHOP2: an HTN planning system

J. Artificial Intelligence Res.

Applications of SHOP and SHOP2

Intell. Syst. IEEE

Documentation for JSHOP2

A review of replanning in hierarchical task network

Syst. Eng. Electron.

Review on hierarchical task network planning under uncertainty

Acta Automat. Sinica

Planning through stochastic local search and temporal action graphs

J. Artificial Intelligence Res.