Simulation and validation of a reinforcement learning agent-based model for multi-stakeholder forest management
Introduction
Forest management requires integrating numerous objectives in order to satisfy the goals of different stakeholders (Bettinger et al., 2003; Kangas et al., 2005; Shao et al., 2005). Forest companies, for example, are typically driven by economic incentives that involve harvesting high-quality timber and minimizing the construction of logging roads, while conservation-minded groups are interested in preserving the long-term ecological functions of the forest. In addition, government agencies are motivated by the need to spur economic growth while avoiding the exhaustion of available timber.
A broad range of spatial optimization procedures exists for integrating multiple, often conflicting objectives that can exist across spatial scales. Heuristic modeling techniques represent the most recognized collection of approaches because of their ability to produce feasible solutions to large-scale spatial problems (Baskent & Keles, 2005). Such methods include simulated annealing (Baskent & Jordan, 2002; Ohman & Lamas, 2003), tabu search (Caro et al., 2003; Richards & Gunn, 2003) and genetic algorithms (Ducheyne et al., 2006; Venema et al., 2005), all of which evaluate sets of spatial patterns with the aim of improving forest harvesting strategies to meet different objectives. However, implementing spatial optimization procedures can prove challenging when the patterns resulting from harvesting processes are largely dictated by complex interactions between stakeholders and various system components (Cerda & Mitchell, 2004). Economic markets, ecological processes, and political and social dynamics introduce uncertainty into our ability to implement strategic plans (Nelson, 2003). As such, the dynamics that shape the forest harvesting process can be in direct conflict with spatially optimal patterns derived from heuristic modeling techniques.
In order to investigate the dynamics of forest management, agent-based modeling (ABM) offers a simulation approach in which computer agents represent the decision-making behaviors of individual entities that influence their surrounding environment (Brown et al., 2004; Parker et al., 2003). Agents can possess different strategies for responding to the actions of other agents and to the dynamic components of the landscape. Positive feedbacks form as agent actions cause a system to become entrenched along a specific trajectory, while negative feedbacks can exist due to constraints that prevent a system from entering certain states (Manson, 2006a). As a consequence, the results of an agent-based model are viewed as patterns that emerge from the various dynamics of the system (Li, Brimicombe, & Li, 2008). The ability to simulate the behaviors of and complex interactions between humans has led to the widespread use of ABM for modeling a variety of spatial phenomena, including urban dynamics (Brown et al., 2008; Guzy et al., 2008; Li & Liu, 2007; Maoh & Kanaroglou, 2007; Xie et al., 2007; Yin & Muller, 2007), agricultural land use transition (Acosta-Michlik & Espaldon, 2008; Bakker & van Doorn, 2009; Millington et al., 2008) and human mobility (Batty, 2001). Such applications focus on determining how specific human behaviors react to and contend with their surrounding spatial structures and produce different landscape patterns over time. In addition, ABM has been employed to examine how various policies dictate human behavior and the consequent patterns that arise. This includes implementing ABM for evaluating sustainable land use planning strategies (Li & Liu, 2008; Zellner et al., 2008), zoning policies (Zellner et al., 2009), water use regulations (Smajgl, Morris, & Heckbert, 2009) and conservation programs (Hartig & Drechsler, 2009; Janssen et al., 2000; Sengupta et al., 2005).
From a forest management perspective, ABM is a useful approach for simulating the behavior of stakeholders such as forest companies, conservation groups and government agencies in order to evaluate how the interactions amongst these interest groups lead to the emergence of different forest harvesting patterns (Purnomo, Mendoza, Prabhu, & Yasmi, 2005). The utility of ABM is evident in the number of applications for simulating forest-related processes that have surfaced in the literature in recent years. Examples include the use of ABM for simulating the emerging patterns of forest–agriculture transition (Bithell & Brasington, 2009; Castella et al., 2005; Deadman et al., 2004; Evans & Kelley, 2008) and multi-stakeholder management of tropical forests (Purnomo & Guizol, 2006; Purnomo et al., 2005). Yet challenges remain in translating the information gained from an agent-based model into practical forest management strategies, owing to the perception that ABM is largely a mechanism for exploring system dynamics rather than providing predictive results (Brown, Aspinall, & Bennett, 2006). This perception stems from the fact that simulating system complexity often involves various stochastic components that can lead a single model to produce numerous results, all of which are plausible outcomes of a system process (Batty & Torrens, 2005). It is difficult to determine which outcome is most representative of the process given what the agents are trying to achieve, and how, if at all, the generated harvesting patterns satisfy the objectives of the different agents. As such, a paradox exists in the attempt to develop computational models for assisting forest management, because of our conflicting desires to generate optimal spatial patterns while acknowledging the spatial and temporal complexities of the system.
The objective of this study is to address this optimization-complexity paradox through the development and validation of a model for multi-stakeholder forest management that integrates ABM and reinforcement learning (RL). RL is a computational approach stemming from the machine learning and artificial intelligence literature that improves model outcomes by providing numeric reinforcing rewards to those actions in a system that lead towards the achievement of a set of defined objectives (Barto et al., 1981; Sutton, 1988). In this study, RL provides a means to incorporate optimization procedures into an agent-based model that allows agents to interact with each other and their environment while learning how to improve their decision-making behavior. RL algorithms evaluate landscape patterns and relay information to the agents that describes where and when forest harvesting strategies should take place in order for them to achieve their objectives. Furthermore, the RL agent-based model is parameterized as a multi-objective optimization model, which facilitates the use of traditional multi-objective evaluation methods for validating the ability of the model to produce optimal results given the complexity of the system. While agent-based models have previously been integrated with artificial intelligence for spatial applications such as optimal site selection problems (Li, He, & Liu, 2009), simulating animal migration within natural landscapes (Bennett & Tang, 2006) and modeling agricultural land use decision-making (Manson, 2006b), this study offers a novel approach for bridging complexity and optimization by explicitly and independently representing the knowledge acquisition of each agent in order to simulate the interactions and learning of different stakeholders in a forest management context.
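As a concrete illustration of the reinforcement learning idea described above, the following sketch shows a tabular, epsilon-greedy value update in which an agent learns which forest stand yields the highest reward. All stand values, parameters and function names are hypothetical assumptions for illustration; this is not the authors' algorithm.

```python
import random

# Minimal tabular RL sketch (hypothetical values and parameters): an agent
# repeatedly selects a forest stand, receives a noisy numeric reward, and
# incrementally updates its value estimate for that stand.
def run_episodes(n_stands=5, n_episodes=500, alpha=0.1, epsilon=0.2, seed=42):
    rng = random.Random(seed)
    # Illustrative expected reward per stand (e.g., timber value net of costs).
    true_reward = [0.1, 0.5, 0.2, 0.9, 0.3]
    q = [0.0] * n_stands  # learned value estimate for harvesting each stand
    for _ in range(n_episodes):
        # Epsilon-greedy: mostly exploit the best-known stand, sometimes explore.
        if rng.random() < epsilon:
            stand = rng.randrange(n_stands)
        else:
            stand = max(range(n_stands), key=lambda s: q[s])
        reward = true_reward[stand] + rng.gauss(0, 0.05)  # noisy feedback
        q[stand] += alpha * (reward - q[stand])           # incremental update
    return q

q = run_episodes()
best_stand = max(range(len(q)), key=lambda s: q[s])
```

Over repeated episodes the value estimates converge towards the more rewarding stands; this reward-driven updating is the mechanism by which RL steers agents towards harvesting strategies that serve their objectives.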
Section snippets
The RL–ABM multi-stakeholder framework
The framework for integrating RL and ABM for multi-stakeholder forest management is presented in Fig. 1. Agents representing forest companies (F) harvest trees in the landscape based on the availability and price of timber for a specified number of time steps. The period from the first to the final time step is referred to as an episode; the forest harvesting pattern resulting from an episode represents a single harvesting solution. For the first episode, the forest company agents have no
Methods
The model developed in this study consists of three types of stakeholder agents that interact in a forest landscape. Forest dynamics are represented by tree growth and fluctuating timber prices. The RL algorithms ensure that the stakeholder agents learn to contend with forest changes and the actions of other stakeholders when attempting to achieve their objectives.
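The forest dynamics outlined above, tree growth, fluctuating timber prices and a harvesting agent, can be sketched as a simple episode loop. The class names, growth rate, price model and regeneration rule below are illustrative assumptions, not the authors' implementation.

```python
import random

class Stand:
    """One forest stand with a standing timber volume (arbitrary units)."""
    def __init__(self, volume):
        self.volume = volume

    def grow(self, rate=0.02):
        self.volume *= 1 + rate  # simple exponential growth per time step

def run_episode(n_stands=10, n_steps=20, seed=1):
    """One episode: each time step the price fluctuates, a forest-company
    agent harvests the most valuable stand, and all stands grow."""
    rng = random.Random(seed)
    stands = [Stand(rng.uniform(50, 150)) for _ in range(n_stands)]
    price = 1.0
    harvested = []  # the episode's harvesting solution: (stand, revenue) pairs
    for _ in range(n_steps):
        price *= rng.uniform(0.95, 1.05)  # fluctuating timber price
        target = max(range(n_stands), key=lambda i: stands[i].volume)
        harvested.append((target, stands[target].volume * price))
        stands[target].volume = rng.uniform(5.0, 10.0)  # clear-cut, then regrowth
        for stand in stands:
            stand.grow()
    return harvested

solution = run_episode()
```

The list of (stand, revenue) pairs returned by one episode corresponds to a single harvesting solution in the framework's terminology; RL rewards would then be computed from such a solution.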
Model implementation
The model is implemented through the simulation of forest harvesting in the Chilliwack Forest District in southwestern British Columbia. The area provides opportunities for harvesting due to the availability of desired timber and proximity to timber processing and shipping locations. However, the area also lies within the habitat of the Northern Spotted Owl, a species that has been placed on Canada’s Endangered Species list due to declining populations as a result of habitat loss.
The study
Results
The model was run for 10,000 episodes, as this led to each stand being selected at least 50 times, which was found to provide a sufficient level of sampling while avoiding the simulation of additional episodes that produced no significant change in the results.
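The stopping criterion reported above (run until every stand has been selected at least 50 times) can be expressed as a simple coverage check. The uniform-random selection below is a stand-in for the agents' actual learned choices, and all parameter values are illustrative.

```python
import random

def episodes_until_coverage(n_stands, min_count, seed=0, max_episodes=1_000_000):
    """Run episodes until every stand has been selected at least min_count
    times; selection is uniform-random purely for illustration."""
    rng = random.Random(seed)
    counts = [0] * n_stands
    for episode in range(1, max_episodes + 1):
        counts[rng.randrange(n_stands)] += 1  # one stand selected per episode
        if min(counts) >= min_count:
            return episode
    return max_episodes

n_episodes = episodes_until_coverage(n_stands=100, min_count=50)
```

Because the least-visited stand governs the stopping rule, the episode count required always exceeds n_stands × min_count, which is why a fixed budget well above that floor (here, the study's 10,000 episodes) is a practical choice.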
Discussion
The results from this study reveal that agent behavior influences the ability of the agents to learn forest harvesting patterns that are beneficial for achieving their objectives. However, it can be safely concluded that specific changes in agent behavior are not directly manifested in the results, as the forest companies’ willingness to cooperate does not always lead to an improved outcome for the conservationist. Information extracted from the comparison of solutions, the outcomes
Conclusion
The model developed in this study provides three outputs of practical use to forest management. The first is the harvesting solution generated in the final episode of the model, which depicts the optimal decision-making behavior of the forest company agents given their interactions with the conservationist and the government agents. Decision makers can utilize such information to determine whether the resulting spatial patterns conflict with management policies that dictate the
Acknowledgements
The authors would like to thank the Natural Sciences and Engineering Research Council of Canada (NSERC) for full support of this study under the Canadian Graduate Scholarship awarded to the first author and the Discovery Grant awarded to the second author. Acknowledgement is also given to the Government of British Columbia for providing the British Columbia Forest Cover Data.
References (49)
- et al. Assessing vulnerability of selected farming communities in the Philippines based on a behavioural model of agent’s adaptation to global environmental change. Global Environmental Change-Human and Policy Dimensions (2008)
- et al. Farmer-specific relationships between land use change and landscape factors: Introducing agents in empirical land use modelling. Land Use Policy (2009)
- et al. Forest landscape management modeling using simulated annealing. Forest Ecology and Management (2002)
- et al. Spatial forest planning: A review. Ecological Modelling (2005)
- et al. Modelling and prediction in a complex world. Futures (2005)
- et al. Spatial forest plan development with ecological and economic goals. Ecological Modelling (2003)
- et al. Coupling agent-based models of subsistence farming with individual-based forest models and dynamic models of water distribution. Environmental Modelling and Software (2009)
- et al. Agent-based and analytical modeling to evaluate the effectiveness of greenbelts. Environmental Modelling and Software (2004)
- et al. Exurbia from the bottom-up: Confronting empirical challenges to characterizing a complex system. Geoforum (2008)
- et al. Agrarian transition and lowland-upland interactions in mountain areas in northern Vietnam: Application of a multi-agent simulation model. Agricultural Systems (2005)
- Assessing the transition from deforestation to forest regrowth with an agent-based model of land cover change for south-central Indiana (USA). Geoforum
- Smart spatial incentives for market-based conservation. Biological Conservation
- An adaptive agent model for analysing co-evolution of management and policies in a complex rangeland system. Ecological Modelling
- Socioecological landscape planning approach and multicriteria acceptability analysis in multiple-purpose forest management. Forest Policy and Economics
- Agent-based services for the validation and calibration of multi-agent models. Computers, Environment and Urban Systems
- Defining agents’ behaviors to simulate complex residential development using multicriteria evaluation. Journal of Environmental Management
- Land use in the southern Yucatan peninsular region of Mexico: Scenarios of population and institutional change. Computers, Environment and Urban Systems
- Clustering of harvest activities in multi-objective long-term forest planning. Forest Ecology and Management
- Simulating forest plantation co-management with a multi-agent system. Mathematical and Computer Modelling
- Integrating stand and landscape decisions for multi-purposes of forest harvesting. Forest Ecology and Management
- Forest optimization using evolutionary programming and landscape ecology metrics. European Journal of Operational Research
- Interactive evolutionary approaches to multiobjective spatial decision making: A synthetic review. Computers, Environment and Urban Systems
- The emergence of zoning policy games in exurban jurisdictions: Informing collective action theory. Land Use Policy
- A new framework for urban sustainability assessments: Linking complexity, information and policy. Computers, Environment and Urban Systems