Original papersError analysis and correction of spatialization of crop yield in China – Different variables scales, partitioning schemes and error correction methods
Introduction
Given a backdrop of global environmental dynamism and climate change, traditional geo-ecological processes have undergone drastic changes over the past few decades. The geographical processes are no longer simple natural processes, and the researches of ecological processes also are no longer confined to the dynamics and development in ecosystem. The integration and intersection of multiple disciplines is becoming an important characteristic of modern geo-ecological processes (Fu et al., 2006).
It is an important symbol of the combination of human activities and geo-ecological processes to apply statistics to the study of geo-ecological processes. Socio-economic statistics are collected and published based on administrative division. So they have low spatial resolution and lack of the description to spatial distribution characteristics of socioeconomic statistics. It is difficult to use them for comprehensive analysis of socio-economic data and other data in practical application, which limits their application to geographical research to a great extent. There are three major problems. First, the contradiction between the spatial heterogeneity of geographical elements and the homogeneity of statistics in the same administrative division; Second, the disagreement between landscape scale and statistical scale; Third, the statistical indicators in different regions are inconsistent (Liu and Li, 2012). The spatialization of socio-economic statistics can solve the above problems effectively (Liao and Zhang, 2009).
Numerous studies focused on the spatialization of socio-economic statistics, including spatialization of population (Tobler et al., 1995, Tobler et al., 1997, Sutton et al., 2001, Tian et al., 2005) and gross domestic product (GDP) statistics (Ebener et al., 2005, Doll et al., 2006, Sutton et al., 2007, Elvidge et al., 1997, Elvidge et al., 2009a, Elvidge et al., 2009b and Ghosh et al., 2009). With the rapid development of Remote Sensing (RS) and Geographic Information System (GIS) technology, the spatialization of agricultural production data are frequently studied, mainly including spatialization of crop acreage (Qiu et al., 2003, Leff et al., 2004, You and Wood, 2006, You et al., 2009, Monfreda et al., 2008, Khan et al., 2010, Zhang et al., 2013, Jin et al., 2015, Salmon et al., 2015, Liu et al., 2017) and agricultural production inputs (Potter et al., 2010, Sun et al., 2010, Yan and Pan, 2014). However, there are fewer researches on crop yield spatialization. For instance, Shi et al. used the cultivated land data to spatialize maize yield per unit area statistics by multivariable linear regression model, and got a spatial distribution map of maize yield per unit area in Jilin province (Shi et al., 2011). Liu et al. took population density as the dependent variables and crop yield as independent variables to construct a regression model with the support of land use data. The model was then applied to spatialize provincial-level crop yield statistics, resulting in a distribution map of crop yield of China at 1 km by 1 km in 2000 and the precision of crop yield spatialization results were analyzed from provincial scale down to prefectural scale and county scale (Liu and Li, 2012). But few studies explored the influence of variables scales and partitioning schemes on precision of crop yield spatialization.
As one of frequently-used geo-data processing methods, spatialization of attribute data inevitably results in errors during data processing. Spatialization errors can be reduced by correcting initial spatialization results. Many error modifying methods have been used to correct spatialization errors, such as average correction method (Wu et al., 2015), proportional coefficient correction method (Shi et al., 2016), weight coefficient correction method based on the basic idea that different farmland types have the same weight (Liao and Qin, 2014). However, there are few researches about comparing the pros and cons of different error correction methods. So, in this study we will discuss the influence of some new error correction methods on crop output spatialization and compare them with the existing error correction methods to improve spatialization precision.
This study attempts to simulate the spatial distribution of crop yield in China using land use data with the following objectives: (1) exploring the influence of variables scales on precision of crop yield spatialization; (2) detecting the influence of partitioning schemes on precision of crop yield spatialization; and (3) comparing the pros and cons of different error correction methods.
Section snippets
Data sources
Five datasets are used for this study.
- 1.
County-level and prefecture-level crop yield statistics of China in 2010. The data come from Statistical Yearbook of China in 2011.
- 2.
Land use dataset of China in 2010. The data set is provided by Data Center for Resources and Environmental Sciences, Chinese Academy of Sciences (RESDC) (http://www.resdc.cn).
- 3.
County-level administrative map of China in 2010. It mostly includes vector data of county-level administrative boundary in China and other attribute data,
Research method
Crop output is proportional to farmland area, and different farmland types have different influence on crop output, and multivariate linear regression analysis method (MLRAM) is the most frequently used method to realize spatialization of attribute data. So, we chose MLRAM to spatialize crop output. Its basic formula is as follows:
Supposing one dependent variable y is affected by k independent variables (x1, x2, …, xk), and there are n groups of observed values (ya, x1a, x2a, …, xka), a = 1, 2, …,
Error correction
Error is the difference between the analog value of model and the actual observation value. The basic formula is as follows:ε represents error, y is a statistics and yi is a analog value.
The purpose of error correction is to improve spatialization precision by assigning errors to initial spatialization results based on some methods, such as average correction method (Wu et al., 2015), proportional coefficient correction method (Shi et al., 2016), weight coefficient correction method based
Conclusions
In this paper, three variables scales including prefectural scale, county scale and grid cell (1 km × 1 km) were selected. Five partitioning schemes (no partition of China, 7 regions of China, 9 regions of China, 10 regions of China, partitions of China by province) were considered. A total of 28 kinds of multivariable linear regression models were constructed with area of different types of farmland as independent variables, crop yields as dependent variables. Then, seven kinds of error
Acknowledgements
This work was supported by the National Key R&D Program of China [Grant number 2016YFA0602702].
References (39)
- et al.
Mapping regional economic activity from night-time light satellite imagery
Ecol. Econ.
(2006) - et al.
A global poverty map derived from satellite data
Comput. Geosci.
(2009) - et al.
Disaggregating and mapping crop statistics using hypertemporal remote sensing
Int. J. Appl. Earth Obs. Geoinf.
(2010) - et al.
Global rain-fed, irrigated, and paddy croplands: A new high resolution map derived from remote sensing, crop inventories and climate data
Int. J. Appl. Earth Obs. Geoinf.
(2015) - et al.
Detecting spatiotemporal dynamics of global electric power consumption using DMSP-OLS nighttime stable light data
Appl. Energy
(2016) - et al.
Modeling population density using land cover data
Ecol. Model.
(2005) - et al.
An entropy approach to spatial disaggregation of agricultural production
Agric. Syst.
(2006) - et al.
Generating plausible crop distribution maps for sub-saharan africa using a spatially disaggregated data fusion and optimization approach
Agric. Syst.
(2009) Planting Regionalization in China
(1984)- et al.
From wealth to health: modelling the distribution of income per capita at the sub-national level using night-time light imagery
Int. J. Health Geogr.
(2005)
Relation between satellite observed visible-near infrared emissions, population, economic activity and electric power consumption
Int. J. Remote Sens.
A fifteen year record of global natural gas flaring derived from satellite data
Energies
Progress and perspective of geographical-ecological processes
Acta Geogr. Sinica
Estimation of Mexico’s Informal Economy and Remittances Using Nighttime Imagery
Remote Sensing
Farmland dataset reconstruction and farmland change analysis in China during 1661–1985
J. Geogr. Sci.
Geographic distribution of major crops across the world
Global Biogeochem. Cycles
A spatialization method for survey data of theoretical stock-carrying capacity of grassland in China and its application
Geogr. Res.
Study on error evaluating index for spatialisation of attribute data
J. Geo-inform. Sci.
Spatial distribution of China crop output based on land use and population density
Trans. Chinese Soc. Agric. Eng. (Trans. CSAE)
Cited by (5)
Downscaling the APSIM crop model for simulation at the within-field scale
2023, Agricultural SystemsAgri-biomass supply chain optimization in north China: Model development and application
2022, EnergyCitation Excerpt :Growing numbers of researchers have invested great efforts with the aim of developing advanced technologies that can convert agri-biomass to energy and fuel. These benefits notwithstanding, the use of agri-biomass for energy and fuel comes with high collection and transportation costs and this is because of its intrinsic characteristics, including its low energy density, scattered geographical distribution, and seasonal and weather sensitivity [10–12]. In addition, it should be immediately disposed of as this will make it possible to plant the next season crops after they are produced.
Economic analysis of different straw supply modes in China
2021, EnergyCitation Excerpt :However, China has developed an agricultural mode where individual households are responsible for production. Combining this with the special characteristics of straw, such as low density, and obvious seasonality, there is significant uncertainty in the process of straw collection, storage, and transportation [14–16]. Insufficient supply and high costs are the main obstacle restricting the business utilization of straw.
A Hybrid System Based on Dynamic Selection for Time Series Forecasting
2022, IEEE Transactions on Neural Networks and Learning Systems