Prediction of maize growth stages based on deep learning

doi:10.1016/j.compag.2020.105351

Computers and Electronics in Agriculture

Volume 172, May 2020, 105351

https://doi.org/10.1016/j.compag.2020.105351

Highlights

• ConvLSTM encoder-decoder model can forecast daily weather factors correctly.
• Hybrid model and data-driven model can predict maize growth stages.
• Data-driven model outperforms hybrid model in forecasting performance.
• The proposed techniques can be used for the arrangement of agricultural activities.

Abstract

An accurate forecast of daily meteorological factors throughout the year is not only of great significance for the study of climate trends in a given area but also enables the prediction of crop growth stages. Moreover, the prediction of crop growth stages is related to the scheduling of planting and tillage, the determination of machine harvest time, and the prediction of crop yield. However, highly complex dynamics cause large volatility in meteorological factors, so it is very challenging to predict the crop growth stage accurately from weather data. To solve this problem, we propose a data-driven encoder-decoder model, using long short-term memory (LSTM) and convolutional LSTM (ConvLSTM), which can be applied to forecast daily sunshine duration, cumulative precipitation, and average temperature for the coming year. To further test the performance of the ConvLSTM-based model, it is compared with the conventional LSTM encoder-decoder model and the convolutional neural network (CNN)-LSTM encoder-decoder model. The results demonstrate that the ConvLSTM-based model is more accurate than the others for forecasting temperature (MAE = 2.602 °C, RMSE = 3.456 °C), precipitation (MAE = 3.878 mm, RMSE = 10.503 mm), and sunshine hours (MAE = 3.445 h, RMSE = 4.172 h) in 2014–2016. Furthermore, precise forecasting of meteorological factors allows us to develop a hybrid model and a data-driven model for the prediction of each growth stage separately. The hybrid model combines the ConvLSTM encoder-decoder model with empirical models, whereas the data-driven model comprises the ConvLSTM encoder-decoder model and traditional neural network structures. Finally, we compared the two types of models on a real-world dataset from Dandong and concluded that the data-driven model is more accurate than the hybrid model for the prediction of maize growth stages, with $R^{2}$ in the range of 0.755–0.883, MAE of 0.588–2.205 days, and RMSE of 0.978–2.729 days. In the future, these models can also be used to predict the growth stages of other crops.
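
The three scores quoted in the abstract (MAE, RMSE, and $R^{2}$) can be computed as in the following minimal sketch. It assumes that $R^{2}$ denotes the usual coefficient of determination, which the abstract does not state explicitly. The function names and the sample values are hypothetical and are not data from the study.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error between two equal-length sequences."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot (assumed definition)."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Made-up day-of-year values for one growth stage (illustration only).
observed = [150.0, 152.0, 155.0, 149.0]
predicted = [151.0, 152.0, 153.0, 150.0]

print("MAE  (days):", mae(observed, predicted))
print("RMSE (days):", round(rmse(observed, predicted), 3))
print("R^2        :", round(r2(observed, predicted), 3))
```

MAE and RMSE carry the unit of the predicted quantity (°C, mm, h, or days above), whereas $R^{2}$ is dimensionless, which is why the abstract reports the error measures with units and $R^{2}$ without.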

Introduction

A growth stage model is one of the most important parts of a crop growth model (Ceglar et al., 2011). The accurate prediction of crop growth stages can help agricultural workers predict crop yield effectively, arrange farming activities efficiently, and determine an appropriate harvesting time (Van Oort et al., 2011). In the past few decades, many traditional methods have been used to predict the growth stage of crops, most of which function by simulating the processes of actual crop development. In addition, weather conditions play an indispensable role in crop development. Therefore, many researchers have predicted the growth stage of crops from meteorological factors, such as temperature, precipitation, and solar radiation. Of these factors, temperature is one of the main driving forces affecting crop development (Wang et al., 2017). As a consequence, many studies on temperature-based growth prediction have been reported (Kumudini et al., 2014, Soltani et al., 2006, Yun et al., 2017, Zhang and Tao, 2013, Jones et al., 1986). In addition, research has been conducted on the prediction of crop growth stage based on multiple meteorological factors (Jones, 1986, Yang et al., 2004, Zhao et al., 2018). Among these studies, the climate suitability model (Xu, 2014) is notable: this is based on multiple meteorological factors and has been used to analyze the relationship between climate and the crop growth process; it could also be used to predict the crop growth stage and yield. Beyond meteorological data, satellite remote sensing data are the most important for monitoring large-scale crop conditions and obtaining crop growth information (Meng et al., 2008). Hence, studies on the prediction of the growth stage of crops and vegetation using remote sensing data have also increased in recent years (Hu et al., 2009, Liu et al., 2017, Sakamoto et al., 2005, Yu et al., 2012). 
Most of the above methods depend on existing domain knowledge and have drawbacks, such as a fixed framework and a lack of flexibility. In contrast with traditional methods, a data-driven model can be built directly from the data when there is insufficient domain knowledge. Models created in this way can even have a better non-linear fitting ability than models based on domain knowledge (Alhnaity et al., 2019, Haider et al., 2019, Nevavuori et al., 2019, Reddy and Prasad, 2018, Yalcin, 2017). We can conclude that all the methods mentioned above can calculate the growth stage of crops or vegetation under certain conditions. However, most of them share the shortcoming that they cannot provide long-term predictions, because they need the meteorological factors of the forecast year when calculating growth stages. That is, most of the above models need weather data at the appropriate timescale to function normally, which also limits model flexibility to a certain extent.
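
To make the dependence on forecast-year weather concrete, here is a minimal sketch of one common temperature-based approach, thermal-time (growing degree day) accumulation. It is not the model used in this paper. The base temperature of 10 °C, the thermal-time requirement, and the function names are illustrative assumptions.

```python
def daily_thermal_time(t_mean, t_base=10.0):
    """Thermal time contributed by one day: degrees of mean temperature above the base.

    t_base = 10.0 is an assumed base temperature, used only for illustration.
    """
    return max(0.0, t_mean - t_base)


def day_stage_reached(daily_mean_temps, required_thermal_time, t_base=10.0):
    """Return the 1-based day on which accumulated thermal time first reaches
    the requirement, or None if it is never reached within the series."""
    total = 0.0
    for day, t_mean in enumerate(daily_mean_temps, start=1):
        total += daily_thermal_time(t_mean, t_base)
        if total >= required_thermal_time:
            return day
    return None


# Hypothetical daily mean temperatures (degrees C) after sowing.
temps = [8.0, 12.0, 15.0, 20.0, 18.0]
print(day_stage_reached(temps, required_thermal_time=15.0))
```

The function cannot return a date without the daily temperatures of the season being predicted, which is exactly the limitation described above: the growth stage model is only as far-sighted as the weather series fed into it.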

Therefore, these problems have hindered the longer-term prediction of crop growth stage. The key to solving the problems lies in the long-term forecasting of meteorological factors and establishing the climate model in a generalizable manner. The weather data needed for the growth stage model are mostly subsets of the daily meteorological factors throughout the year. Predicting the growth stage of crops requires the forecasting of several weather factors that play an important role in crop growth in the future year. Moreover, the daily weather of a whole year is a time series with large random variation, whose internal characteristics are difficult to learn. Therefore, we need to find a method of providing stable weather data for the growth stage model, based on annual day-to-day meteorological factors. Fortunately, the implicit relationships within the time series can be extracted well by deep learning methods, and they can establish more complex models for data prediction. Consequently, we hope to predict the crop growth stage far in advance by means of deep learning and meteorological data. For the forecast of short-term meteorological factors, a recurrent neural network (RNN) (Mikolov et al., 2010) is a suitable model. This model and extended models have proven to perform well in time series forecasting problems, such as meteorological data prediction (Kumar et al., 2019, Poornima and Pushpalatha, 2019, Qing and Niu, 2018, Wang et al., 2018). Convolutional long short-term memory (ConvLSTM) is a special-purpose variant of RNN that was established by Shi et al. for precipitation nowcasting (Shi et al., 2015). ConvLSTM can be used to construct a good temporal sequence relationship and can perform local feature extraction, like a CNN. Experimental results showed excellent performance in the forecasting of approaching precipitation (Souto et al., 2018). However, most of the above neural network structures are suitable only for short-term weather forecasting. 
It is difficult to forecast the daily meteorological factors for a whole year independently by using these structures. The research described in this paper addresses this challenge.
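
For reference, the ConvLSTM cell as formulated by Shi et al. (2015) replaces the matrix products of a fully connected LSTM with convolutions. The notation below follows that paper rather than anything stated in this excerpt: $*$ denotes convolution, $\circ$ the Hadamard (element-wise) product, $\sigma$ the logistic sigmoid, $X_t$ the input, $H_t$ the hidden state, and $C_t$ the cell state.

$$
\begin{aligned}
i_t &= \sigma\left(W_{xi} * X_t + W_{hi} * H_{t-1} + W_{ci} \circ C_{t-1} + b_i\right) \\
f_t &= \sigma\left(W_{xf} * X_t + W_{hf} * H_{t-1} + W_{cf} \circ C_{t-1} + b_f\right) \\
C_t &= f_t \circ C_{t-1} + i_t \circ \tanh\left(W_{xc} * X_t + W_{hc} * H_{t-1} + b_c\right) \\
o_t &= \sigma\left(W_{xo} * X_t + W_{ho} * H_{t-1} + W_{co} \circ C_t + b_o\right) \\
H_t &= o_t \circ \tanh\left(C_t\right)
\end{aligned}
$$

The convolutions are what give the cell its CNN-like local feature extraction, while the gated recurrence over $C_t$ and $H_t$ carries the temporal sequence relationship.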

In view of the aforementioned characteristics of daily meteorological factors, an encoder-decoder model, combined with both ConvLSTM and long short-term memory (LSTM), was developed in this study. The model can be used to forecast the daily mean temperature, daily sunshine duration, and daily accumulated precipitation, which are suitable for growth stage models, throughout the whole year. In addition, the model is combined with a climate suitability model to form a hybrid model, and combined with back-propagation (BP) neural networks to form a data-driven model. These two types of models are then applied to predict maize growth stages, and their forecasting performance is compared.

Section snippets

Study region

The research area of this study is located in Dandong City, Liaoning Province, China. Its coordinates are 39°43′N to 41°09′N and 123°22′E to 125°42′E (Fig. 1). This area has a warm temperate sub-humid monsoon climate: it is mild and humid all year around, and rarely experiences extreme weather. The study region is one of the wettest areas in northern China, with a total annual rainfall of 800–1200 mm. The annual average temperature across the region is 8.9 °C, and the average annual number of

Data processing and hyperparameter selection of ConvLSTM encoder-decoder model

We used eight subsequences of daily meteorological factors for Dandong from 1981 to 2016 for model training and testing; each subsequence contained 13,068 samples. For these time series, the data for the first 32 years (11,616 samples from 1981 to 2012, each containing eight weather features) were used as the training set, and the last four years (1,452 samples from 2013 to 2016, each containing eight weather features) acted as the test set. In addition, we expanded the dataset by a sliding
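
The split described above can be sketched as follows. The sample counts imply 363 samples per year (13,068 / 36), and the sketch relies on that. The final sentence is cut off in this excerpt, so the sketch assumes it refers to a sliding window; the window length, forecast length, and stride used here are hypothetical choices, not the paper's settings.

```python
SAMPLES_PER_YEAR = 13068 // 36  # 363, implied by the counts quoted above


def split_by_year(series, n_train_years=32, per_year=SAMPLES_PER_YEAR):
    """Chronological split: the first n_train_years for training, the rest for testing."""
    cut = n_train_years * per_year
    return series[:cut], series[cut:]


def sliding_windows(series, input_len, output_len, stride=1):
    """Cut (input, target) pairs from one sequence.

    Each input covers input_len consecutive samples and each target covers the
    output_len samples that immediately follow it.
    """
    pairs = []
    last_start = len(series) - input_len - output_len
    for start in range(0, last_start + 1, stride):
        mid = start + input_len
        pairs.append((series[start:mid], series[mid:mid + output_len]))
    return pairs


# Placeholder for one subsequence; real samples would hold eight weather features.
series = list(range(13068))
train, test = split_by_year(series)
print(len(train), len(test))

# Hypothetical setting: one year in, the following year out, shifted by one year.
pairs = sliding_windows(train, SAMPLES_PER_YEAR, SAMPLES_PER_YEAR, stride=SAMPLES_PER_YEAR)
print(len(pairs))
```

A smaller stride yields many more overlapping training pairs from the same 32 years, which is the usual motivation for expanding a dataset this way.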

Comparison with other encoder-decoder models

Table 8, Table 9, Table 10 show the MAE and RMSE of the predictions of temperature, precipitation, and sunshine hours by our proposed model, the CNN-LSTM encoder-decoder model (Kim and Cho, 2019), the LSTM encoder-decoder model (Park et al., 2018), ConvLSTM, LSTM, and gated recurrent unit (GRU) (Chung et al., 2014) for the test set. The units of temperature, precipitation, and sunshine duration are °C, mm, and h, respectively. Overall, the ConvLSTM encoder-decoder model was superior to other

Discussion

In this work, the ConvLSTM encoder-decoder model was first designed to predict the annual daily weather data for growth stage models. Through grid search tuning, the parameter selection of all models participating in the performance comparison was similar. In the CNN-LSTM encoder-decoder model, the 1D convolution layer was used as a part of the encoder, and the number of filters selected was 64, which was the same as the number of filters of the ConvLSTM layer; LSTM was used as the decoder in all

Conclusion

In this paper, the ConvLSTM encoder-decoder model has been proposed for forecasting meteorological time series. This study has demonstrated that the hybrid model and data-driven model are effective methods for the prediction of maize growth stages, and that the predictive capability of the data-driven model is better than that of the hybrid model. Moreover, this study has also verified the feasibility and practicability of using the dates of growth stages and the meteorological data of the

CRediT authorship contribution statement

Yang Yue: Conceptualization, Methodology, Software, Writing - original draft, Formal analysis, Validation, Writing - review & editing. Jin-Hai Li: Writing - original draft, Visualization, Writing - review & editing. Li-Feng Fan: Writing - original draft, Formal analysis, Writing - review & editing. Li-Li Zhang: Resources, Data curation. Peng-Fei Zhao: Writing - original draft. Qiao Zhou: Writing - original draft. Nan Wang: Writing - original draft. Zhong-Yi Wang: Writing - original draft. Lan

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported by the National Key Research and Development Program of China [grant number 2016YFD0300304].

References (48)

J. Caubel et al. Broadening the scope for ecoclimatic indicators to assess crop climate suitability according to ecophysiological, technical and quality criteria. Agric. For. Meteorol. (2015)
A. Ceglar et al. The simulation of phenological development in dynamic crop model: the Bayesian comparison of different methods. Agric. For. Meteorol. (2011)
T.-Y. Kim et al. Predicting residential energy consumption using CNN-LSTM neural networks. Energy (2019)
C. Liao et al. Using spatio-temporal fusion of Landsat-8 and MODIS data to derive phenology, biomass and yield estimates for corn and soybean. Sci. Total Environ. (2019)
L. Liu et al. Real-time and short-term predictions of spring phenology in North America from VIIRS data. Remote Sens. Environ. (2017)
X. Qing et al. Hourly day-ahead solar irradiance prediction using weather forecasts by LSTM. Energy (2018)
T. Sakamoto et al. A crop phenology detection method using time-series MODIS data. Remote Sens. Environ. (2005)
A. Soltani et al. Modeling chickpea growth and development: Phenological development. Field Crops (2006)
P. Van Oort et al. Correlation between temperature and phenology prediction error in rice (Oryza sativa L.). Agric. For. Meteorol. (2011)
H. Yang et al. Hybrid-maize—a maize simulation model that combines two crop modeling approaches. Field Crops Res. (2004)
K. Yun et al. Can a multi-model ensemble improve phenology predictions for climate change studies? Ecol. Model. (2017)
L.A. Zadeh. Fuzzy sets
S. Zhang et al. Modeling the response of rice phenology to climate change and variability in different climatic zones: comparisons of five models. Eur. J. Agron. (2013)
J.F. Zhao et al. Effects of climate change on cultivation patterns of spring maize and its climatic suitability in Northeast China. Agric. Ecosyst. Environ. (2015)

Alhnaity, B., Pearson, S., Leontidis, G., Kollias, S., 2019. Using deep learning to predict plant growth and...
Cho, K., Van Merriënboer, B., Bahdanau, D., Bengio, Y., 2014. On the properties of neural machine translation:...
Chung, J., Gulcehre, C., Cho, K., Bengio, Y., 2014. Empirical evaluation of gated recurrent neural networks on sequence...
Gers, F.A., Schmidhuber, J., Cummins, F., 1999. Learning to forget: Continual prediction with LSTM....
Chollet, F., 2015. Keras....
M. Ghamghami et al. Comparison of data mining and GDD-based models in discrimination of maize phenology. Int. J. Plant Prod. (2019)
S.A. Haider et al. LSTM neural network based forecasting model for wheat production in Pakistan. Agronomy-Basel (2019)
Y.-Y. Hou et al. Simulation model of spring maize developmental stages in Northeast China based on climatic suitability. Chin. J. Ecol. (2012)
Ioffe, S., Szegedy, C., 2015. Batch normalization: Accelerating deep network training by reducing internal...
Jones, C.A., et al. Subroutine structure. CERES-Maize: a simulation model of maize growth and development/edited by...

Cited by (20)

An improved LSTM-based model for identifying high working intensity load segments of the tractor load spectrum
2023, Computers and Electronics in Agriculture
In order to solve the problems of serious redundancy of existing load spectrum data, unclear identification of load segments corresponding to high working intensity, and failure of load spectrum interception to meet the requirements of heavy load conditions, this paper proposes an MHA-ConvLSTM (Multi-head attention Convolutional LSTM) network model for identifying high working intensity load segments of the tractor load spectrum. The deep learning model integrates a multi-head attention mechanism and ConvLSTM network as the core, keeping the temporal order of continuously changing load sequences as the basic principle, deeply mining the local features contained within the small range load of dynamically changing load, and strengthening the matching relationship of the intrinsic features in long distance and large data volume load. This research selects rotary tillage as the verification condition, builds the multi-sensor test system to carry out the working load test, and takes the tractor rotary tillage load spectrum data as the validation object. The analysis shows that the accuracy and F1-score of the MHA-ConvLSTM model reach 97.69% and 97.83%, respectively, and the operation time is only 0.5289 s, which is 15.82% faster than LSTM. In addition, the model in this paper was used to identify 399 load segments with high workload and high work intensity, and 388 load segments were successfully verified with an error rate of less than 3%. This paper provides a new technical solution for applying the agricultural equipment load spectrum.
Temperature forecasting of grain in storage: A multi-output and spatiotemporal approach based on deep learning
2023, Computers and Electronics in Agriculture
Grain temperature forecasting is crucial to ventilation management in granaries, as it facilitates precautions against grain mold caused by increases in grain temperature. Despite their capability for extracting features from nonlinear temperature data, frontier forecasting models empowered by artificial intelligence are found to be limited in forecast efficiency and accuracy. All existing models show insufficient efficiency, as their outputs are limited to either single representative sensors or average temperature values of certain layers at a time. Most of these models fail to take into account the spatial topology of the sensor network, which hinders higher forecast accuracy. This paper therefore proposes a multi-output and spatiotemporal model that combines Graph Convolution Neural Networks (GCN) and Transformer to address such issues. GCN captures the spatial correlations of the sensors and the topological information of the sensor network in the granary. Transformer captures both long-term and short-term temporal features and describes temporal dependencies. The proposed model is constructed on a real-world dataset from a granary in Shaanxi, China, and its performance is evaluated and compared with that of four existing models. Results demonstrate that the proposed model outperforms the others in terms of MAE and RMSE. Furthermore, a three-dimensional interpolation based on the forecast results yields a continuous temperature field of the entire granary, which makes the temperature conditions of all locations accessible, not only those of the discrete ‘sensored’ areas.
Regulating the time of the crop model clock: A data assimilation framework for regions with high phenological heterogeneity
2023, Field Crops Research
In crop growth data assimilation systems, the mismatch between simulated and observed phenology significantly deteriorates the performance of crop growth modeling. This situation may be more severe for smallholder farmers-managed fields, where the phenological heterogeneity was high even when climate condition was relatively uniform. Previous studies investigated the non-sequential methods to retrospectively assimilate historical phenology observations. However, approaches to dynamically assimilating phenological measurements through sequential data assimilation methods remain unexplored.
One of the most intractable challenges of dynamic phenology assimilation is that a considerable proportion of model parameters and variables are entangled with phenology, therefore simply assimilating phenological measurements could disturb the model clock. This study aims to establish a robust crop data assimilation framework capable of assimilating phenological measurements in real time without disturbing the model clock.
The framework used an open-source version of the AquaCrop model to simulate crop growth and used the ensemble Kalman filter (EnKF) to assimilate observations sequentially. A parameter refresh method was proposed to restore the phenological consistency of model parameters after updating the phenology state. Assimilation strategies with different observation types and compositions of state vectors were designed after a global sensitivity analysis of model parameters. These strategies were evaluated through the Observing System Simulation Experiments (OSSE), and the selected strategies were tested in a real-world case.
The results of the OSSE show that the phenological mismatch problem greatly affects crop growth simulation, and this mismatch could not be narrowed effectively by assimilating non-phenological observations. Assimilating phenological measurements with the proposed parameters refresh method and assimilation strategies closed this mismatch and produced better performance compared to the Restart-EnKF method. In the real-world paddy rice case, assimilating phenology with the proposed strategies significantly improved yield estimation in low-yield plots (less than 4 ton/ha) compared to assimilating canopy cover (CC) alone, with an R² increase from 0.07 to 0.48. Assimilating CC, biomass and phenology simultaneously produced the best yield estimation for all plots, with R² = 0.57 and RMSE = 1.00 ton/ha.
Assimilating phenology under a consistent model clock significantly improved yield estimation when the phenological heterogeneity of plots was high.
The results highlight the effectiveness and robustness of the established data assimilation framework for dynamic crop growth simulation, indicating the potentials of the proposed data assimilation framework for regional in-season crop modeling and yield forecasting.
Quality prediction of tractor rotary tillage based on BiConvLSTM with self-attention
2023, Computers and Electronics in Agriculture
To accurately predict rototilling performance and rotary tillage quality based on multi-sensor measured data of tractor electro-hydraulic suspension system, an improved ConvLSTM-based model is proposed, and field tests of rototilling operation are carried out to verify the accuracy. The model is based on the SA-BiConvLSTM (Self-attention Bi-directional ConvLSTM, BiConvLSTM) network as the core, and the BiConvLSTM can take the time dependence of the load sequence’s contextual state information into account and extract the local features more deeply and accurately; the self-attention mechanism strengthens the inner correlation of long-range load features, it significantly reduces the number of model parameters, solving the problems of degraded prediction performance and low computing efficiency caused by the length of the load sequence. The experimental results showed that the accuracy and F1-score of the rotary tillage quality prediction based on the measured data reached 97.40% and 97.47%, respectively, and the model had a considerable improvement in the prediction efficiency with the highest increase of 14.09%, and all evaluation metrics were better than the experimental results of the control group. This model ensures the breadth, depth and correlation of the load feature, significantly simplifies the model complexity and directly improves the prediction effect and computing efficiency. This study explores an intelligent technological approach for predicting the quality of rotary tillage, providing a new technological reference for the research and application of precision agriculture.
On the use of machine learning methods to improve the estimation of gross primary productivity of maize field with drip irrigation
2023, Ecological Modelling
Citation excerpt:
Yang et al. (2019) trained CNN with high-resolution UAV images to predict the yield of rice, and the results obtained were far better than the statistical model based on vegetation index. Yue et al. (2020) used the convolutional LSTM to predict the variation of meteorological factors and then predicted the growth stage of crops. Cao et al. (2021) used LSTM to predict rice yields in China.
As an important part of the energy-matter cycle, gross primary productivity (GPP) reflects the ability of plants to absorb carbon dioxide from the atmosphere. The accurate estimation of GPP is critical to understanding the regional carbon cycle. With the development of machine learning (ML) theory, machine learning models are increasingly used to study complex phenomena with high variability in time and space. In this study, three machine learning models (support vector regression (SVR), artificial neural network (ANN), and long short-term memory networks (LSTM)) were investigated for predicting GPP in northwest China and compared with traditional physical models. Carbon flux, various environmental factors, and maize growth indices were measured in the maize field over five years in northwest China. The rigorous analysis, which included statistical comparison and cross-validation of the GPP predictions, confirmed that the machine learning models performed better than traditional physical models. The SVR model performed best among the considered ML models, with the highest Nash efficiency coefficient and the lowest root mean squared error. The machine learning models also outperformed the traditional physical models on cloudy days and after irrigation. The SVR achieved good prediction accuracy and high stability. With different training data sets, the ANN and LSTM were relatively more sensitive to the training data set. When the training data was sufficient, SVR, ANN and LSTM could achieve similar prediction accuracy, but SVR was slightly higher. When the training data was small, the simulation accuracy of SVR was better than that of ANN and LSTM. The performance of ANN and LSTM was more sensitive to parameter selection, and the relationship between model performance and parameter selection had no obvious regularity. Based on this comprehensive comparison study, it was concluded that the SVR model can be successfully applied to GPP simulation of maize fields, which provides a new perspective for the application of machine learning modeling in GPP simulation.
A deep learning method for the long-term prediction of plant electrical signals under salt stress to identify salt tolerance
2021, Computers and Electronics in Agriculture
Citation excerpt:
Long short-term memory (LSTM) networks excel in learning, processing, and classifying time series data, such as plant electrical signals. The 1D-CNN-LSTM network, formed by combining the 1D-CNN and LSTM networks, performs well in fields such as speech prediction and electroencephalogram classification (Fu et al., 2019; Li et al., 2020; Niu et al., 2019; Qiao et al., 2020; Salau et al., 2020; Turkoglu et al., 2019; Yue et al., 2020; D.-J. Zhao et al., 2019; Zhu et al., 2020). 1D-CNN-LSTM can learn both the correlation of local data and the long-term dependence relationship from the local features extracted by a CNN (J. Zhao et al. (2019)).
Screening salt-tolerant crops at different salt concentrations is of great significance but time-consuming. It remains a major challenge to automatically find the appropriate stress concentration to identify the salt tolerance of crops by using the recorded electrical signals of plants over long periods of time. To solve this problem, we designed a data-driven signal dynamics prediction model (SLSTM-TCNN), based on a one-dimensional convolutional neural network (1D-CNN) combined with a long short-term memory neural network. Furthermore, we developed a quantitative model, named the NaCl stress concentration discrimination model (SCDM), to investigate the relationship between the electrical signals, NaCl stress concentration, and time dependence, and used a salt tolerance classification model (STCM) to discover the most appropriate NaCl stress concentration for distinguishing the salt tolerance of wheat. These methods perform the time-consuming task of selecting salt-tolerant varieties of plants under different NaCl concentrations. The results show that the SLSTM-TCNN could quickly predict the signal dynamics of wheat leaves DeKang961 (salt-tolerant) and Langdon (salt-sensitive) under the ongoing stress of different NaCl concentrations. The accuracy of SCDM for differentiating NaCl stress concentrations increased to 88% and 83%, and that of STCM for classifying salt-tolerant and salt-sensitive varieties reached 92.36%. Finally, it was found that the salt tolerance of the two varieties (DeKang961 and Langdon) was higher when the NaCl concentration was in the range of 50–200 mM. In the future, the method will be a potentially useful tool for identifying the salt tolerance of other crops at the seedling stage.
