Prediction of maize growth stages based on deep learning

https://doi.org/10.1016/j.compag.2020.105351Get rights and content

Highlights

  • ConvLSTM encoder-decoder model can forecast daily weather factors correctly.

  • Hybrid model and data-driven model can predict maize growth stages.

  • Data-driven model outperforms hybrid model in forecasting performance.

  • The proposed techniques can be used for the arrangement of agricultural activities.

Abstract

An accurate forecast of daily meteorological factors throughout the year is not only of great significance for the study of climate trends in a certain area but also enables the prediction of crop growth stages. Moreover, the prediction of crop growth stages is related to the scheduling of planting and tillage, the determination of machine harvest time, and the prediction of crop yield. However, highly complex dynamics cause large volatility in meteorological factors, so it is very challenging to predict the crop growth stage accurately, based on weather data. To solve this problem, we propose a data-driven encoder-decoder model, using long short-term memory (LSTM) and convolutional LSTM (ConvLSTM), which can be applied to forecast daily sunshine duration, cumulative precipitation, and average temperature for the coming year. To further test the performance of the ConvLSTM-based model, it is compared with the conventional LSTM encoder-decoder model and the convolutional neural network (CNN)-LSTM encoder-decoder model. The results demonstrate that, the ConvLSTM-based model is more accurate than the others for forecasting temperature (MAE = 2.602 °C, RMSE = 3.456 °C), precipitation (MAE = 3.878 mm, RMSE = 10.503 mm), and sunshine hours (MAE = 3.445 h, RMSE = 4.172 h) in 2014–2016. Furthermore, precise forecasting of meteorological factors allows us to develop a hybrid model and a data-driven model for the prediction of each growth stage separately. The hybrid model combines the ConvLSTM encoder-decoder model with empirical models, whereas the data-driven model comprises the ConvLSTM encoder-decoder model and traditional neural network structures. Finally, we compared the two types of models on a real-world dataset from Dandong, and concluded that the data-driven model is more accurate than the hybrid model for prediction of maize growth stages, with R2 in the range of 0.755–0.883, MAE 0.588–2.205 days, and RMSE 0.978–2.729 days. In the future, these models can also be used to predict the growth stages of other crops.

Introduction

A growth stage model is one of the most important parts of a crop growth model (Ceglar et al., 2011). The accurate prediction of crop growth stages can help agricultural workers predict crop yield effectively, arrange farming activities efficiently, and determine an appropriate harvesting time (Van Oort et al., 2011). In the past few decades, many traditional methods have been used to predict the growth stage of crops, most of which function by simulating the processes of actual crop development. In addition, weather conditions play an indispensable role in crop development. Therefore, many researchers have predicted the growth stage of crops from meteorological factors, such as temperature, precipitation, and solar radiation. Of these factors, temperature is one of the main driving forces affecting crop development (Wang et al., 2017). As a consequence, many studies on temperature-based growth prediction have been reported (Kumudini et al., 2014, Soltani et al., 2006, Yun et al., 2017, Zhang and Tao, 2013, Jones et al., 1986). In addition, research has been conducted on the prediction of crop growth stage based on multiple meteorological factors (Jones, 1986, Yang et al., 2004, Zhao et al., 2018). Among these studies, the climate suitability model (Xu, 2014) is notable: this is based on multiple meteorological factors and has been used to analyze the relationship between climate and the crop growth process; it could also be used to predict the crop growth stage and yield. Beyond meteorological data, satellite remote sensing data are the most important for monitoring large-scale crop conditions and obtaining crop growth information (Meng et al., 2008). Hence, studies on the prediction of the growth stage of crops and vegetation using remote sensing data have also increased in recent years (Hu et al., 2009, Liu et al., 2017, Sakamoto et al., 2005, Yu et al., 2012). Most of the above methods depend on existing domain knowledge, and have drawbacks, such as a fixed framework and a lack of flexibility. In contrast with traditional methods, a data-driven model can be used for modeling directly from the data when there is of insufficient domain knowledge. Models created in this way even have a better non-linear fitting ability than models based on domain knowledge (Alhnaity et al., 2019, Haider et al., 2019, Nevavuori et al., 2019, Reddy and Prasad, 2018, Yalcin, 2017). We can conclude all the methods mentioned above can calculate the growth stage of crops or vegetation under certain conditions. However, most of them have the shortcoming that they cannot provide long-term predictions, and need the support of meteorological factors of the forecast year when calculating growth stages. That is, most of the above models need the weather data at the appropriate timescale to function normally, which also limits the model flexibility to a certain extent.

Therefore, these problems have hindered the longer-term prediction of crop growth stage. The key to solving the problems lies in the long-term forecasting of meteorological factors and establishing the climate model in a generalizable manner. The weather data needed for the growth stage model are mostly subsets of the daily meteorological factors throughout the year. Predicting the growth stage of crops requires the forecasting of several weather factors that play an important role in crop growth in the future year. Moreover, the daily weather of a whole year is a time series with large random variation, whose internal characteristics are difficult to learn. Therefore, we need to find a method of providing stable weather data for the growth stage model, based on annual day-to-day meteorological factors. Fortunately, the implicit relationships within the time series can be extracted well by deep learning methods, and they can establish more complex models for data prediction. Consequently, we hope to predict the crop growth stage far in advance by means of deep learning and meteorological data. For the forecast of short-term meteorological factors, a recurrent neural network (RNN) (Mikolov et al., 2010) is a suitable model. This model and extended models have proven to perform well in time series forecasting problems, such as meteorological data prediction (Kumar et al., 2019, Poornima and Pushpalatha, 2019, Qing and Niu, 2018, Wang et al., 2018). Convolutional long short-term memory (ConvLSTM) is a special-purpose variant of RNN that was established by Shi et al. for precipitation nowcasting (Shi et al., 2015). ConvLSTM can be used to construct a good temporal sequence relationship and can perform local feature extraction, like a CNN. Experimental results showed excellent performance in the forecasting of approaching precipitation (Souto et al., 2018). However, most of the above neural network structures are suitable only for short-term weather forecasting. It is difficult to forecast the daily meteorological factors for a whole year independently by using these structures. The research described in this paper addresses this challenge.

In view of the aforementioned characteristics of daily meteorological factors, an encoder-decoder model, combined with both ConvLSTM and long short-term memory (LSTM), was developed in this study. The model can be used to forecast the daily mean temperature, daily sunshine duration, and daily accumulated precipitation, which are suitable for growth stage models, throughout the whole year. In addition, the model is combined with a climate suitability model to form a hybrid model, and combined with back-propagation (BP) neural networks to form a data-driven model. These two types of models are then applied to predict maize growth stages, and their forecasting performance is compared.

Section snippets

Study region

The research area of this study is located in Dandong City, Liaoning Province, China. Its coordinates are 39°43′N to 41°09′N and 123°22′E to 125°42′E (Fig. 1). This area has a warm temperate sub-humid monsoon climate: it is mild and humid all year around, and rarely experiences extreme weather. The study region is one of the wettest areas in northern China, with a total annual rainfall of 800–1200 mm. The annual average temperature across the region is 8.9 °C, and the average annual number of

Data processing and hyperparameter selection of ConvLSTM encoder-decoder model

We used eight subsequences of daily meteorological factors for Dandong from 1981 to 2016 for model training and testing; each subsequence contained 13,068 samples. For these time series, the data for the first 32 years (11616 samples from 1981 to 2012, each containing eight weather features) were used as the training set, and the last four years (1452 samples from 2013 to 2016, each containing eight weather features) acted as the test set. In addition, we expanded the dataset by a sliding

Comparison with other encoder-decoder models

Table 8, Table 9, Table 10 show the MAE and RMSE of the predictions of temperature, precipitation, and sunshine hours by our proposed model, the CNN-LSTM encoder-decoder model (Kim and Cho, 2019), the LSTM encoder-decoder model (Park et al., 2018), ConvLSTM, LSTM, and gated recurrent unit (GRU) (Chung et al., 2014) for the test set. The units of temperature, precipitation, and sunshine duration are °C, mm, and h, respectively. Overall, the ConvLSTM encoder-decoder model was superior to other

Discussion

In this work, the ConvLSTM encoder-decoder model was first designed to predict the annual daily weather data for growth stage models. Through grid search tuning, the parameter selection of all models participating in performance comparison were similar. In the CNN-LSTM encoder-decoder model, the 1D convolution layer was used as a part of the encoder, and the number of filters selected was 64, which was the same as the number of filters of the ConvLSTM layer; LSTM was used as the decoder in all

Conclusion

In this paper, the ConvLSTM encoder-decoder model has been proposed for forecasting meteorological time series. This study has demonstrated that the hybrid model and data-driven model are effective methods for the prediction of maize growth stages, and that the predictive capability of the data-driven model is better than that of the hybrid model. Moreover, this study has also verified the feasibility and practicability of using the dates of growth stages and the meteorological data of the

CRediT authorship contribution statement

Yang Yue: Conceptualization, Methodology, Software, Writing - original draft, Formal analysis, Validation, Writing - review & editing. Jin-Hai Li: Writing - original draft, Visualization, Writing - review & editing. Li-Feng Fan: Writing - original draft, Formal analysis, Writing - review & editing. Li-Li Zhang: Resources, Data curation. Peng-Fei Zhao: Writing - original draft. Qiao Zhou: Writing - original draft. Nan Wang: Writing - original draft. Zhong-Yi Wang: Writing - original draft. Lan

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported by the National Key Research and Development Program of China [grant number 2016YFD0300304].

References (48)

  • K. Yun et al.

    Can a multi-model ensemble improve phenology predictions for climate change studies?

    Ecol. Model.

    (2017)
  • L.A. Zadeh

    Fuzzy sets

  • S. Zhang et al.

    Modeling the response of rice phenology to climate change and variability in different climatic zones: comparisons of five models

    Eur. J. Agron.

    (2013)
  • J.F. Zhao et al.

    Effects of climate change on cultivation patterns of spring maize and its climatic suitability in Northeast China

    Agric. Ecosyst. Environ.

    (2015)
  • Alhnaity, B., Pearson, S., Leontidis, G., Kollias, S.J.a.p.a., 2019. Using deep learning to predict plant growth and...
  • Cho, K., Van Merriënboer, B., Bahdanau, D., Bengio, Y.J.a.p.a., 2014. On the properties of neural machine translation:...
  • Chung, J., Gulcehre, C., Cho, K., Bengio, Y., 2014. Empirical evaluation of gated recurrent neural networks on sequence...
  • Gers, F.A., Schmidhuber, J., Cummins, F., 1999. Learning to forget: Continual prediction with LSTM....
  • Chollet, F., keras[J]. 2015....
  • M. Ghamghami et al.

    Comparison of data mining and GDD-based models in discrimination of maize phenology

    Int. J. Plant Prod.

    (2019)
  • S.A. Haider et al.

    LSTM neural network based forecasting model for wheat production in Pakistan

    Agronomy-Basel

    (2019)
  • Y.-Y. Hou et al.

    Simulation model of spring maize developmental stages in Northeast China based on climatic suitability

    Chin. J. Ecol/

    (2012)
  • Ioffe, S., Szegedy, C.J.a.p.a., 2015. Batch normalization: Accelerating deep network training by reducing internal...
  • Jones, C.A., et al. Subroutine structure[J]. CERES-Maize: a simulation model of maize growth and development/edited by...
  • Cited by (20)

    • On the use of machine learning methods to improve the estimation of gross primary productivity of maize field with drip irrigation

      2023, Ecological Modelling
      Citation Excerpt :

      Yang et al. (2019) trained CNN with high-resolution UAV images to predict the yield of rice, and the results obtained were far better than the statistical model based on vegetation index. Yue et al. (2020) used the convolutional LSTM to predict the variation of meteorological factors and then predicted the growth stage of crops. Cao et al. (2021) used LSTM to predict rice yields in China.

    • A deep learning method for the long-term prediction of plant electrical signals under salt stress to identify salt tolerance

      2021, Computers and Electronics in Agriculture
      Citation Excerpt :

      Long short-term memory (LSTM) networks excel in learning, processing, and classifying time series data, such as plant electrical signals. The 1D-CNN-LSTM network, formed by combining the 1D-CNN and LSTM networks, performs well in fields such as speech prediction and electroencephalogram classification (Fu et al., 2019; Li et al., 2020; Niu et al., 2019; Qiao et al., 2020; Salau et al., 2020; Turkoglu et al., 2019; Yue et al., 2020; D.-J. Zhao et al., 2019; Zhu et al., 2020). 1D-CNN-LSTM can learn both the correlation of local data and the long-term dependence relationship from the local features extracted by a CNN (J. Zhao et al. (2019)).

    View all citing articles on Scopus
    View full text