Skip to main content

Hybrid of Time Series Regression, Multivariate Generalized Space-Time Autoregressive, and Machine Learning for Forecasting Air Pollution

  • Conference paper
  • First Online:
Soft Computing in Data Science (SCDS 2021)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1489))

Included in the following conference series:

  • 601 Accesses

Abstract

The purpose of this study is to propose a new hybrid of space-time models by combining the time series regression (TSR), multivariate generalized space-time autoregressive (MGSTAR), and machine learning (ML) to forecast air pollution data in the city of Surabaya. The TSR model is used to capture linear patterns of data, especially trends and double seasonal. The MGSTAR model is employed to capture the relationship between locations, and the ML model is used to capture nonlinear patterns from the data. There are three ML models used in this study, namely feed-forward neural network (FFNN), deep learning neural network (DLNN), and long short-term memory (LSTM). So that there are three hybrid models used in this study, namely TSR-MGSTAR-FFNN, TSR-MGSTAR-DLNN, and TSR-MGSTAR-LSTM. The hybrid models will be used to forecast air pollution data consisting of CO, PM10, and NO2 at three locations in Surabaya simultaneously. Then, the performance of these three-combined hybrid models will be compared with the individual model of TSR and MGSTAR, two-combined hybrid models of MGSTAR-FFNN, MGSTAR-DLNN, MGSTAR-LSTM, and hybrid TSR-MGSTAR models based on the RMSE and sMAPE values in the out-of-sample data. Based on the smallest RMSE and sMAPE values, the analysis results show that the best model for forecasting CO is MGSTAR, forecasting PM10 is hybrid TSR-MGSTAR, and forecasting NO2 is hybrid TSR-MGSTAR-FFNN. In general, the hybrid model has better accuracy than the individual models. This result is in line with the results of the M3 and M4 forecasting competition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Chen, S., Zhang, D.: Impact of air pollution on labor productivity: evidence from prison factory data. China Econ. Q. Int. 1, 148–159 (2021)

    Article  Google Scholar 

  2. Liu, Y., Zhou, Y., Lu, J.: Exploring the relationship between air pollution and meteorological conditions in china under environmental governance. Sci. Rep. 10, 14518 (2020)

    Article  Google Scholar 

  3. Reames, T.G., Bravo, M.A.: People, place and pollution: investigating relationships between air quality perceptions, health concerns, exposure, and individual and area-level characteristics. Environ. Int. 122, 244–255 (2019)

    Article  Google Scholar 

  4. Pfeifer, P.E., Deutsch, S.J.: A three stage iterative procedure for space-time modeling. Technometrics 22(1), 35–47 (1980)

    Article  Google Scholar 

  5. Borovkova, S., Lopuhaa, R., Ruchjana, B.N.: Generalized STAR model with experimental weights. In: Proceedings of the 17th International Workshop on Statistical Modelling, pp. 143–151 (2002)

    Google Scholar 

  6. Suhartono, Nahdliyah, N., Akbar, M.S., Salehah, N.A., Choiruddin, A.: A MGSTAR: an extension of the generalized space-time autoregressive model. J. Phys. Conf. Ser. 1752, 012015 (2021)

    Google Scholar 

  7. Zhang, G.P.: Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 50, 159–175 (2003)

    Article  Google Scholar 

  8. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521, 436–444 (2015)

    Article  Google Scholar 

  9. Srivastava, S., Lessmann, S.: A comparative study of LSTM neural networks in forecasting day-ahead global horizontal irradiance with satellite data. Sol. Energy 162, 232–247 (2018)

    Article  Google Scholar 

  10. Abbasimehr, H., Shabani, M., Yousefi, M.: An optimized model using LSTM network for demand forecasting. Comput. Ind. Eng. 143, 106435 (2020)

    Article  Google Scholar 

  11. Makridakis, S., Spiliotis, E., Assimakopoulus, V.: The M4 competition: results, findings, conclusion and way forward. Int. J. Forecast. 34, 802–808 (2018)

    Article  Google Scholar 

  12. Suhartono, Dana, I.M.G.M., Rahayu, S.P.: Hybrid model for forecasting space-time data with calendar variation effects. Telkomnika 17(1), 118–130 (2019)

    Google Scholar 

  13. Pusporani, E., Suhartono, Prastyo, D.D.: Hybrid multivariate generalized space-time autoregressive artificial neural network models to forecast air pollution data at Surabaya. In: AIP Conference Proceedings, vol. 2194, p. 020090 (2019)

    Google Scholar 

  14. Laily, V.O.N., Suhartono, Pusporani, E., Atok, R.M.: A novel hybrid Mgstar-Rnn model for forecasting spatio-temporal data. J. Phys. Conf. Ser. 1752, 012011 (2021)

    Google Scholar 

  15. Prastyo, D.D., Nabila, F.S., Suhartono, Lee, M.H., Suhermi, N., Fam, S.F.: VAR and GSTAR-based feature selection in support vector regression for multivariate spatio-temporal forecasting. In: Yap, B., Mohamed, A., Berry, M. (eds.) Soft Computing in Data Science. SCDS 2018. Communications in Computer and Information Science, vol. 937, pp. 46–57. Springer, Singapore (2018). https://doi.org/10.1007/978-981-13-3441-2_4

  16. Suhartono, Prastyo, D.D., Kuswanto, H., Lee, M.H.: Comparison between VAR, GSTAR, FFNN-VAR and FFNN-GSTAR models for forecasting oil production. MATEMATIKA 34(1), 103–111 (2018)

    Google Scholar 

  17. Suhartono, Prabowo, H., Fam, S.F.: A hybrid TSR and LSTM for forecasting NO2 and SO2 in Surabaya. In: Berry, M., Yap, B., Mohamed, A., Köppen, M. (eds.) Soft Computing in Data Science. SCDS 2019. Communications in Computer and Information Science, vol. 1100, pp. 107–120. Springer, Singapore (2019). https://doi.org/10.1007/978-981-15-0399-3_9

  18. Shummway, R.H., Stoffer, D.S.: Time Series Analysis and Its Application with R Examples. Springer, Pittsburg (2006). https://doi.org/10.1007/0-387-36276-2

    Book  Google Scholar 

  19. Fausset, L.: Fundamental of Neural Network: Architectures Algorithms and Applications. Prentice-Hall Inc., Hoboken (1994)

    Google Scholar 

  20. Hochreiter, S., Schmiduber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  21. Ma, X., Tao, Z., Wang, Y., Yu, H., Wang, Y.: Long short-term memory neural network for traffic speed prediction using remote microwave sensor data. Transp. Res. Part C: Emerg. Technol. 54, 187–197 (2015)

    Article  Google Scholar 

  22. Hajirahimi, Z., Khashei, M.: Hybrid structures in time series modeling and forecasting: a review. Eng. Appl. Artif. Intell. 86, 83–106 (2019)

    Article  Google Scholar 

  23. Khashei, M., Bijari, M.: A novel hybridization of artificial neural networks and ARIMA models for time series forecasting. Appl. Soft Comput. 11, 2664–2675 (2011)

    Article  Google Scholar 

  24. Wei, W.W.S.: Time Series Analysis Univariate and Multivariate Methods, 2nd edn. Pearson Education Inc., Boston (2006)

    MATH  Google Scholar 

  25. Suhartono, Suhermi, N., Prastyo, D.D.: Design of experiment to optimize the architecture of deep learning for nonlinear time series forecasting. Procedia Comput. Sci. 144, 269–276 (2018)

    Google Scholar 

  26. Makridakis, S., Hibbon, M.: The M3-competition result, conclusions and implications. Int. J. Forecast. 16, 451–676 (2000)

    Article  Google Scholar 

Download references

Acknowledgements

This research was supported by Deputi Bidang Penguatan Riset dan Pengembangan, Kementerian Riset dan Teknologi/ Badan Riset dan Inovasi Nasional under the scheme Penelitian Dasar, project no 3/E1/KP.PTNBH/2021 and 799/PKS/ITS/2021. The authors thank to DRPM ITS for the supports and to anonymous referees for their useful suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dedy Dwi Prastyo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Prabowo, H., Prastyo, D.D., Setiawan (2021). Hybrid of Time Series Regression, Multivariate Generalized Space-Time Autoregressive, and Machine Learning for Forecasting Air Pollution. In: Mohamed, A., Yap, B.W., Zain, J.M., Berry, M.W. (eds) Soft Computing in Data Science. SCDS 2021. Communications in Computer and Information Science, vol 1489. Springer, Singapore. https://doi.org/10.1007/978-981-16-7334-4_26

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-7334-4_26

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-7333-7

  • Online ISBN: 978-981-16-7334-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics