Abstract
This paper proposes a deep learning model that integrates a convolutional neural network (CNN) and a gated recurrent unit (GRU) with groups of neighboring stations to predict PM2.5 concentrations at 25 stations in Seoul, South Korea. The model uses observations from one Korea Meteorological Administration (KMA) station, 25 National Institute of Environmental Research (NIER) stations, and 28 automatic weather stations (AWSs) throughout Seoul. We train the model on all available meteorological and air quality data observed between 2015 and 2017 and use the trained model to predict PM2.5 concentrations at all 25 NIER stations for 2018. We also propose a geographical polygon group model that determines the optimal number of neighboring NIER stations needed to improve the accuracy of PM2.5 predictions at a target station. Across the 25 monitoring sites in 2018, the geographical polygon group model achieves an index of agreement (IOA) of 0.82–0.89 and a Pearson correlation coefficient of 0.70–0.83. Compared with a model that uses meteorological and air quality data from only the target station (average IOA = 0.77), the proposed method, which groups geographically correlated neighboring NIER stations into polygons (average IOA = 0.85), improves PM2.5 prediction accuracy by about 10% on average. This deep learning approach can be extended to predict air pollution or air quality indices up to several days in advance.
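The two evaluation measures named in the abstract can be illustrated with a short sketch. It assumes the commonly used form of Willmott's index of agreement, d = 1 − Σ(Pᵢ − Oᵢ)² / Σ(|Pᵢ − Ō| + |Oᵢ − Ō|)², where O is the observed series, P the predicted series, and Ō the observed mean; the abstract does not state which variant the authors used. The function names and the sample values are hypothetical and are not data from the study. The last helper reproduces the "about 10%" figure from the two average IOA values quoted above.

```python
import math


def index_of_agreement(obs, pred):
    """Willmott's index of agreement d (1.0 means perfect agreement)."""
    if len(obs) != len(pred) or not obs:
        raise ValueError("obs and pred must be non-empty and of equal length")
    obs_mean = sum(obs) / len(obs)
    # Numerator: sum of squared prediction errors.
    num = sum((p - o) ** 2 for o, p in zip(obs, pred))
    # Denominator: "potential error" measured around the observed mean.
    den = sum((abs(p - obs_mean) + abs(o - obs_mean)) ** 2
              for o, p in zip(obs, pred))
    if den == 0:
        raise ValueError("index of agreement is undefined for constant series")
    return 1.0 - num / den


def pearson_r(obs, pred):
    """Pearson correlation coefficient between two equal-length series."""
    if len(obs) != len(pred) or len(obs) < 2:
        raise ValueError("need two series of equal length with >= 2 points")
    mean_o = sum(obs) / len(obs)
    mean_p = sum(pred) / len(pred)
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(obs, pred))
    var_o = sum((o - mean_o) ** 2 for o in obs)
    var_p = sum((p - mean_p) ** 2 for p in pred)
    if var_o == 0 or var_p == 0:
        raise ValueError("correlation is undefined for constant series")
    return cov / math.sqrt(var_o * var_p)


def relative_improvement(baseline, proposed):
    """Fractional gain of the proposed score over the baseline score."""
    return (proposed - baseline) / baseline


# Hypothetical hourly PM2.5 values (ug/m3), for illustration only.
obs = [12.0, 30.0, 45.0, 22.0, 18.0]
pred = [15.0, 27.0, 40.0, 25.0, 16.0]

ioa = index_of_agreement(obs, pred)
r = pearson_r(obs, pred)
# Average IOA values quoted in the abstract: 0.77 (single station) vs 0.85.
gain = relative_improvement(0.77, 0.85)

print(f"IOA = {ioa:.3f}, r = {r:.3f}, relative IOA gain = {gain:.1%}")
```

Because the IOA normalizes the squared error by the spread around the observed mean, it penalizes bias in magnitude that the Pearson coefficient alone would not detect, which is one reason to report both measures together.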
Acknowledgements
This study was supported by the High Priority Area Research Seed Grant of the University of Houston.
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
About this article
Cite this article
Yeo, I., Choi, Y., Lops, Y. et al. Efficient PM2.5 forecasting using geographical correlation based on integrated deep learning algorithms. Neural Comput & Applic 33, 15073–15089 (2021). https://doi.org/10.1007/s00521-021-06082-8
DOI: https://doi.org/10.1007/s00521-021-06082-8