Abstract
Public health in developed countries is heavily affected by pollution specially in highly populated areas. Amongst the pollutants with greatest impact in health, ozone is particularly addressed in this paper due to importance of its effect on cardiovascular and respiratory problems and their prevalence on developed societies. Local authorities are compelled to provide satisfactory predictions of ozone levels and thus the need of proper estimation tools rises. A data driven approach to prediction demands high quality data but those observations collected by weather stations usually fail to meet this requirement. This paper reports a new approach to robust ozone levels prediction by using an outlier detection technique in an innovative way. The aim is to assess the feasibility of using raw data without preprocessing in order to obtain similar or better results than with traditional outlier removal techniques. An experimental dataset from a location in Spain, Ponferrada, is used through an experimental stage in which such approach provides satisfactory results in a difficult case.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Folinsbee, L.J., Bedi, J.F., Horvath, S.M.: Respiratory responses in humans repeatedly exposed to low concentrations of ozone. Am. Rev. Respir. Dis. 121(3), 431–439 (1980). http://dx.doi.org/10.1164/arrd.1980.121.3.431
Goudarzi, G., Geravandi, S., Foruozandeh, H., Babaei, A.A., Alavi, N., Niri, M.V., Khodayar, M.J., Salmanzadeh, S., Mohammadi, M.J.: Cardiovascular and respiratory mortality attributed to ground-level ozone in Ahvaz, Iran. Environ. Monit. Assess. 187(8), 487 (2015)
Rad, H.D., Babaei, A.A., Goudarzi, G., Angali, K.A., Ramezani, Z., Mohammadi, M.M.: Levels and sources of BTEX in ambient air of Ahvaz metropolitan city. Air Qual. Atmos. Health 7(4), 515–524 (2014)
Gryparis, A., Forsberg, B., Katsouyanni, K., Analitis, A., Touloumi, G., Schwartz, J., Samoli, E., Medina, S., Anderson, H.R., Niciu, E.M.: Acute effects of ozone on mortality from the “air pollution and health: a European approach” project. Am. J. Respir. Crit. Care Med. 170(10), 1080–1087 (2004)
Wakamatsu, S., Kanda, I., Okazaki, Y., Saito, M., Yamamoto, M., Watanabe, T., Maeda, T., Mizohata, A.: A comparative study of urban air quality in megacities in Mexico and Japan: based on Japan-Mexico joint research project on formation mechanism of ozone, VOCs and PM 2.5, and Proposal of Countermeasure Scenario A Study on Urban Air Pollution Improvement (145) (2017)
Anh, D.D., Co, H.X., Kim Oanh, N.T.: Photochemical smog introduction and episode selection for the ground-level ozone in Hanoi, Vietnam. VNU J. Sci. Earth Environ. Sci. 24(4), 169–175 (2008)
Ministerio de la Presidencia: Boletín Oficial del Estado- Real Decreto 102/2011, de 28 de enero, relativo a la mejora de la calidad del aire, pp. 1–53 (2011)
Sousa, S.I.V., Alvim-Ferraz, M.C.M., Martins, F.G.: Health effects of ozone focusing on childhood asthma: what is now known - a review from an epidemiological point of view. Chemosphere 90(7), 2051–2058 (2013)
Rajagopalan, S., Brook, R.D.: Ozone-induced Metabolic Effects in Humans. Ieiunium, Conviviorum, aut Timor? (Fasting, Feasting, or Fear?) (2016)
(CLRTAP), C.o.L.r.T.A.P.: The European Monitoring and Evaluation Programme (EMEP) (2017). http://www.emep.int/
Zhang, Y., Meratnia, N., Havinga, P.: Outlier detection techniques for wireless sensor networks: a survey. IEEE Commun. Surv. Tutorials 12(2), 159–170 (2010). http://ieeexplore.ieee.org/document/5451757/
Bing, G., Meré, J.O., Cabrera, C.B.: Prediction models for ozone in metropolitan area of Mexico City based on artificial intelligence techniques. Int. J. Inf. Decis. Sci. 7(2), 115 (2015). http://www.inderscience.com/link.php?id=68756
Gong, B., Ordieres-Meré, J.: Prediction of daily maximum ozone threshold exceedances by preprocessing and ensemble artificial intelligence techniques: case study of Hong Kong. Environ. Model Softw. 84, 290–303 (2016). http://www.sciencedirect.com/science/article/pii/S1364815216302602
Salazar-Ruiz, E., Ordieres, J., Vergara, E., Capuz-Rizo, S.: Development and comparative analysis of tropospheric ozone prediction models using linear and artificial intelligence-based models in Mexicali, Baja California (Mexico) and Calexico, California (US). Environ. Model Softw. 23(8), 1056–1069 (2008). http://www.sciencedirect.com/science/article/pii/S1364815207002228
Hájek, P., Olej, V.: Ozone prediction on the basis of neural networks, support vector regression and methods with uncertainty. Ecol. Inf. 12, 31–42 (2012). http://www.sciencedirect.com/science/article/pii/S1574954112000891
Coman, A., Ionescu, A., Candau, Y.: Hourly ozone prediction for a 24-h horizon using neural networks. Environ. Model Softw. 23(12), 1407–1421 (2008). http://www.sciencedirect.com/science/article/pii/S1364815208000650
Lu, W.Z., Wang, D.: Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme. Sci. Total Environ. 395(2), 109–116 (2008). http://www.sciencedirect.com/science/article/pii/S0048969708000776
Gómez-Sanchis, J., Martín-Guerrero, J.D., Soria-Olivas, E., Vila-Francés, J., Carrasco, J.L., del Valle-Tascón, S.: Neural networks for analysing the relevance of input variables in the prediction of tropospheric ozone concentration. Atmos. Environ. 40(32), 6173–6180 (2006). http://www.sciencedirect.com/science/article/pii/S1352231006005243
Ordieres, J., Vergara, E., Capuz, R., Salazar, R.: Neural network prediction model for fine particulate matter (PM2.5) on the US Mexico border in El Paso (Texas) and Ciudad Juárez (Chihuahua). Environ. Model Softw. 20(5), 547–559 (2005). http://www.sciencedirect.com/science/article/pii/S1364815204000830
Ng, H.W., Winkler, S.: A data-driven approach to cleaning large face datasets. In: 2014 IEEE International Conference on Image Processing (ICIP), pp. 343–347 (2014)
Kannan, K.S., Manoj, K., Arumugam, S.: Full Length Research Paper Outlier Detection and Missing value in Time Series Ozone Data (X), 1–7 (2012)
Limas, M.C., Ordieres Meré, J.B., de Pisón Ascacibar, F.J.M., González, E.P.V.: Outlier detection and data cleaning in multivariate non-normal samples: the paella algorithm. Data Min. Knowl. Disc. 9(2), 171–187 (2004). http://dx.doi.org/10.1023/B:DAMI.0000031630.50685.7c
León, J.d.C.Y.: Red de Control de la Calidad del Aire. http://www.medioambiente.jcyl.es/web/jcyl/MedioAmbiente/es/Plantilla66y33/1197275675880/%7B_%7D/%7B_%7D/%7B_%7D
Convention on Long-Range Transboundary Air Pollution (1979). https://www.opcw.org/chemical-weapons-convention/related-international-agreements/toxic-chemicals-and-the-environment/long-range-transboundary-air-pollution/
AEMET OpenData - Agencia Estatal de Meteorología - AEMET. Gobierno de España. http://www.aemet.es/es/datos%7B_%7Dabiertos/AEMET%7B_%7DOpenData
Romano, J.P., Wolf, M.: Resurrecting weighted least squares. J. Econ. 197, 1–19 (2017). http://www.elsevier.com/locate/jeconom
Castejón-Limas, M., Alaiz-Moreton, H., Fernández-Robles, L., Fernández-Llamas, C.: Coupling the paella algorithm to predictive models (2017). Manuscript submitted for publication
Acknowledgements
We gratefully acknowledge the financial support of Spanish Ministerio de Economía, Industria y Competitividad through grant DPI2016-79960-C3-2-P.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Alaiz-Moreton, H., Fernández-Robles, L., Alfonso-Cendón, J., Castejón-Limas, M., Sánchez-González, L., Pérez, H. (2018). Data Mining Techniques for the Estimation of Variables in Health-Related Noisy Data. In: Pérez García, H., Alfonso-Cendón, J., Sánchez González, L., Quintián, H., Corchado, E. (eds) International Joint Conference SOCO’17-CISIS’17-ICEUTE’17 León, Spain, September 6–8, 2017, Proceeding. SOCO ICEUTE CISIS 2017 2017 2017. Advances in Intelligent Systems and Computing, vol 649. Springer, Cham. https://doi.org/10.1007/978-3-319-67180-2_47
Download citation
DOI: https://doi.org/10.1007/978-3-319-67180-2_47
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67179-6
Online ISBN: 978-3-319-67180-2
eBook Packages: EngineeringEngineering (R0)