Skip to main content

From What and When Happen, to Why Happen in Air Pollution Using Open Big Data

  • Conference paper
  • First Online:
Book cover Web and Wireless Geographical Information Systems (W2GIS 2018)

Abstract

The air pollution phenomenon has been often studied from an environmental dimension but not from a spatial big data approach and considering social perception analysis. In order to understand such complex phenomenon a multidimensional analysis of heterogeneous environmental data might provide new insights. Notably, the Mexico government has released open data on air quality that contains the historical behavior of air pollution in Mexico City, while social networks data provides rich descriptions regarding regional social problems. In order to take into account the respective contributions of these two data sources from a spatial-temporal perspective, we introduce a multidimensional approach whose objective will be to integrate these heterogeneous data sources in an unified framework. While human perception often embedded in social media is naturally subjective, public data is rather objective and reliable, while they are described at different levels of temporal granularity and scale. Therefore, the search for a sound integration of these data sources is surely a non-straightforward issue. The research presented in this paper introduces a modelling and data mining approach to search for spatial-temporal patterns that can describe not only what happens, but also why such phenomenon happens. The whole framework is applied to the study of air pollution in Mexico City. The idea is to connect unstructured data (social data) and structured spatial data (open data) through the reconciliation of spatial-temporal correspondences between them to discover new geographic knowledge on Air Pollution phenomenon in Mexico City.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 60.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hardy, K., Maurushat, A.: Opening up government data for Big Data analysis and public benefit. Comput. Law Secur. Rev. 33(1), 30–37 (2017). https://doi.org/10.1016/j.clsr.2016.11.003. ISSN 0267-3649

    Article  Google Scholar 

  2. Chen, X., Shao, S., Tian, Z., Xie, Z., Yin, P.: Impacts of air pollution and its spatial spillover effect on public health based on China’s big data sample. J. Clean. Prod. 142, 915–925 (2017). https://doi.org/10.1016/j.jclepro.2016.02.119. ISSN 0959-6526

    Article  Google Scholar 

  3. Ang, L., Phooi, K.: Big sensor data applications in urban environments. Big Data Res. 4, 1–12 (2016). https://doi.org/10.1016/j.bdr.2015.12.003. ISSN 2214-5796

    Article  Google Scholar 

  4. Mayer-Schonberger, V., Cukier, K.: Big Data: A Revolution That Will Transform How We Live, Work, and Think. Eamon Dolan/Houghton, Mifflin Harcourt (2014)

    Google Scholar 

  5. Marien M. Global challenges for humanity (2014). http://www.millennium-project.org/millennium/challenges.html

  6. Miller, H., Han, J.: Geographic Data Mining and Knowledge Discovery. Chapman & Hall/CRC Data Mining and Knowledge Discovery Series, 2nd edn. CRC Press, Boca Raton (2007)

    Google Scholar 

  7. Chakrabarti, A.: Cross-correlation patterns in social opinion formation with sequential data. Phys. A Stat. Mech. Its Appl. 462, 442–454 (2016). ISSN 0378-4371

    Article  Google Scholar 

  8. Bakshy, E., Messing, S., Adamic, L.: Exposure to ideologically diverse news and opinion on Facebook. Science 348, 1130–1132 (2015). Sciencemag.org

    Article  MathSciNet  Google Scholar 

  9. Lee, J., Kang, M.: Geospatial big data: challenges and opportunities. Big Data Res. 2(2), 74–81 (2015). https://doi.org/10.1016/j.bdr.2015.01.003. ISSN 2214-5796

    Article  MathSciNet  Google Scholar 

  10. Blazquez, D., Domenech, J.: Big Data sources and methods for social and economic analyses. Technol. Forecast. Soc. Change (2017). https://doi.org/10.1016/j.techfore.2017.07.027. ISSN 0040-1625

  11. Zagal-Flores, R., Mata, M., Claramunt, C.: Geographical knowledge discovery applied to the social perception of pollution in the City of Mexico. In: 9th ACM SIGSPATIAL International Workshop on Location-Based Social Networks (2016). https://doi.org/10.1145/3021304.3021307

  12. Mata, F., et al.: A mobile information system based on crowd-sensed and official crime data for finding safe routes: a case study of Mexico City. Mob. Inf. Syst. 11 p. (2016). https://doi.org/10.1155/2016/806. Article ID 8068209

  13. Zagal-Flores, R., Mata-Rivera, F., Claramunt, C.: Discovering geographical patterns of crime localization in Mexico City. In: WEB 2017: The Fifth International Conference on Building and Exploring Web Based Environments (2017). ISBN 978-1-61208-557-9

    Google Scholar 

  14. Mata, F., Torres-Ruiz, M., Zagal, R.: A cross-domain framework for designing healthcare mobile applications mining social networks to generate recommendations of training and nutrition planning. Telematics Inform. (2017). https://doi.org/10.1016/j.tele.2017.04.005

    Article  Google Scholar 

  15. Yuan, M.: Use of knowledge acquisition to build wildfire representation in geographic information systems. Int. J. Geogr. Inf. Syst. 11, 723–745 (1997)

    Article  Google Scholar 

  16. Di Martino, S., Bimonte, S., Bertolotto, M., Ferrucci, F., Leano, V.: Spatial online analytical processing of geographic data through the Google earth interface. In: Murgante, B., Borruso, G., Lapucci, A. (eds.) Geocomputation, Sustainability and Environmental Planning. Studies in Computational Intelligence, vol. 348. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-19733-8_10

    Chapter  Google Scholar 

  17. Mahboubi, H., et al.: Semi-automatic design of spatial data cubes from simulation model results. Int. J. Data Warehous. Min. 9(1), 70–95 (2013). Academic OneFile. http://link.galegroup.com/apps/doc/A340297894/AONE?u=pu&sid=AONE&xid=01ca4c69

    Article  Google Scholar 

  18. Wakamiya, S., Belouaer, L., Brosset, D., Kawai, Y., Claramunt, C., Sumiya, K.: Exploring geographical crowd’s emotions with Twitter. Inf. Media Technol. (2015). https://doi.org/10.11185/imt.10.35. Online ISSN 1881-0896

  19. Bishop, C.: Pattern Recognition and Machine Learning. Springer, New York (2006). ISBN 978-0-387-31073-2

    MATH  Google Scholar 

  20. Abu-Mostafa, Y., Magdon-Ismail, M., Lin, H.: Learning from Data: A Short Course (2012). AMLBOOK.com

  21. Zhang, C., Liu, C., Zhang, X., Almpanidis, G.: An up-to-date comparison of state-of-the-art classification algorithms. Expert Syst. Appl. 82, 129–150 (2017). ISSN 0957-4174

    Google Scholar 

  22. Srivastava, A., Text, S.M.: Mining: Classification, Clustering, and Applications. CRC Press, Boca Raton (2009). 328 pages

    Book  Google Scholar 

  23. Zheng, Y., Chen, X., Jin, Q., Chen, Y., Qu, X., Liu, X., Chang, E., Ma, W., Rui, Y., Sun, W.: A cloud-based knowledge discovery system for monitoring fine-grained air quality. MSR-TR-2014-40. Microsoft Research Asia (2014)

    Google Scholar 

  24. Zheng, Y., Yi, X., Li, M., Li, R., Shan, Z., Chang, E., Li, T.: Forecasting fine-grained air quality based on Big Data. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2015), pp. 2267–2276. ACM, New York (2015). https://doi.org/10.1145/2783258.2788573

  25. Chen, X., Shao, S., Tian, Z., Xie, Z., Peng, Y.: Impacts of air pollution and its spatial spillover effect on public health based on China’s big data sample. J. Clean. Prod. 142(Part 2), 915–925 (2017). ISSN 0959-6526

    Article  Google Scholar 

Download references

Acknowledgment

The authors of this paper thank God, CONACYT project number 1051, the Laboratorio de Cómputo Móvil-UPIITA, COFAA-IPN, SIP-IPN project 20171086, and Instituto Politécnico Nacional (IPN) for their support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christophe Claramunt .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zagal-Flores, R., Felix Mata, M., Claramunt, C. (2018). From What and When Happen, to Why Happen in Air Pollution Using Open Big Data. In: R. Luaces, M., Karimipour, F. (eds) Web and Wireless Geographical Information Systems. W2GIS 2018. Lecture Notes in Computer Science(), vol 10819. Springer, Cham. https://doi.org/10.1007/978-3-319-90053-7_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-90053-7_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-90052-0

  • Online ISBN: 978-3-319-90053-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics