skip to main content
10.1145/3589132.3625575acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
short-paper

Unleashing Realistic Air Quality Forecasting: Introducing the Ready-to-Use PurpleAirSF Dataset

Published: 22 December 2023 Publication History

Abstract

Air quality forecasting has garnered significant attention recently, with data-driven models taking center stage due to advancements in machine learning and deep learning models. However, researchers face challenges with complex data acquisition and the lack of open-sourced datasets, hindering efficient model validation. This paper introduces PurpleAirSF, a comprehensive and easily accessible dataset collected from the PurpleAir network. With its high temporal resolution, various air quality measures, and diverse geographical coverage, this dataset serves as a useful tool for researchers aiming to develop novel forecasting models, study air pollution patterns, and investigate their impacts on health and the environment. We present a detailed account of the data collection and processing methods employed to build PurpleAirSF. Furthermore, we conduct preliminary experiments using both classic and modern spatio-temporal forecasting models, thereby establishing a benchmark for future air quality forecasting tasks.

References

[1]
AQI & Health | AirNow.gov. 2023. https://www.airnow.gov/aqi-and-health/
[2]
Ane Blázquez-García, Angel Conde, Usue Mori, and Jose A Lozano. 2021. A review on outlier/anomaly detection in time series data. ACM Computing Surveys (CSUR) 54, 3 (2021), 1--33.
[3]
Song Chen. 2019. Beijing Multi-Site Air-Quality Data. UCI Machine Learning Repository.
[4]
EPA. 2018. World's Air Pollution: Real-time Air Quality Index. https://waqi.info/{#}/c/6.769/8.931/2.1z{%}0Ahttps://waqi.info/
[5]
Jindong Han, Hao Liu, Haoyi Xiong, and Jing Yang. 2022. Semi-Supervised Air Quality Forecasting via Self-Supervised Hierarchical Graph Neural Network. IEEE Transactions on Knowledge and Data Engineering (may 2022).
[6]
Yuxuan Liang, Yutong Xia, Songyu Ke, Yiwei Wang, Qingsong Wen, Junbo Zhang, Yu Zheng, and Roger Zimmermann. 2022. AirFormer: Predicting Nationwide Air Quality in China with Transformers. (2022). arXiv:2211.15979
[7]
Haitao Lin, Zhangyang Gao, Yongjie Xu, Lirong Wu, Ling Li, and Stan Z Li. 2022. Conditional local convolution for spatio-temporal meteorological forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 7470--7478.
[8]
Manuel Méndez, Mercedes G. Merayo, and Manuel Núñez. 2023. Machine learning algorithms to forecast air quality: a survey. Artif. Intell. Rev. (feb 2023), 1--36.
[9]
OpenAQ. 2023. https://openaq.org/
[10]
PurpleAir | Real-time Air Quality Monitoring. 2022. https://www2.purpleair.com/
[11]
Saverio Vito. 2016. Air Quality. UCI Machine Learning Repository.
[12]
Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, and Chengqi Zhang. 2019. Graph wavenet for deep spatial-temporal graph modeling. In IJCAI. 1907--1913.
[13]
Jinsung Yoon, Daniel Jarrett, and Mihaela Van der Schaar. 2019. Time-series generative adversarial networks. NeurIPS 32 (2019).
[14]
Yu Zheng, Furui Liu, and Hsun-Ping Hsieh. 2013. U-air: When urban air quality inference meets big data. In KDD. 1436--1444.
[15]
Jingwei Zuo, Karine Zeitouni, and Yehia Taher. 2021. Smate: Semi-supervised spatio-temporal representation learning on multivariate time series. In 2021 IEEE International Conference on Data Mining (ICDM). IEEE, 1565--1570.
[16]
Jingwei Zuo, Karine Zeitouni, Yehia Taher, and Sandra Garcia-Rodriguez. 2023. Graph convolutional networks for traffic forecasting with missing values. Data Mining and Knowledge Discovery 37, 2 (2023), 913--947.

Cited By

View all

Index Terms

  1. Unleashing Realistic Air Quality Forecasting: Introducing the Ready-to-Use PurpleAirSF Dataset

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          SIGSPATIAL '23: Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems
          November 2023
          686 pages
          ISBN:9798400701689
          DOI:10.1145/3589132
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 22 December 2023

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. air quality monitoring
          2. air quality forecasting
          3. air quality dataset
          4. spatio-temporal forecasting
          5. spatio-temporal data

          Qualifiers

          • Short-paper

          Conference

          SIGSPATIAL '23
          Sponsor:

          Acceptance Rates

          Overall Acceptance Rate 257 of 1,238 submissions, 21%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 80
            Total Downloads
          • Downloads (Last 12 months)57
          • Downloads (Last 6 weeks)2
          Reflects downloads up to 01 Mar 2025

          Other Metrics

          Citations

          Cited By

          View all

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media