Skip to main content

Advertisement

Log in

A new method for prediction of air pollution based on intelligent computation

  • Methodologies and Application
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

The detection and treatment of increasing air pollution due to technological developments represent some of the most important challenges facing the world today. Indeed, there has been a significant increase in levels of environmental pollution in recent years. The aim of the work presented herein is to design an intelligent predictor for the concentrations of air pollutants over the next 2 days based on deep learning techniques using a recurrent neural network (RNN). The best structure for its operation is then determined using a particle swarm optimization (PSO) algorithm. The new predictor based on intelligent computation relying on unsupervised learning, i.e., long short-term memory (LSTM) and optimization (i.e., PSO), is called the smart air quality prediction model (SAQPM). The main goal is to predict six the concentrations of six types of air pollution, viz. PM2.5 particulate matter, PM10, particulate matter, nitrogen dioxide (NO2), carbon monoxide (CO), ozone (O3), and sulfur dioxide (SO2). SAQPM consists of four stages. The first stage involves data collection from multiple stations (35 in this case). The second stage involves preprocessing of the data, including (a) separation of each station with an independent focus, (b) handle missing values, and (c) normalization of the dataset to the range of (0, 1) using the MinMaxScalar method. The third stage relates to building the predictor based on the LSTM method by identifying the best structure and parameter values (weight, bias, number of hidden layers, number of nodes in each hidden layer, and activation function) for the network using the functional PSO algorithm to achieve a goal. Thereafter, the dataset is split into training and testing parts based on the ten cross-validation principle. The training dataset is then used to build the predictor. In the fourth stage, evaluation results for each station are obtained by reading the concentration of each pollutant each hour for at most 30 days then taking the average of the symmetric mean absolute percentage error (SMAPE) for 25 days only.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Samaher Al-Janabi.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Communicated by V. Loia.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

Item

Description

DLSTM

Developed long short-term memory

LSTM

Long short-term memory

PSO

Particle swarm optimization

SMAPE

Symmetric mean absolute percentage error

PM2.5

Particulate matter with diameter less than 2.5 μm

PM10

Particulate matter with diameter less than 10 μm

O3

Ozone, the unstable triatomic form of oxygen

SOx

Sulfur oxides

CO

Carbon monoxide

NOx

Nitrogen oxides

\( \odot \)

Elementwise or Hadamard product

Outer product

\( \sigma \)

Sigmoid function

at

Input activation

it

Input gate

ft

Forget gate

ot

Output gate

Statet

Internal state

Outt

Output

W

The weights of the input

U

The weights of recurrent connections

\( V_{i}^{t} \):

Velocity of particle i in swarm in dimension j and frequency t

\( X_{i}^{t} \)

Location of particle i in swarm in dimension j and frequency t

\( c_{1} \)

Acceleration factor related to Pbest

\( c_{2} \)

Acceleration factor related to gbest

\( r_{1}^{t} \), \( r_{2}^{t} \):

Random number between 0 and 1

t

Number of occurrences specified by type of problem

\( G_{{{\text{best}},i}}^{t} \)

gbest position of swarm

\( P_{{{\text{best}},i}}^{t} \)

pbest position of particle

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Al-Janabi, S., Mohammad, M. & Al-Sultan, A. A new method for prediction of air pollution based on intelligent computation. Soft Comput 24, 661–680 (2020). https://doi.org/10.1007/s00500-019-04495-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-019-04495-1

Keywords

Navigation