Skip to main content

Advertisement

Log in

Hga-lstm: LSTM architecture and hyperparameter search by hybrid GA for air pollution prediction

  • Published:
Genetic Programming and Evolvable Machines Aims and scope Submit manuscript

Abstract

Air pollution prediction is a process of predicting the levels of air pollutants in a specific area over a given period. Since LSTM (Long Short-Term Memory) networks are particularly effective in capturing long-term dependencies and patterns in sequential data, they are widely-used for air pollution prediction. However, designing appropriate LSTM architectures and hyperparameters for given tasks can be challenging, which are normally determined by users in existing LSTM-based methods. Note that Genetic Algorithm (GA) is an effective optimization technique, and local search in augmenting the global search ability of GA has been proved, which is rarely considered by existing GA-optimzied LSTM methods. In this work, simultaneous LSTM architecture and hyperparameter search based on GA and local search techniques is investigated for air pollution prediction. Specifically, a new LSTM model search method is designed, termed as HGA-LSTM. HGA is a hybrid GA, which is proposed by integrating GA with local search adaptively. Based on HGA, HGA-LSTM is developed to search for LSTM models with simultaneous LSTM architecture and hyperparameter optimization. In HGA-LSTM, a new crossover is designed to be adaptive to the variable-length representation of LSTM models. The proposed HGA-LSTM is compared with widely-used LSTM-based and nonLSTM-based prediction methods on UCI (University of California Irvine) datasets for air pollution prediction. Results show that HGA-LSTM is generally better than both types of reference methods with its evolved LSTM models achieving lower mean square/absolute errors. Moreover, compared with a baseline method (a GA without local search), HGA-LSTM converges to lower error values, which reflects that HGA has better search ability than GA.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Algorithm 1
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The data/material used or generated in this work is available from the corresponding author on reasonable request.

References

  1. H.B. Ali, A. Roman, Designing urban transit network using memetic algorithm, in 2021 IEEE Congress on Evolutionary Computation (CEC) (2021)

  2. M. Asadujjaman, H.F. Rahman, R.K. Chakrabortty, M.J. Ryan, Multi-operator immune genetic algorithm for project scheduling with discounted cash flows. Expert Syst. Appl. 195, 116589 (2022)

    Article  Google Scholar 

  3. S. Balaraman, P. Partheeban, P.N. Elamparithi, S. Manimozhi, Application of LSTM models in predicting particulate matter (pm2.5) levels for urban area. J. Eng. Res. 10(3B), 71–90 (2022)

    Google Scholar 

  4. A.H.C. Correia, D.E. Worrall, R. Bondesan, Neural simulated annealing (2022)

  5. H. Dai, G. Huang, H. Zeng, F. Yang, PM2.5 concentration prediction based on spatiotemporal feature selection using XGBoost-MSCNN-GA-LSTM. Sustainability 13(21), 12071–12094 (2021)

    Article  Google Scholar 

  6. X. Dai, J. Liu, Y. Li, A recurrent neural network using historical data to predict time series indoor PM2.5 concentrations for residential buildings. Indoor Air 31(4), 1228–1237 (2021)

    Article  Google Scholar 

  7. F. D’Angelo, GGA: A modified genetic algorithm with gradient-based local search for solving constrained optimization problems. Inform. Sci. Int. J. 547(1), 136–162 (2021)

    Article  MathSciNet  Google Scholar 

  8. D.D. Hema, K.A. Kumar, An optimized intelligent driver’s aggressive behaviour prediction model using GA-LSTM. Int. J. Perform. Eng. 17(10), 880–888 (2021)

    Article  Google Scholar 

  9. S.D. Immanuel, U.K. Chakraborty, Genetic algorithm: an approach on optimization. In 2019 International Conference on Communication and Electronics Systems (ICCES) (2019)

  10. D. Jiahui, G. Yaping, L. Jun, Z. Zhiyao, Air-quality prediction based on the ARIMA-CNN-LSTM combination model optimized by dung beetle optimizer. Sci. Rep. 13, 12127 (2023)

    Article  Google Scholar 

  11. W. Kai, H. Yu, H. Lianzhong, G. Xin, L. Xing, M. Zhongmin, M. Ranqi, J. Xiaoli, A novel GA-LSTM-based prediction method of ship energy usage based on the characteristics analysis of operational data. Energy 282, 128910 (2023)

    Article  Google Scholar 

  12. S. Kumari, N. Kumar, P.S. Rana, Big data analytics for energy consumption prediction in smart grid using genetic algorithm and long short term memory. Comput. Inform. 40(1), 29–56 (2021)

    Article  MathSciNet  Google Scholar 

  13. A. Kuri, A statistical genetic algorithm, in National Computation Meeting, pp. 1–7 (2022)

  14. D. Li, J. Liu, Y. Zhao, Prediction of multi-site PM2.5 concentrations in Beijing using CNN-Bi LSTM with CBAM. Atmosphere 13(10), 1719–1737 (2022)

    Article  Google Scholar 

  15. Z. Li, Z. Li, Z. Li, Y. Li, Application of GA-LSTM model in cable joint temperature prediction. In 2020 7th International Forum on Electrical Engineering and Automation (IFEEA) (2020)

  16. J. Liang, Y. Xue, Bloat-aware GP-based methods with bloat quantification. Appl. Intell. 52(4), 4211–4225 (2022)

    Article  Google Scholar 

  17. J. Liang, Y. Xue, J. Wang, Genetic programming based feature construction methods for foreground object segmentation. Eng. Appl. Artif. Intell. 89(Mar.), 103334.1-103334.12 (2020)

    Google Scholar 

  18. B. Lindemann, T. Müller, H. Vietz, N. Jazdi, M. Weyrich, A survey on long short-term memory networks for time series prediction. Proc. CIRP 99, 650–655 (2020)

    Article  Google Scholar 

  19. J. Liu, D. Zhou, W. Jin, Prediction of nucleosome dynamic interval based on long-short-term memory network (LSTM). J. Bioinform. Comput. Biol. 20(3), 2250009 (2022)

    Article  Google Scholar 

  20. P. Mohapatra, S. Roy, K.N. Das, S. Dutta, M.S.S. Raju, A review of evolutionary algorithms in solving large scale benchmark optimisation problems. Int. J. Math. Op. Res. 21(1), 104–126 (2022)

    Article  MathSciNet  Google Scholar 

  21. T.H.T. Nguyen, Q.B. Phan, Hourly day ahead wind speed forecasting based on a hybrid model of EEMD, CNN-Bi-LSTM embedded with GA optimization, in 2022 The 4th International Conference on Clean Energy and Electrical Systems, vol. 8, pp. 53–60 (2022)

  22. L. Qing, PM2.5 concentration prediction using GRA-GRU network in air monitoring. Sustainability 15(3), 1973–1988 (2023)

    Article  Google Scholar 

  23. W. Rui, Z. Zhihe, T. Hongfeng, P. Wojciech, S. Vladimir, Q-learning based fault estimation and fault tolerant iterative learning control for mimo systems. ISA Trans. 142, 123–135 (2023)

    Article  Google Scholar 

  24. M. Saez, M.A. Barcelo, Spatial prediction of air pollution levels using a hierarchical bayesian spatiotemporal model in Catalonia. Spain. Environ. Model. Softw. 151(May), 105369 (2022)

    Article  Google Scholar 

  25. D. Santra, A. Mukherjee, K. Sarker, S. Mondal, Hybrid genetic algorithm-gravitational search algorithm to optimize multi-scale load dispatch. Int. J. Appl. Metaheur. Comput. 12(3), 28–53 (2021)

    Article  Google Scholar 

  26. F. Shahid, A. Zameer, M. Muneeb, A novel genetic LSTM model for wind power forecast. Energy 223(1), 120069 (2021)

    Article  Google Scholar 

  27. W. Shih-Jung, H. Bo-Jhen, H. Ming-Hui, A deep learning-based air quality index prediction model using lstm and reference stations: a real application in taiwan. In Australasian Telecommunication Networks and Applications Conference (2023)

  28. P. Siarry, Handbook of memetic algorithms. Comput. Rev. 53(10), 597 (2012)

    Google Scholar 

  29. S. Tsokov, M. Lazarova, A. Aleksieva-Petrova, A hybrid spatiotemporal deep model based on CNN and LSTM for air pollution prediction. Sustainability 14(9), 5104–5141 (2022)

    Article  Google Scholar 

  30. J. Wang, Z. Wang, M. Deng, H. Zou, K. Wang, Heterogeneous spatiotemporal copula-based kriging for air pollution prediction. Trans. GIS 25(6), 3210–3232 (2021)

    Article  Google Scholar 

  31. Y. Yu, M. Zhang, Control chart recognition based on the parallel model of CNN and LSTM with ga optimization. Expert Syst. Appl. 185, 115689–1156702 (2021)

    Article  Google Scholar 

  32. M. Zhang, D. Wu, R. Xue, Hourly prediction of PM2.5 concentration in beijing based on Bi-LSTM neural network. Multim. Tools Appl. 80(16), 24455–24468 (2021)

    Article  Google Scholar 

  33. C. Zhou, H. Tao, Y. Chen, V. Stojanovic, W. Paszke, Robust point-to-point iterative learning control forconstrained systems: A minimum energy approach. Int. J. Robust Nonlinear Control 32, 10139–10161 (2022)

    Article  Google Scholar 

  34. B. Zhuohao, Residential electricity prediction based on GA-LSTM modeling. Energy Rep. 11, 6223–6232 (2024)

    Article  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

This work is supported by National Natural Science Foundation of China with the grant number 61902281.

Author information

Authors and Affiliations

Authors

Contributions

Jiayu Liang is responsible for methodology, result analyses and writing. Yaxin Lu is responsible for experiments and result visualization. All authors reviewed the manuscript.

Corresponding author

Correspondence to Jiayu Liang.

Ethics declarations

Conflict of interest

The authors declare that they have no Conflict of interest.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liang, J., Lu, Y. & Su, M. Hga-lstm: LSTM architecture and hyperparameter search by hybrid GA for air pollution prediction. Genet Program Evolvable Mach 25, 20 (2024). https://doi.org/10.1007/s10710-024-09493-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10710-024-09493-3

Keywords