Skip to main content
Log in

Attention based convolutional networks for traffic flow prediction

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Real-time and accurate prediction of traffic flow plays an important role in intelligent transportation systems. However, short-term traffic flow forecasting is extremely challenging due to the highly nonlinear nature of the traffic system and the dynamic spatial and temporal correlation. Although various methods, including deep learning based ones, have been proposed, most of them still suffer from problems such as spatial nonstationarity and thus cannot achieve good prediction performance. Inspired by the recent superior performance of attention mechanism, we introduce it into the model for traffic flow prediction with regular grided input. To be specific, we propose a novel deep learning framework, Spatial-Temporal Attention Based Convolutional Networks (STAtt-Net), for accurate forecasting of citywide traffic flow. First, we model the traffic data as a two-dimensional matrix with two channels. Each cell in the matrix represents the traffic in the corresponding region. Taking into account the temporal correlation and dependence of traffic system, the periodic patterns contained in traffic data are modeled by three major components for weekly trend, daily periodicity, and hourly closeness respectively. Then, STAtt-Net employs a STBlock as the basis unit to learn temporal dependence and spatial dependence of traffic flow, taking advantage of attention mechanism. We conduct extensive experiments to evaluate the performance of our model on three real-world datasets (TaxiBJ, BikeNYC, TaxiSZ), with the results revealing better prediction accuracy and efficiency of the proposed model against existing ones.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Atrish A, Singh N, Kumar K, Kumar V (2017) An automated hierarchical framework for player recognition in sports image. In: Proceedings of the international conference on video and image processing, pp 103–108

  2. Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  Google Scholar 

  3. Chang H, Lee Y, Yoon B, Baek S (2012) Dynamic near-term traffic flow prediction: system-oriented approach based on past experiences. IET Intell Transp Syst 6(3):292–305

    Article  Google Scholar 

  4. Cheng S, Lu F, Peng P (2020) Short-term traffic forecasting by mining the non-stationarity of spatiotemporal patterns. IEEE Trans Intell Transp Syst 22(10):6365–6383

    Article  Google Scholar 

  5. Choi H, Cho K, Bengio Y (2018) Fine-grained attention mechanism for neural machine translation. Neurocomputing 284:171–176

    Article  Google Scholar 

  6. Deng D, Shahabi C, Demiryurek U, Zhu L, Yu R, Liu Y (2016) Latent space model for road networks to predict time-varying traffic. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1525–1534

  7. Ding Q Y, Wang X F, Zhang X Y, Sun Z Q (2011) Forecasting traffic volume with space-time arima model. In: Advanced materials research, vol 156. Trans Tech Publications, pp 979–983

  8. Fu R, Zhang Z, Li L (2016) Using lstm and gru neural network methods for traffic flow prediction. In: 2016 31st Youth academic annual conference of Chinese association of automation (YAC). IEEE, pp 324–328

  9. Guo S, Lin Y, Li S, Chen Z, Wan H (2019) Deep spatial–temporal 3d convolutional neural networks for traffic data forecasting. IEEE Trans Intell Transp Syst 20(10):3913–3926

    Article  Google Scholar 

  10. Harvey A C (1990) Forecasting structural time series models and the kalman filter

  11. He Z, Chow C -Y, Zhang J -D (2019) Stcnn: a spatio-temporal convolutional neural network for long-term traffic prediction. In: 2019 20th IEEE international conference on mobile data management (MDM). IEEE, pp 226–233

  12. Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141

  13. Huang W, Song G, Hong H, Xie K (2014) Deep architecture for traffic flow prediction: deep belief networks with multitask learning. IEEE Trans Intell Transp Syst 15(5):2191–2201

    Article  Google Scholar 

  14. Hyndman R J, Koehler A B (2006) Another look at measures of forecast accuracy. Int J Forecast 22(4):679–688

    Article  Google Scholar 

  15. Ke R, Li W, Cui Z, Wang Y (2020) Two-stream multi-channel convolutional neural network for multi-lane traffic speed prediction considering traffic volume impact. Transp Res Rec 2674(4):459–470

    Article  Google Scholar 

  16. Kumar K (2021) Text query based summarized event searching interface system using deep learning over cloud. Multimed Tools Appl 80(7):11079–11094

    Article  Google Scholar 

  17. Kumar K, Kumar A, Bahuguna A (2017) D-cad: deep and crowded anomaly detection. In: Proceedings of the 7th international conference on computer and communication technology, pp 100–105

  18. Kumar K, Shrimankar D D, Singh N (2018) Somes: an efficient som technique for event summarization in multi-view surveillance videos 383–389

  19. Kumar A, Purohit K, Kumar K (2019) Stock price prediction using recurrent neural network and long short-term memory. In: International conference on deep learning, artificial intelligence and robotics. Springer, pp 153–160

  20. Lim B, Arık SÖ, Loeff N, Pfister T (2021) Temporal fusion transformers for interpretable multi-horizon time series forecasting. Int J Forecast 37 (4):1748–1764

    Article  Google Scholar 

  21. Liu K, Gao S, Qiu P, Liu X, Yan B, Lu F (2017) Road2vec: measuring traffic interactions in urban road system from massive travel routes. ISPRS Int J Geo-Inf 6(11):321

    Article  Google Scholar 

  22. Lu F, Liu K, Duan Y, Cheng S, Du F (2018) Modeling the heterogeneous traffic correlations in urban road systems using traffic-enhanced community detection approach. Physica A: Stat Mech Appl 501:227–237

    Article  Google Scholar 

  23. Luo Q, Zhou Y (2021) Spatial-temporal structures of deep learning models for traffic flow forecasting: a survey. In: 2021 4th International conference on intelligent autonomous systems (ICoIAS). IEEE, pp 187–193

  24. Luong M -T, Pham H, Manning C D (2015) Effective approaches to attention-based neural machine translation. arXiv:1508.04025

  25. Ma X, Tao Z, Wang Y, Yu H, Wang Y (2015) Long short-term memory neural network for traffic speed prediction using remote microwave sensor data. Transp Res Part C: Emerg Technol 54:187–197

    Article  Google Scholar 

  26. Martínez LM, Viegas JM, Silva EA (2009) A traffic analysis zone definition: a new methodology and algorithm. Transportation 36(5):581–599

    Article  Google Scholar 

  27. Mnih V, Heess N, Graves A (2014) Recurrent models of visual attention. In: Advances in neural information processing systems, pp 2204–2212

  28. Moorthy C, Ratcliffe B (1988) Short term traffic forecasting using time series methods. Transp Plan Technol 12(1):45–56

    Article  Google Scholar 

  29. Negi A, Kumar K (2021) Face mask detection in real-time video stream using deep learning. In: Computational intelligence and healthcare informatics, pp 255–268

  30. Negi A, Kumar K (2022) Chapter 1—ai-based implementation of decisive technology for prevention and fight with covid-19 1–14

  31. Negi A, Kumar K, Chaudhari N S, Singh N, Chauhan P (2021) Predictive analytics for recognizing human activities using residual network and fine-tuning. In: International conference on big data analytics. Springer, pp 296–310

  32. Okutani I, Stephanedes Y J (1984) Dynamic prediction of traffic volume through kalman filtering theory. Transp Res Part B: Methodol 18(1):1–11

    Article  Google Scholar 

  33. Openshaw S (1984) The modifiable areal unit problem. Geo Books, Norwick

    Google Scholar 

  34. Sharma S, Kumar K (2021) Asl-3dcnn: American sign language recognition technique using 3-d convolutional neural networks. Multimed Tools Appl 80(17):26319–26331

    Article  Google Scholar 

  35. Shi X, Chen Z, Wang H, Yeung D -Y, Wong W -K, Woo W-C (2015) Convolutional lstm network: a machine learning approach for precipitation nowcasting. arXiv:1506.04214

  36. Smola A J, Schölkopf B (2004) A tutorial on support vector regression. Stat Comput 14(3):199–222

    Article  MathSciNet  Google Scholar 

  37. Solanki A, Bamrara R, Kumar K, Singh N (2020) VEDL: a novel video event searching technique using deep learning. In: Soft computing: theories and applications, pp 905–914

  38. Srinivasu P N, Balas V E (2021) Self-learning network-based segmentation for real-time brain mr images through haris. PeerJ Comput Sci 7:654

    Article  Google Scholar 

  39. Stathopoulos A, Karlaftis M (2001) Temporal and spatial variations of real-time traffic data in urban areas. Transp Res Rec 1768(1):135–140

    Article  Google Scholar 

  40. Tang C, Zhu X, Liu X, Wang L, Zomaya A (2019) Defusionnet: defocus blur detection via recurrently fusing and refining multi-scale deep features. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 2700–2709

  41. Tang C, Liu X, An S, Wang P (2021) Br2net: defocus blur detection via a bidirectional channel attention residual refining network. IEEE Trans Multimed 23:624–635. https://doi.org/10.1109/TMM.2020.2985541

    Article  Google Scholar 

  42. Tobler W R (1970) A computer movie simulating urban growth in the detroit region. Econ Geogr 46(2)

  43. Wang X, Girshick R, Gupta A, He K (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803

  44. Williams B M, Hoel L A (2003) Modeling and forecasting vehicular traffic flow as a seasonal arima process: theoretical basis and empirical results. J Transp Eng 129(6):664–672

    Article  Google Scholar 

  45. Williams B M, Durvasula P K, Brown D E (1998) Urban freeway traffic flow prediction: application of seasonal autoregressive integrated moving average and exponential smoothing models. Transp Res Rec 1644(1):132–141

    Article  Google Scholar 

  46. Xingjian S, Chen Z, Wang H, Yeung D -Y, Wong W -K, Woo W-C (2015) Convolutional lstm network: a machine learning approach for precipitation nowcasting. In: Advances in neural information processing systems, pp 802–810

  47. Yan Q, Wang B, Zhang W, Luo C, Xu W, Xu Z, Zhang Y, Shi Q, Zhang L, You Z (2020) Attention-guided deep neural network with multi-scale feature fusion for liver vessel segmentation. IEEE J Biomed Health Inform 25(7):2629–2642

    Article  Google Scholar 

  48. Yan Q, Wang B, Zhang W, Luo C, Xu W, Xu Z, Zhang Y, Shi Q, Zhang L, You Z (2021) Attention-guided deep neural network with multi-scale feature fusion for liver vessel segmentation. IEEE J Biomed Health Inform 25(7):2629–2642. https://doi.org/10.1109/JBHI.2020.3042069

    Article  Google Scholar 

  49. Yao Z-S, Shao C-F, Gao Y-L (2006) Research on methods of short-term traffic forecasting based on support vector regression [j]. J Beijing Jiaotong Univ 30(3):19–22

    Google Scholar 

  50. Yao H, Tang X, Wei H, Zheng G, Li Z (2019) Revisiting spatial-temporal similarity: a deep learning framework for traffic prediction. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 5668–5675

  51. Yue Y (2006) Spatial-temporal dependency of traffic flow and its implications for short-term traffic forecasting. HKU Theses Online (HKUTO)

  52. Zeng W, Lin C, Lin J, Jiang J, Xia J, Turkay C, Chen W (2020) Revisiting the modifiable areal unit problem in deep traffic prediction with visual analytics. IEEE Trans Vis Comput Graph 27(2):839–848

    Article  Google Scholar 

  53. Zeng W, Lin C, Liu K, Lin J, Tung A K (2021) Modeling spatial nonstationarity via deformable convolutions for deep traffic flow prediction. IEEE Trans Knowl Data Eng

  54. Zeng W, Lin C, Lin J, Jiang J, Xia J, Turkay C, Chen W (2021) Revisiting the modifiable areal unit problem in deep traffic prediction with visual analytics. IEEE Trans Visual Comput Graph 27(2):839–848. https://doi.org/10.1109/TVCG.2020.3030410

    Article  Google Scholar 

  55. Zhao L, Song Y, Zhang C, Liu Y, Wang P, Lin T, Deng M, Li H (2019) T-gcn: a temporal graph convolutional network for traffic prediction. IEEE Trans Intell Transp Syst 21(9):3848–3858

    Article  Google Scholar 

  56. Zhang X-L, He G-G, Lu H-P (2009) Short-term traffic flow forecasting based on k-nearest neighbors non-parametric regression. J Syst Eng 24(2):178–183

    Google Scholar 

  57. Zhang J, Zheng Y, Qi D (2017) Deep spatio-temporal residual networks for citywide crowd flows prediction. In: Proceedings of the AAAI conference on artificial intelligence, vol 31

  58. Zheng C, Fan X, Wang C, Qi J (2020) Gman: a graph multi-attention network for traffic prediction. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 1234–1241

Download references

Funding

This work is supported by National Natural Science Foundation of China (62077039) and Research Project (PZ2020016).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qi Ye.

Ethics declarations

Conflict of interest

There are no other competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lin, J., Lin, C. & Ye, Q. Attention based convolutional networks for traffic flow prediction. Multimed Tools Appl 83, 7379–7394 (2024). https://doi.org/10.1007/s11042-023-15395-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-15395-w

Keywords

Navigation