Abstract
Traffic flow forecasting, which requires modelling involuted spatial and temporal dependence and uncertainty regarding road networks and traffic conditions, is a challenge for intelligent transportation systems (ITS). Recent studies have mainly focused on modelling spatial-temporal dependence through a fixed weighted graph based on prior knowledge. However, collecting up-to-date and accurate road information is costly. Moreover, is a single fixed graph enough to describe the correlation between sensors? The fixed weighted graph cannot directly relate to prediction tasks, which may result in considerable biases. To tackle this issue, in this paper, we propose a novel deep learning model framework: an adaptive spatial-temporal graph attention network (ASTGAT). Our ASTGAT simultaneously learns the dynamic graph structure and spatial-temporal dependency for traffic flow forecasting. Specifically, our framework consists of two joint training parts: a Network Generator model that generates a discrete graph with the Gumbel-Softmax technique and a Spatial-Temporal model that utilizes the generated network to predict traffic speed. Our Network Generator can adaptively infer the hidden correlations from data. Moreover, we propose a graph talking-heads attention layer (GTHA) for capturing spatial dependencies and design a gate temporal convolution (GTCN) layer for handling long temporal sequences. We evaluated our ASTGAT on two public datasets: METR-LA is collected in Los Angeles and PEMS-BAY is collected in California. Experimental results indicate that our ASTGAT outperforms the state-of-the-art (SOTA) baselines. Finally, to further describe our model, we visualize the forecasting results and the generated graph.
Similar content being viewed by others
References
Lv Y, Duan Y, Kang W, Li Z, Wang FY (2014) Traffic flow prediction with big data: a deep learning approach. IEEE Trans Intell Transp Syst 16(2):865
Wang Y, Zhang D, Liu Y, Dai B, Lee LH (2019) Enhancing transportation systems via deep learning: A survey. Transp Res C Emerg Technol 99:144. https://doi.org/10.1016/j.trc.2018.12.004
Yu H, Wu Z, Wang S, Wang Y, Ma X (2017) Spatiotemporal recurrent convolutional networks for traffic prediction in transportation networks. Sensors 17(7):1501
Tan H, Wu Y, Shen B, Jin PJ, Ran B (2016) Short-term traffic prediction based on dynamic tensor completion. IEEE Trans Intell Transp Syst 17(8):2123
Ma X, Dai Z, He Z, Ma J, Wang Y, Wang Y (2017) Learning traffic as images: a deep convolutional neural network for large-scale transportation network speed prediction. Sensors 17(4):818
Zhang J, Zheng Y, Qi D, Li R, Yi X, Li T (2018) Predicting citywide crowd flows using deep spatio-temporal residual networks. Artif Intell 259:147
Kipf TN, Welling M (2017) Semi-Supervised classification with graph convolutional networks. In: International conference on learning representations (ICLR)
Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y (2018) Graph attention networks. In: International conference on learning representations. https://openreview.net/forum?id=rJXMpikCZ
Seo Y, Defferrard M, Vandergheynst P, Bresson X (2018) Structured sequence modeling with graph convolutional recurrent networks. In: Cheng L, Leung ACS, Ozawa S (eds) Neural information processing. Springer International Publishing, Cham, pp 362–373
Li Y, Yu R, Shahabi C, Liu Y (2018) Diffusion convolutional recurrent neural network: data-driven traffic forecasting. https://openreview.net/forum?id=SJiHXGWAZ
Pan Z, Liang Y, Wang W, Yu Y, Zheng Y, Zhang J (2019) Urban traffic prediction from spatio-temporal data using deep meta learning. In: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pp 1720–1730
Yu B, Yin H, Zhu Z (2018) Spatio-temporal graph convolutional networks: a deep learning framework for traffic forecasting. pp 3634–3640. http://dl.acm.org/citation.cfm?id=3304222.3304273
Wu Z, Pan S, Long G, Jiang J, Zhang C (2019) Graph WaveNet for deep spatial-temporal graph modeling. In: The 28th international joint conference on artificial intelligence (IJCAI) (International Joint Conferences on Artificial Intelligence Organization
Haklay M, Weber P (2008) Openstreetmap: User-generated street maps. IEEE Pervasive Comput 7(4):12
Yu B, Li M, Zhang J, Zhu Z (2019) 3d graph convolutional networks with temporal graphs: A spatial information free framework for traffic forecasting. arXiv:1903.00919
Li M, Zhu Z (2020) Spatial-temporal fusion graph neural networks for traffic flow forecasting. arXiv:2012.09641
Geng X, Li Y, Wang L, Zhang L, Yang Q, Ye J, Liu Y (2019) Spatiotemporal multi-graph convolution network for ride-hailing demand forecasting. In: Proceedings of the AAAI conference on artificial intelligence, vol. 33. pp 3656–3663
Chen Y, Wu L, Zaki M (2020) Iterative deep graph learning for graph neural networks: better and robust node embeddings. Adv Neural Inf Process Syst 33
Franceschi L, Niepert M, Pontil M, He X (2019) Learning discrete structures for graph neural networks. In: Chaudhuri K, Salakhutdinov R (eds) Proceedings of the 36th international conference on machine learning, proceedings of machine learning research, (PMLR). http://proceedings.mlr.press/v97/franceschi19a.html, vol 97, pp 1972–1982
Zhang Z, Zhao Y, Liu J, Wang S, Tao R, Xin R, Zhang J (2019) A general deep learning framework for network reconstruction and dynamics learning. Appl Netw Sci 4(1):1
Jang E, Gu S, Poole B. (2016) Categorical reparameterization with gumbel-softmax. arXiv:1611.01144
Shazeer N, Lan Z, Cheng Y, Ding N, Hou L (2020) Talking-heads attention. arXiv:2003.02436
Abbasi M, Shahraki A, Taherkordi A (2021) Deep learning for network traffic monitoring and analysis (ntma): a survey. Comput Commun
Lin K, Xu X, Gao H (2021) TSCRNN: A novel classification scheme of encrypted traffic based on flow spatiotemporal features for efficient management of IIoT. Comput Netw 190:107974
Gao H, Liu C, Li Y, Yang X (2020) V2VR: reliable hybrid-network-oriented V2V data transmission and routing considering RSUs and connectivity probability. IEEE Trans Intell Transp Syst
Nishi T, Otaki K, Hayakawa K, Yoshimura T (2018) Traffic signal control based on reinforcement learning with graph convolutional neural nets. In: 2018 21st International conference on intelligent transportation systems (ITSC). IEEE, pp 877–883
Zhang Y, Zhou Y, Lu H, Fujita H (2021) Spark cloud-based parallel computing for traffic network flow predictive control using non-analytical predictive model. IEEE Trans Intell Transp Syst
Bi Z, Yu L, Gao H, Zhou P, Yao H (2020) Improved VGG model-based efficient traffic sign recognition for safe driving in 5G scenarios. Int J Mach Learn Cybern :1–12
Kuang L, Hua C, Wu J, Yin Y, Gao H (2020) Traffic volume prediction based on multi-sources GPS trajectory data by temporal convolutional network. Mobile Netw Appl 25(4):1405
Nagy AM, Simon V (2018) Survey on traffic prediction in smart cities. Pervasive Mob Comput : S1574119217306,521–
Ahmed MS, Cook AR (1979) Analysis of freeway traffic time-series data by using Box-Jenkins techniques. 722
Kamarianakis Y, Prastacos P (2003) Forecasting traffic flow conditions in an urban network: Comparison of multivariate and univariate approaches. Transp Res Rec 1857(1):74
Van Der Voort M, Dougherty M, Watson S (1996) Combining Kohonen maps with ARIMA time series models to forecast traffic flow. Transp Res C Emerg Technol 4(5):307
Min W, Wynter L (2011) Real-time road traffic prediction with spatio-temporal correlations. Transp Res C Emerg Technol 19(4):606
Chien SIJ, Kuchipudi CM (2003) Dynamic travel time prediction with real-time and historic data. J Transp Eng 129(6):608
Anand A, Ramadurai G, Vanajakshi L (2014) Data fusion-based traffic density estimation and prediction. J Intell Transp Syst 18(4):367
Nikovski D, Nishiuma N, Goto Y, Kumazawa H (2005) Univariate short-term prediction of road travel times. In: Proceedings. 2005 IEEE Intelligent transportation systems, 2005. IEEE, pp 1074–1079
Li S, Shen Z, Wang FY (2012) A weighted pattern recognition algorithm for short-term traffic flow forecasting. In: Proceedings of 2012 9th IEEE international conference on networking, sensing and control. IEEE, pp 1–6
Ma X, Tao Z, Wang Y, Yu H, Wang Y (2015) Long short-term memory neural network for traffic speed prediction using remote microwave sensor data. Transp Res C Emerg Technol 54:187
Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv:1412.3555
Zhang J, Shi X, Xie J, Ma H, King I, Yeung D (2018) GaAN: Gated attention networks for learning on large and spatiotemporal graphs. arXiv:1803.07294
Guo S, Lin Y, Feng N, Song C, Wan H (2019) Multi-range attentive bicomponent graph convolutional network for traffic forecasting. In: Proceedings of the AAAI conference on artificial intelligence. https://doi.org/10.1609/aaai.v33i01.3301922, vol 33, p 922
Chen W, Chen L, Xie Y, Cao W, Gao Y, Feng X (2020) Multi-range attentive bicomponent graph convolutional network for traffic forecasting. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 3529–3536
Zheng C, Fan X, Wang C, Qi J (2020) GMAN: a graph multi-attention network for traffic prediction. In: AAAI, pp 1234– 1241
Park C, Lee C, Bahng H, Tae Y, Jin S, Kim K, Ko S, Choo J (2020) ST-GRAT: a novel spatio-temporal graph attention networks for accurately forecasting dynamically changing road speed. In: Proceedings of the 29th ACM international conference on information & knowledge management, pp 1215–1224
Gori M, Monfardini G, Scarselli F (2005) A new model for learning in graph domains. In: IEEE International joint conference on neural networks
Franco S, Marco G, Chung TAh, Markus H, Gabriele M (2009) The graph neural network model. IEEE Trans Neural Netw 20(1):61
Wu Z, Pan S, Chen F, Long G, Zhang C, Philip SY (2020) A comprehensive survey on graph neural networks. IEEE Trans Neural Netw Learn Syst
Bruna J, Zaremba W, Szlam A, LeCun Y (2013) Spectral networks and locally connected networks on graphs
Defferrard M, Bresson X, Vandergheynst P (2016) Convolutional neural networks on graphs with fast localized spectral filtering. In: Lee DD, Sugiyama M, Luxburg UV, Guyon I, Garnett R (eds) Advances in neural information processing systems, vol 29. Curran associates Inc., pp 3844–3852
Bahdanau D, Cho KH, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In: 3rd international conference on learning representations, ICLR 2015
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł., Polosukhin I (2017) Attention is all you need. In: Advances in neural information processing systems, pp 5998–6008
Iida S, Kimura R, Cui H, Hung PH, Utsuro T, Nagata M (2019) Attention over heads: A multi-hop attention for neural machine translation. In: Proceedings of the 57th annual meeting of the association for computational linguistics: student research workshop, pp 217–222
Shuman DI, Narang SK, Frossard P, Ortega A, Vandergheynst P (2013) The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains. IEEE Signal Process Mag 30(3):83
Grover A, Leskovec J (2016) node2vec: Scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining, pp 855–864
Nadarajah S, Kotz S (2004) The beta Gumbel distribution. Math Probl Eng 2004(4):323
Kong X, Xing W, Wei X, Bao P, Zhang J, Lu W (2020) STGAT: spatial-temporal graph attention networks for traffic flow forecasting. IEEE Access 8:134363
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
Jagadish HV, Gehrke J, Labrinidis A, Papakonstantinou Y, Patel JM, Ramakrishnan R, Shahabi C (2014) Big data and its technical challenges. Commun Acm 57(7):86
Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. Adv Neural Inf Process Syst :3104–3112
Oreshkin BN, Amini A, Coyle L, Coates MJ (2021) FC-GAGA: Fully connected gated graph architecture for spatio-temporal traffic forecasting. In: AAAI
Kingma DP, Ba J (2015) Adam: A Method for Stochastic Optimization. In: ICLR (Poster)
Acknowledgements
The authors would like to thank the National Natural Science Foundation of China (61876017, 61876018, 61906014) for their support in this research.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kong, X., Zhang, J., Wei, X. et al. Adaptive spatial-temporal graph attention networks for traffic flow forecasting. Appl Intell 52, 4300–4316 (2022). https://doi.org/10.1007/s10489-021-02648-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-02648-0