Skip to main content

Advertisement

Log in

Multi-city traffic flow forecasting via multi-task learning

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Traffic flow forecasting or prediction plays an important role in the traffic control and management of a city. Existing works mostly train a model using the traffic flow data of a city and then test the trained model using the data of the same city. It may not be truly intelligent as there are many cities around us and there should be some shared knowledge among different cities. The data of a city and its knowledge can be used to help improve the traffic flow forecasting of other cities. To address this motivation, we study building a universal deep learning model for multi-city traffic flow forecasting. In this paper, we exploit spatial-temporal correlations among different cities with multi-task learning to approach the traffic flow forecasting tasks of multiple cities. As a result, we propose a M ulti-city T raffic flow forecasting N etwork (MTN) via multi-task learning to extract the spatial dependency and temporal regularity among multiple cities later used to improve the performance of each individual city traffic flow forecasting collaboratively. In brief, the proposed model is a quartet of methods: (1) It integrates three temporal intervals and formulates a multi-interval component for each city to extract temporal features of each city; (2) A spatial-temporal attention layer with 3D Convolutional kernels is plugged into the neural networks to learn spatial-temporal relationship; (3) As traffic peak distributions of different cities are often similar, it proposes to use a peak zoom network to learn the peak effect of multiple cities and enhance the prediction performance on important time steps in different cities; (4) It uses a fusion layer to merge the outputs from distinct temporal intervals for the final forecasting results. Experimental results using real-world datasets from DIDI show the superior performance of the proposed model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Notes

  1. https://gaia.didichuxing.com

References

  1. Abadi A, Rajabioun T, Ioannou P (2015) Traffic flow prediction for road transportation networks with limited traffic data. IEEE Trans Intell Transp Syst 16(2):653–662

    Google Scholar 

  2. Anwar T, Liu C, Vu H, Islam MS, Sellis T (2018) Capturing the spatiotemporal evolution in road traffic networks. IEEE Trans Knowl Data Eng 30(8):1426–1439

    Article  Google Scholar 

  3. Bai L, Yao L, Kanhere S, Wang X, Sheng Q (2019a) Cross-city transfer learning for deep spatio-temporal prediction. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 1893–1899

  4. Bai L, Yao L, Kanhere S, Wang X, Sheng Q (2019b) Stg2seq: Spatial-temporal graph to sequence model for multi-step passenger demand forecasting. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 1981–1987

  5. Bian J, Tian D, Tang Y, Tao D (2019) Trajectory data classification: A review. ACM Trans Intell Syst Technol 10(4)

  6. Cho K, van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using rnn encoder−decoder for statistical machine translation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP-14), pp 1724–1734

  7. Chopra S, Hadsell R, LeCun Y (2005) Learning a similarity metric discriminatively with application to face verification. In: Proceedings of the 2005 IEEE conference on computer vision and pattern recognition (CVPR-05)

  8. Eigen D, Fergus R (2015) Predicting depth, surface normals and semantics labels with a common multi-scale convolutional architecture. In: Proceedings of the 2015 IEEE international conference on computer vision (ICCV-15), pp 2650–2658

  9. Fang S, Zhang Q, Meng G, Xiang S, Pan C (2019) Gstnet: Global spatial-temporal network for traffic flow prediction. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 2286–2293

  10. Gao R, Guo X, Sun F, Dai L, Zhu J, Hu C, Li H (1689) Aggressive driving saves more time? multi-task learning for customized travel time estimation. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19)

  11. Gao Y, Ma J, Liu W, Alan L, Yuille MZ (2019b) Nddr-cnn: Layer-wise feature fusing in multi-task cnn. In: Proceedings of the 2019 IEEE conference on computer vision and pattern recognition (CVPR-19)

  12. Gong Y, Li Z, Zhang J, Liu W, Zheng Y, Kirsch C (2018) Network-wide crowd flow prediction of sydney trains via customized online non-negative matrix factorization. In: Proceedings of the 27th ACM international conference on information and knowledge management (CIKM-18), pp 1243–1252

  13. Hoang MX, Zheng Y, Singh AK (2016) Fccf: Forecasting citywide crowd flows based on big data. In: Proceedings of the 24th ACM SIGSPATIAL international conference on advances in geographic information systems

  14. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article  Google Scholar 

  15. Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd international conference on machine learning (ICML-15), pp 448–456

  16. Kingma DP, Ba J (2015) Adam: A method for stochastic optimization. In: Proceedings of the 3rd international conference on learning representations (ICLR-15)

  17. LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444

    Article  Google Scholar 

  18. Li Y, Zheng Y, Zhang H, Chen L (2015) Traffic prediction in a bike-sharing system. In: Proceedings of the 23rd ACM SIGSPATIAL international conference on advances in geographic information systems

  19. Li Y, Yu R, Shahabi C, Liu Y (2018) Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In: Proceedings of the 6th international conference on learning representations (ICLR-18). https://openreview.net/forum?id=SJiHXGWAZ

  20. Liao Q, Ding Y, Jiang ZL, Wang X, Zhang C, Zhang Q (2019) Multi-task deep convolutional neural network for cancer diagnosis. Neurocomputing 348:66–73

    Article  Google Scholar 

  21. Ling H, Wang Z, Li P, Shi Y, Chen J, Zou F (2019) Improving person re-identification by multi-task learning. Neurocomputing 347:109–118

    Article  Google Scholar 

  22. Misra I, Shrivastava A, Gupta A, Hebert M (2016) Cross-stitch networks for multi-task learning. In: Proceedings of the 2016 IEEE conference on computer vision and pattern recognition (CVPR-16), pp 3994–4003

  23. Pan Z, Liang Y, Wang W, Yu Y, Zhang J, Zheng Y (2019) Urban traffic prediction from spatio-temporal data using deep meta learning. In: Proceedings of the 25th SIGKDD conference on knowledge discovery and data mining (KDD-19), pp 1720–1730

  24. Ranjan R, Patel VM, Chellappa R (2019) Hyperface: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans Pattern Anal Mach Intell 41(1):121–135

    Article  Google Scholar 

  25. Savchuk OP (2010) Large-scale dynamics of hypoxia in the baltic sea. In: Chemical structure of pelagic redox interfaces: observation and modeling, pp 137–160

  26. Shi X, Chen Z, Wang H, Yeung DY, Wong WK, WOO W-C (2015) Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In: Proceedings of the 28th international conference on neural information processing systems (NIPS-15), pp 802–810

  27. Silva R, Kang SM, Airoldi EM (2015) Predicting traffic volumes and estimating the effects of shocks in massive transportation systems. In: Proceedings of the national academy of sciences, pp 5643–5648

  28. Tian Y, Zhang K, Li J, Lin X, Yang B (2018) Multi-task deep convolutional neural network for cancer diagnosis. Neurocomputing 318:297–305

    Article  Google Scholar 

  29. Wang H, Li Z, Kuo YH, Kifer D (2019a) A simple baseline for travel time estimation using large-scale trip data. ACM Trans Intell Syst Technol 10(2)

  30. Wang Y, Zhang Y, Piao X, Liu H, Zhang K (2019b) Traffic data reconstruction via adaptive spatial-temporal correlations. IEEE Trans Intell Transp Syst 20(4):1531–1543

    Article  Google Scholar 

  31. Wu F, Wang H, Li Z (2016) Interpreting traffic dynamics using ubiquitous urban data. In: Proceedings of the 24th ACM SIGSPATIAL international conference on advances in geographic information systems

  32. Wu R, Luo G, Shao J, Tian L, Peng C (2018) Location prediction on trajectory data: A review. Big Data Min Anal 1(2):108–127

    Article  Google Scholar 

  33. Wu Z, Pan S, Long G, Jiang J, Zhang C (2019) Graph wavenet for deep spatial-temporal graph modeling. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 1907–1913

  34. Xiao Q, Dai J, Luo J, Fujita H (2019) Multi-view manifold regularized learning-based method for prioritizing candidate disease mirnas. Knowl.-Based Syst 175:118–129

    Article  Google Scholar 

  35. Xu Y, Kong Q, Klette R, Liu Y (2014) Accurate and interpretable bayesian mars for traffic flow prediction. IEEE Trans Intell Transp Syst 15(6):2457–2469

    Article  Google Scholar 

  36. Yang P, Tan Q, Tong H, He J (2019a) Task-adversarial co-generative nets. In: Proceedings of the 25th SIGKDD conference on knowledge discovery and data mining (KDD-19), pp 1596–1604

  37. Yang P, Tan Q, Ye J, Tong H, He J (2019b) Deep multi-task learning with adversarial-and-cooperativenets. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence (IJCAI-19), pp 4078–4084

  38. Yao H, Wu F, Ke J, Tang X, Jia Y, Lu S, Gong P, Ye J (2018) Deep multi-view spatial-temporal network for taxi demand prediction. In: Proceedings of the 32nd AAAI conference on artificial intelligence (AAAI-18), pp 2588–2595

  39. Yao H, Tang X, Wei H, Zheng G, Li Z (2019) Revisiting spatial-temporal similarity: a deep learning framework for traffic prediction. In: Proceedings of the 33rd AAAI conference on artificial intelligence (AAAI-19)

  40. Yim J, Jung H, Yoo B, Choi C, Park D, Kim J (2015) Rotating your face using multi-task deep neural network. In: Proceedings of the 2015 IEEE conference on computer vision and pattern recognition (CVPR-15), pp 676–684

  41. Zhang H, Goodfellow I, Metaxas D, Odena A (2018a) Self-attention generative adversarial networks. arXiv:180508318

  42. Zhang H, Zheng Y, Yu Y (2018b) Detectingurban anomalies using multiple spatio-temporal data sources. In: Proceedings of the ACM on interactive, mobile, wearable and ubiquitous technologies, vol 2

  43. Zhang J, Zheng Y, Qi D, Li R, Yi X (2016) Dnn-based prediction model for spatio-temporal data. In: Proceedings of the 24th ACM SIGSPATIAL international conference on advances in geographic information systems

  44. Zhang J, Zheng Y, Qi D, Li R, Yi X, Li T (2018c) Predicting citywide crowd flows using deep spatio-temporal residual networks. Artif Intell 259:147–166

    Article  MathSciNet  Google Scholar 

  45. Zhang J, Zheng Y, Sun J, Qi D (2019a) Flow prediction in spatio-temporal networks based on multitask deep learning. IEEE Trans Knowl Data Eng

  46. Zhang Y, Yang Q (2017) A survey on multi-task learning. arXiv:170708114

  47. Zhang Y, Yang Y, Li T, Fujita H (2019b) A multitask multiview clustering algorithm in heterogeneous situations based on lle and le. Knowl-Based Syst 163:776–786

    Article  Google Scholar 

  48. Zhao L, Sun Q, Ye J, Chen F, Lu CT, Ramakrishnan N (2017) Feature constrained multi-task learning models for spatiotemporal event forecasting. IEEE Trans Knowl Data Eng 29(5):1059–1072

    Article  Google Scholar 

  49. Zhou Y, Tao X, Yu Z, Fujita H (2019) Train-movement situation recognition for safety justification using moving-horizon tbm-based multisensor data fusion. Knowl-Based Syst 177:117– 126

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Nos. 61976247 and 61572407) and the National Key Technology Research and Development Program (No. 2015BAH19F02). The datasets used in the experiments are from DIDI Chuxing GAIA Initiative. Hao Wang would like to thank a grant from Project Funded by China Postdoctoral Science Foundation (No. 2020M681960).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yan Yang.

Ethics declarations

Conflict of interests

We declare that all authors certify that they have NO affiliations with or involvement in any organization or entity with any financial interest (such as honoraria; educational grants; participation in speakers’ bureaus; membership, employment, consultancies, stock ownership, or other equity interest; and expert testimony or patent-licensing arrangements), or non-financial interest (such as personal or professional relationships, affiliations, knowledge or beliefs) in the subject matter or materials discussed in this manuscript.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Y., Yang, Y., Zhou, W. et al. Multi-city traffic flow forecasting via multi-task learning. Appl Intell 51, 6895–6913 (2021). https://doi.org/10.1007/s10489-020-02074-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-020-02074-8

Keywords

Navigation