skip to main content
10.1145/3360774.3360807acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmobiquitousConference Proceedingsconference-collections
research-article

A deep spatio-temporal attention-based neural network for passenger flow prediction

Published: 03 February 2020 Publication History

Abstract

Predicting the passenger flows in a city, especially in a metropolis, can guide traffic dispersion, and help assessing the risks of public safety and improving urban planning. However, it is challenging as passenger flows in a road network may vary with time and space, affected by weather conditions, urban activities, etc. In the paper, we propose a passenger flow prediction approach named Yildun, which constructs an encoder-decoder neural network and captures the spatial and temporal correlations inherent in passenger flows. More specifically, to predict the passenger flows at each and every station, a spatial attention mechanism is presented to adaptively extract inter-station correlations of flows by referring to the previous hidden state of the encoder at each time step. Meanwhile, a temporal attention mechanism is employed to capture time-dependent connections of flows by selecting relevant hidden states of the encoder across all time steps. Further, extra factors, such as POI (Point of Interest) data and day of the week, are fused in the decoder. With this spatio-temporal attention scheme, Yildun not only can make predictions effectively, but also is easily explainable. Extensive experiments are conducted on large-scale real-world data. The experimental results show that Yildun can predict passenger flows with small prediction errors and outperforms five baselines significantly.

References

[1]
Ni M, He Q, Gao J. 2017. Forecasting the subway passenger flow under event occurrences with social media. IEEE Transactions on Intelligent Transportation Systems, 18(6), 1623--1632.
[2]
Zheng Y, Capra L, Wolfson O, et al. 2014. Urban computing: concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology, 5(3), 38.
[3]
Pan B, Demiryurek U, Shahabi C. 2012. Utilizing real-world transportation data for accurate traffic prediction. In proceedings of the 12th IEEE International Conference on Data Mining (ICDM'12), 595--604.
[4]
Zhou J, Tung A K H. 2015. Smiler: A semi-lazy time series prediction system for sensors. In Proceedings of the 2015 ACM International Conference on Management of Data (SIGMOD'15). ACM, 1871--1886.
[5]
Hu J, Gao P, Yao Y, et al. 2014. Traffic flow forecasting with particle swarm optimization and support vector regression. In Proceedings of the IEEE 17th International Conference on Intelligent Transportation Systems (ITSC'14), 2267--2268.
[6]
Yang B, Guo C, Jensen C S. 2013. Travel cost inference from sparse, spatio temporally correlated time series using Markov models. Proceedings of the VLDB Endowment, 6(9), 769--780.
[7]
Qin Y, Song D, Chen H, et al. 2017. A dual-stage attention-based recurrent neural network for time series prediction. arXiv preprint arXiv, 1704.02971.
[8]
E. Vlahogianni, M. Karlaftis, and J. Golias. 2014. Short-term traffic forecasting: Where we are and where we're going. Transportation Research Part C: Emerging Technologies, 43, 3--19.
[9]
Kamarianakis Y, Prastacos P. 2003. Forecasting traffic flow conditions in an urban network: Comparison of multivariate and univariate approaches. Transportation Research Record: Journal of the Transportation Research Board, 1857, 74--84.
[10]
Williams B M, Hoel L A. 2003. Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results. Journal of transportation engineering, 129(6), 664--672.
[11]
Li L, Su X, Zhang Y, et al. 2015. Trend modeling for traffic time series analysis: An integrated study. IEEE Transactions on Intelligent Transportation Systems, 16(6), 3430--3439.
[12]
Guo J, Huang W, Williams B M. 2014. Adaptive Kalman filter approach for stochastic short-term traffic flow rate prediction and uncertainty quantification. Transportation Research Part C: Emerging Technologies, 43, 50--64.
[13]
Chang H, Lee Y, Yoon B, et al. 2012. Dynamic near-term traffic flow prediction: Systemoriented approach based on past experiences. IET intelligent transport systems, 6(3), 292--305.
[14]
Jeong Y S, Byon Y J, Castro-Neto M M, et al. 2013. Supervised weighting-online learning algorithm for short-term traffic flow prediction. IEEE Transactions on Intelligent Transportation Systems, 14(4), 1700--1707.
[15]
Lippi M, Bertini M, Frasconi P. 2013. Short-term traffic flow forecasting: An experimental comparison of time-series analysis and supervised learning. IEEE Transactions on Intelligent Transportation Systems, 14(2), 871--882.
[16]
Xu Y, Kong Q J, Klette R, et al. 2014. Accurate and interpretable Bayesian mars for traffic flow prediction. IEEE Transactions on Intelligent Transportation Systems, 15(6), 2457--2469.
[17]
Chen P T, Chen F, Qian Z. 2014. Road traffic congestion monitoring in social media with hinge-loss Markov random fields. In Proceedings of the IEEE International Conference on Data Mining (ICDM' 14), 80--89.
[18]
D Deng, C Shahabi, U Demiryurek, L Zhu, R Yu. 2016. Latent Space Model for Road Networks to Predict Time-varying Traffic. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'16), 1525--1534.
[19]
Gong Y, Li Z, Zhang J, et al. 2018. Network-wide Crowd Flow Prediction of Sydney Trains via Customized Online Non-negative Matrix Factorization. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM'18). ACM, 1243--1252.
[20]
Wang D, Zhang J, Cao W, et al. 2018. When Will You Arrive? Estimating Travel Time Based on Deep Neural Networks. In Proceedings of the 32nd Association for the Advancement of Artificial Intelligence (AAAI'18).
[21]
Zhang J, Zheng Y, Qi D, et al. 2016. DNN-based prediction model for spatio-temporal data. In Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, 92.
[22]
Lv Y, Duan Y, Kang W, et al. 2014. Traffic Flow Prediction With Big Data: A Deep Learning Approach. IEEE Transactions on Intelligent Transportation Systems, 16(2), 1--9.
[23]
Huang W, Song G, Hong H, et al. 2014. Deep Architecture for Traffic Flow Prediction: Deep Belief Networks With Multitask Learning. IEEE Transactions on Intelligent Transportation Systems, 15(5), 2191--2201.
[24]
Zhang J, Zheng Y, Qi D. 2017. Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction. In Proceedings of the 31st Association for the Advancement of Artificial Intelligence (AAAI'17), 1655--1661.
[25]
Hochreiter S, Schmidhuber J. 1997. Long short-term memory. Neural computation, 9(8), 1735--1780.
[26]
Cho K, Van Merriënboer B, Gulcehre C, et al. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078.
[27]
Zhang Y, Zhang Y, Haghani A. 2014. A hybrid short-term traffic flow forecasting method based on spectral analysis and statistical volatility model. Transportation Research Part C: Emerging Technologies, 43: 65--78.
[28]
Lin L, Ni M, He Q, et al. 2015. Modeling the impacts of inclement weather on freeway traffic speed: exploratory study with social media data[J]. Transportation Research Record: Journal of the Transportation Research Board, 2482, 82--89.
[29]
Zhang F, Jin B, Ge T, et al. 2016. Who are My Familiar Strangers?: Revealing Hidden Friend Relations and Common Interests from Smart Card Data. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 619--628.
[30]
Sutskever I, Vinyals O, Le Q V. 2014. Sequence to sequence learning with neural networks. In Proceedings of the Advances in neural information processing systems, 3104--3112.
[31]
Cho K, Van Merriënboer B, Bahdanau D, et al. 2014. On the properties of neural machine translation: Encoder-decoder approaches. arXiv preprint arXiv:1409.1259.
[32]
Kingma D P, Ba J. 2014. Adam: A method for stochastic optimization[J]. arXiv preprint arXiv:1412.6980.
[33]
Abadi M, Barham P, Chen J, et al. 2016. Tensorflow: a system for large-scale machine learning[C]. OSDI. 16: 265--283.
[34]
Box G E P, Pierce D A. 1970. Distribution of residual autocorrelations in autoregressive-integrated moving average time series models. Journal of the American statistical Association, 65(332), 1509--1526.
[35]
Zivot E, Wang J. 2006. Vector autoregressive models for multivariate time series. Modeling Financial Time Series with S-PLUS®, 385--429.
[36]
Friedman J H. 2001. Greedy function approximation: a gradient boosting machine. Annals of statistics, 1189--1232.

Cited By

View all
  • (2024)Predicting dust pollution from dry bulk ports in coastal cities: A hybrid approach based on data decomposition and deep learningEnvironmental Pollution10.1016/j.envpol.2024.124053350(124053)Online publication date: Jun-2024
  • (2024)Machine Learning for public transportation demand prediction: A Systematic Literature ReviewEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109166137(109166)Online publication date: Nov-2024
  • (2024)Spatial Network-Wide Traffic Flow Imputation with Graph Neural NetworkInternational Journal of Intelligent Transportation Systems Research10.1007/s13177-024-00456-7Online publication date: 10-Dec-2024
  • Show More Cited By

Index Terms

  1. A deep spatio-temporal attention-based neural network for passenger flow prediction

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    MobiQuitous '19: Proceedings of the 16th EAI International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services
    November 2019
    545 pages
    ISBN:9781450372831
    DOI:10.1145/3360774
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 February 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. attention mechanism
    2. deep learning
    3. intelligent transportation system
    4. passenger flow prediction

    Qualifiers

    • Research-article

    Funding Sources

    • Ministry of Transportation of China
    • National Natural Science Foundation of China

    Conference

    MobiQuitous
    MobiQuitous: Computing, Networking and Services
    November 12 - 14, 2019
    Texas, Houston, USA

    Acceptance Rates

    Overall Acceptance Rate 26 of 87 submissions, 30%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 07 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Predicting dust pollution from dry bulk ports in coastal cities: A hybrid approach based on data decomposition and deep learningEnvironmental Pollution10.1016/j.envpol.2024.124053350(124053)Online publication date: Jun-2024
    • (2024)Machine Learning for public transportation demand prediction: A Systematic Literature ReviewEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109166137(109166)Online publication date: Nov-2024
    • (2024)Spatial Network-Wide Traffic Flow Imputation with Graph Neural NetworkInternational Journal of Intelligent Transportation Systems Research10.1007/s13177-024-00456-7Online publication date: 10-Dec-2024
    • (2024)A survey on overlapping community detection: label propagationMultimedia Tools and Applications10.1007/s11042-024-20485-4Online publication date: 16-Dec-2024
    • (2023)Multivariate Time-Series Forecasting: A Review of Deep Learning Methods in Internet of Things Applications to Smart CitiesSmart Cities10.3390/smartcities60501146:5(2519-2552)Online publication date: 23-Sep-2023
    • (2023)Self-Adaptive Predictive Passenger Flow Modeling for Large-Scale Railway SystemsIEEE Internet of Things Journal10.1109/JIOT.2023.327042710:16(14182-14196)Online publication date: 15-Aug-2023
    • (2022)A Two-Stage Self-adaptive Model for Passenger Flow Prediction on Schedule-Based Railway SystemAdvances in Knowledge Discovery and Data Mining10.1007/978-3-031-05981-0_12(147-160)Online publication date: 10-May-2022

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media