Skip to main content
Log in

A distributed EMDN-GRU model on Spark for passenger waiting time forecasting

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

It is hard to forecast waiting time from mobile trajectory big data on the traditional centralized mining platform, and especially the taxi driving direction cannot be clearly distinguished by the GPS trajectories of taxicabs in intelligent transportation systems. To this end, we propose a direction identification method (named CA-D) combined with Coordinate Axis and GPS Direction to distinguish taxi driving direction and then establish a distributed model (named EMDN-GRU) on Spark to forecast passenger waiting time based on an Empirical Mode Decomposition (EMD) algorithm with Normalization and a Gated Recurrent Unit (GRU) model. Specifically, in the process of waiting time forecasting, the CA-D method is used to differentiate directions to reduce the data interference caused by different taxi driving directions. Furthermore, the EMD algorithm with normalization is utilized to process large-scale GPS trajectory data. Finally, the GRU model with adjusted parameters is employed to forecast the time-series data obtained in the previous step, and the forecasting results are denormalized and superimposed to produce waiting time for passengers. Compared with LSTM, GRU, EMD-LSTM, EMD-GRU, CNN, and BP, the experimental results from a case study indicate that EMDN-GRU is significantly superior to others. In particular, from three data sets of weekday, weekend, and one week, the MAPE values of EMDN-GRU are reduced by 92.48%, 95.01%, and 90.47% at most.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

  1. Yang C, Chen J (2016) A scalable data chunk similarity based compression approach for efficient big sensing data processing on cloud. IEEE Transactions on Knowledge and Data Engineering 29:1144–1157

    Article  Google Scholar 

  2. Basanta-Val P, Audsley NC, Wellings AJ, Gray I, Fernández-García N (2016) Architecting time-critical big-data systems. IEEE Transactions on Big Data 2:310–324

    Article  Google Scholar 

  3. Asadianfam S, Shamsi M, Kenari AR (2020) Big data platform of traffic violation detection system: identifying the risky behaviors of vehicle drivers. Multimedia Tools and Applications 79:24645–24684

    Article  Google Scholar 

  4. Xia D, Zhang M, Yan X, Bai Y, Zheng Y, Li Y, Li H (2021) A distributed WND-LSTM model on MapReduce for short-term traffic flow prediction. Neural Computing and Applications 33:2393–2410

    Article  Google Scholar 

  5. Xia D, Jiang S, Yang N, Hu Y, Li Y, Li H, Wang L (2021) Discovering spatiotemporal characteristics of passenger travel with mobile trajectory big data. Physica A: Statistical Mechanics and its Applications 578:126056

    Article  Google Scholar 

  6. Lu R, Jin X, Zhang S, Qiu M, Wu X (2018) A study on big knowledge and its engineering issues. IEEE Transactions on Knowledge and Data Engineering 31:1630–1644

    Article  Google Scholar 

  7. Guo S, Yu L, Chen X, Zhang Y (2010) The modeling of waiting time for passengers to transfer from rail to buses based-on passenger classification, Technical Report

  8. Vázquez JJ, Arjona J, Linares M, Casanovas-Garcia J (2020) A comparison of deep learning methods for urban traffic forecasting using floating car data. Transportation Research Procedia 47:195–202

    Article  Google Scholar 

  9. Saâdaoui F, Messaoud OB (2020) Multiscaled neural autoregressive distributed lag: A new empirical mode decomposition model for nonlinear time series forecasting. International Journal of Neural Systems 30:2050039

    Article  Google Scholar 

  10. Liu Z, Liu J (2020) A robust time series prediction method based on empirical mode decomposition and high-order fuzzy cognitive maps. Knowledge-Based Systems 203:1–33

    Article  Google Scholar 

  11. Nunes JC, Bouaoune Y, Delechelle E, Niang O, Bunel P (2003) Image analysis by bidimensional empirical mode decomposition. Image and Vision Computing 21:1019–1026

    Article  Google Scholar 

  12. Wang Z, Tianrui LI, Cheng Y, Wang Y, Xiuwen YI Prediction of probability of hitting vacant taxi and waiting time based on empirical distribution, In: 2015 Computer Engineering and Applications, IEEE, pp. 254–259

  13. Qi G, Pan G, Li S, Wu Z, Zhang D, Sun L, Yang LT How long a passenger waits for a vacant taxi–large-scale taxi trace mining for smart cities, In: 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing, IEEE, pp. 1029–1036

  14. Xu X, Zhou J, Liu Y, Xu Z, Zhao X (2014) Taxi-RS: Taxi-hunting recommendation system based on taxi GPS data. IEEE Transactions on Intelligent Transportation Systems 16:1716–1727

    Article  Google Scholar 

  15. Hwang R-H, Hsueh Y-L, Chen Y-T (2015) An effective taxi recommender system based on a spatio-temporal factor analysis model. Information Sciences 314:28–40

    Article  Google Scholar 

  16. Qiu Z, Li H, Hong S, Lin Y, Fan N, Ou G, Wang T, Fan L Finding vacant taxis using large scale GPS traces, In: 2014 International Conference on Web-Age Information Management, Springer, pp. 793–804

  17. Jing W, Hu L, Shu L, Mukherjee M, Hara T (2016) RPR: recommendation for passengers by roads based on cloud computing and taxis traces data. Personal and Ubiquitous Computing 20:337–347

    Article  Google Scholar 

  18. Qiu J, Du L, Zhang D, Su S, Tian Z (2019) Nei-TTE: intelligent traffic time estimation based on fine-grained time derivation of road segments for smart city. IEEE Transactions on Industrial Informatics 16:2659–2666

    Article  Google Scholar 

  19. Wang D, Zhang J, Cao W, Li J, Zheng Y When will you arrive? estimating travel time based on deep neural networks, In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2500–2507

  20. Fei J, Lu Y, Guo Y, Zhang H (2020) Predicting bus arrival time using BP neural network and dynamic transfer. Procedia Computer Science 174:95–100

    Article  Google Scholar 

  21. Pang J, Huang J, Du Y, Yu H, Huang Q, Yin B (2018) Learning to predict bus arrival time from heterogeneous measurements via recurrent neural network. IEEE Transactions on Intelligent Transportation Systems 20:3283–3293

    Article  Google Scholar 

  22. Chen C, Wang H, Yuan F, Jia H, Yao B (2020) Bus travel time prediction based on Deep Belief Network with back-propagation. Neural Computing and Applications 32:10435–10449

    Article  Google Scholar 

  23. Ma J, Chan J, Ristanoski G, Rajasegarar S, Leckie C (2019) Bus travel time prediction with real-time traffic information. Transportation Research Part C: Emerging Technologies 105:536–549

    Article  Google Scholar 

  24. He P, Jiang G, Lam S-K, Tang D (2018) Travel-time prediction of bus journey with multiple bus trips. IEEE Transactions on Intelligent Transportation Systems 20:4192–4205

    Article  Google Scholar 

  25. Petersen NC, Rodrigues F, Pereira FC (2019) Multi-output bus travel time prediction with convolutional LSTM neural network. Expert Systems with Applications 120:426–435

    Article  Google Scholar 

  26. He P, Jiang G, Lam S-K, Sun Y (2020) Learning heterogeneous traffic patterns for travel time prediction of bus journeys. Information Sciences 512:1394–1406

    Article  Google Scholar 

  27. Achar A, Bharathi D, Kumar BA, Vanajakshi L (2019) Bus arrival time prediction: A spatial kalman filter approach. IEEE Transactions on Intelligent Transportation Systems 21:1298–1307

    Article  Google Scholar 

  28. Huang NE, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, Yen N.-C, Tung C.C, Liu HH (1998) The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences 454 ,903–995

  29. Lee T (2020) EMD and LSTM hybrid deep learning model for predicting sunspot number time series with a cyclic pattern. Solar Physics 295:1–23

    Article  Google Scholar 

  30. Duan WY, Huang LM, Han Y, Zhang YH, Huang S (2015) A hybrid AR-EMD-SVR model for the short-term prediction of nonlinear and non-stationary ship motion. Journal of Zhejiang University-SCIENCE A 16:562–576

    Article  Google Scholar 

  31. Chen Q, Wen D, Li X, Chen D, Lv H, Zhang J, Gao P (2019) Empirical mode decomposition based long short-term memory neural network forecasting model for the short-term metro passenger flow. PloS one 14:e0222365

    Article  Google Scholar 

  32. Rezaei H, Faaljou H, Mansourfar G (2021) Stock price prediction using deep learning and frequency decomposition. Expert Systems with Applications 169:114332

    Article  Google Scholar 

  33. Zhao W, Yang H, Li J, Shang L, Hu L, Fu Q Network traffic prediction in network security based on EMD and LSTM, In: 2021 Proceedings of the 9th International Conference on Computer Engineering and Networks, Springer, pp. 509–518

  34. Jiang T, Zhou C, Zhang H Time series forecasting with an EMD-LSSVM-PSO ensemble adaptive learning paradigm, In: Proceedings of the 2018 International Conference on Computational Intelligence and Intelligent Systems, pp. 44–50

  35. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780

    Article  Google Scholar 

  36. Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y Learning phrase representations using RNN encoder-decoder for statistical machine translation, in: Empirical Methods in Natural Language Processing, pp. 1724–1734

  37. Fukushima K (1980) A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics 36:193–202

    Article  Google Scholar 

  38. Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323:533–536

    Article  Google Scholar 

Download references

Acknowledgements

This work described in this paper was supported in part by the National Natural Science Foundation of China (Grant Nos. 62162012, 62173278, and 62072061), the Science and Technology Support Program of Guizhou (Grant No. QKHZC2021YB531), the Youth Science and Technology Talents Development Project of Colleges and Universities in Guizhou (Grant No. QJHKY2022175), and the Scientific Research Platform Project of Guizhou Minzu University (Grant No. GZMUSYS[2021]04). Yu Bai and Jian Geng contributed equally to this work and should be considered as co-second authors.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Huaqing Li.

Ethics declarations

Conflict of interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xia, D., Bai, Y., Geng, J. et al. A distributed EMDN-GRU model on Spark for passenger waiting time forecasting. Neural Comput & Applic 34, 19035–19050 (2022). https://doi.org/10.1007/s00521-022-07482-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-022-07482-0

Keywords

Navigation