Abstract
It is hard to forecast waiting time from mobile trajectory big data on the traditional centralized mining platform, and especially the taxi driving direction cannot be clearly distinguished by the GPS trajectories of taxicabs in intelligent transportation systems. To this end, we propose a direction identification method (named CA-D) combined with Coordinate Axis and GPS Direction to distinguish taxi driving direction and then establish a distributed model (named EMDN-GRU) on Spark to forecast passenger waiting time based on an Empirical Mode Decomposition (EMD) algorithm with Normalization and a Gated Recurrent Unit (GRU) model. Specifically, in the process of waiting time forecasting, the CA-D method is used to differentiate directions to reduce the data interference caused by different taxi driving directions. Furthermore, the EMD algorithm with normalization is utilized to process large-scale GPS trajectory data. Finally, the GRU model with adjusted parameters is employed to forecast the time-series data obtained in the previous step, and the forecasting results are denormalized and superimposed to produce waiting time for passengers. Compared with LSTM, GRU, EMD-LSTM, EMD-GRU, CNN, and BP, the experimental results from a case study indicate that EMDN-GRU is significantly superior to others. In particular, from three data sets of weekday, weekend, and one week, the MAPE values of EMDN-GRU are reduced by 92.48%, 95.01%, and 90.47% at most.













Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Yang C, Chen J (2016) A scalable data chunk similarity based compression approach for efficient big sensing data processing on cloud. IEEE Transactions on Knowledge and Data Engineering 29:1144–1157
Basanta-Val P, Audsley NC, Wellings AJ, Gray I, Fernández-García N (2016) Architecting time-critical big-data systems. IEEE Transactions on Big Data 2:310–324
Asadianfam S, Shamsi M, Kenari AR (2020) Big data platform of traffic violation detection system: identifying the risky behaviors of vehicle drivers. Multimedia Tools and Applications 79:24645–24684
Xia D, Zhang M, Yan X, Bai Y, Zheng Y, Li Y, Li H (2021) A distributed WND-LSTM model on MapReduce for short-term traffic flow prediction. Neural Computing and Applications 33:2393–2410
Xia D, Jiang S, Yang N, Hu Y, Li Y, Li H, Wang L (2021) Discovering spatiotemporal characteristics of passenger travel with mobile trajectory big data. Physica A: Statistical Mechanics and its Applications 578:126056
Lu R, Jin X, Zhang S, Qiu M, Wu X (2018) A study on big knowledge and its engineering issues. IEEE Transactions on Knowledge and Data Engineering 31:1630–1644
Guo S, Yu L, Chen X, Zhang Y (2010) The modeling of waiting time for passengers to transfer from rail to buses based-on passenger classification, Technical Report
Vázquez JJ, Arjona J, Linares M, Casanovas-Garcia J (2020) A comparison of deep learning methods for urban traffic forecasting using floating car data. Transportation Research Procedia 47:195–202
Saâdaoui F, Messaoud OB (2020) Multiscaled neural autoregressive distributed lag: A new empirical mode decomposition model for nonlinear time series forecasting. International Journal of Neural Systems 30:2050039
Liu Z, Liu J (2020) A robust time series prediction method based on empirical mode decomposition and high-order fuzzy cognitive maps. Knowledge-Based Systems 203:1–33
Nunes JC, Bouaoune Y, Delechelle E, Niang O, Bunel P (2003) Image analysis by bidimensional empirical mode decomposition. Image and Vision Computing 21:1019–1026
Wang Z, Tianrui LI, Cheng Y, Wang Y, Xiuwen YI Prediction of probability of hitting vacant taxi and waiting time based on empirical distribution, In: 2015 Computer Engineering and Applications, IEEE, pp. 254–259
Qi G, Pan G, Li S, Wu Z, Zhang D, Sun L, Yang LT How long a passenger waits for a vacant taxi–large-scale taxi trace mining for smart cities, In: 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing, IEEE, pp. 1029–1036
Xu X, Zhou J, Liu Y, Xu Z, Zhao X (2014) Taxi-RS: Taxi-hunting recommendation system based on taxi GPS data. IEEE Transactions on Intelligent Transportation Systems 16:1716–1727
Hwang R-H, Hsueh Y-L, Chen Y-T (2015) An effective taxi recommender system based on a spatio-temporal factor analysis model. Information Sciences 314:28–40
Qiu Z, Li H, Hong S, Lin Y, Fan N, Ou G, Wang T, Fan L Finding vacant taxis using large scale GPS traces, In: 2014 International Conference on Web-Age Information Management, Springer, pp. 793–804
Jing W, Hu L, Shu L, Mukherjee M, Hara T (2016) RPR: recommendation for passengers by roads based on cloud computing and taxis traces data. Personal and Ubiquitous Computing 20:337–347
Qiu J, Du L, Zhang D, Su S, Tian Z (2019) Nei-TTE: intelligent traffic time estimation based on fine-grained time derivation of road segments for smart city. IEEE Transactions on Industrial Informatics 16:2659–2666
Wang D, Zhang J, Cao W, Li J, Zheng Y When will you arrive? estimating travel time based on deep neural networks, In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2500–2507
Fei J, Lu Y, Guo Y, Zhang H (2020) Predicting bus arrival time using BP neural network and dynamic transfer. Procedia Computer Science 174:95–100
Pang J, Huang J, Du Y, Yu H, Huang Q, Yin B (2018) Learning to predict bus arrival time from heterogeneous measurements via recurrent neural network. IEEE Transactions on Intelligent Transportation Systems 20:3283–3293
Chen C, Wang H, Yuan F, Jia H, Yao B (2020) Bus travel time prediction based on Deep Belief Network with back-propagation. Neural Computing and Applications 32:10435–10449
Ma J, Chan J, Ristanoski G, Rajasegarar S, Leckie C (2019) Bus travel time prediction with real-time traffic information. Transportation Research Part C: Emerging Technologies 105:536–549
He P, Jiang G, Lam S-K, Tang D (2018) Travel-time prediction of bus journey with multiple bus trips. IEEE Transactions on Intelligent Transportation Systems 20:4192–4205
Petersen NC, Rodrigues F, Pereira FC (2019) Multi-output bus travel time prediction with convolutional LSTM neural network. Expert Systems with Applications 120:426–435
He P, Jiang G, Lam S-K, Sun Y (2020) Learning heterogeneous traffic patterns for travel time prediction of bus journeys. Information Sciences 512:1394–1406
Achar A, Bharathi D, Kumar BA, Vanajakshi L (2019) Bus arrival time prediction: A spatial kalman filter approach. IEEE Transactions on Intelligent Transportation Systems 21:1298–1307
Huang NE, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, Yen N.-C, Tung C.C, Liu HH (1998) The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences 454 ,903–995
Lee T (2020) EMD and LSTM hybrid deep learning model for predicting sunspot number time series with a cyclic pattern. Solar Physics 295:1–23
Duan WY, Huang LM, Han Y, Zhang YH, Huang S (2015) A hybrid AR-EMD-SVR model for the short-term prediction of nonlinear and non-stationary ship motion. Journal of Zhejiang University-SCIENCE A 16:562–576
Chen Q, Wen D, Li X, Chen D, Lv H, Zhang J, Gao P (2019) Empirical mode decomposition based long short-term memory neural network forecasting model for the short-term metro passenger flow. PloS one 14:e0222365
Rezaei H, Faaljou H, Mansourfar G (2021) Stock price prediction using deep learning and frequency decomposition. Expert Systems with Applications 169:114332
Zhao W, Yang H, Li J, Shang L, Hu L, Fu Q Network traffic prediction in network security based on EMD and LSTM, In: 2021 Proceedings of the 9th International Conference on Computer Engineering and Networks, Springer, pp. 509–518
Jiang T, Zhou C, Zhang H Time series forecasting with an EMD-LSSVM-PSO ensemble adaptive learning paradigm, In: Proceedings of the 2018 International Conference on Computational Intelligence and Intelligent Systems, pp. 44–50
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780
Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y Learning phrase representations using RNN encoder-decoder for statistical machine translation, in: Empirical Methods in Natural Language Processing, pp. 1724–1734
Fukushima K (1980) A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics 36:193–202
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323:533–536
Acknowledgements
This work described in this paper was supported in part by the National Natural Science Foundation of China (Grant Nos. 62162012, 62173278, and 62072061), the Science and Technology Support Program of Guizhou (Grant No. QKHZC2021YB531), the Youth Science and Technology Talents Development Project of Colleges and Universities in Guizhou (Grant No. QJHKY2022175), and the Scientific Research Platform Project of Guizhou Minzu University (Grant No. GZMUSYS[2021]04). Yu Bai and Jian Geng contributed equally to this work and should be considered as co-second authors.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Xia, D., Bai, Y., Geng, J. et al. A distributed EMDN-GRU model on Spark for passenger waiting time forecasting. Neural Comput & Applic 34, 19035–19050 (2022). https://doi.org/10.1007/s00521-022-07482-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-022-07482-0