A distributed EMDN-GRU model on Spark for passenger waiting time forecasting

Xia, Dawen; Bai, Yu; Geng, Jian; Zhang, Wenyong; Hu, Yang; Li, Yantao; Li, Huaqing

doi:10.1007/s00521-022-07482-0

A distributed EMDN-GRU model on Spark for passenger waiting time forecasting

Original Article
Published: 29 June 2022

Volume 34, pages 19035–19050, (2022)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Dawen Xia¹,
Yu Bai¹,
Jian Geng¹,
Wenyong Zhang¹,
Yang Hu²,
Yantao Li³ &
…
Huaqing Li ORCID: orcid.org/0000-0001-6310-8965⁴

303 Accesses
3 Citations
Explore all metrics

Abstract

It is hard to forecast waiting time from mobile trajectory big data on the traditional centralized mining platform, and especially the taxi driving direction cannot be clearly distinguished by the GPS trajectories of taxicabs in intelligent transportation systems. To this end, we propose a direction identification method (named CA-D) combined with Coordinate Axis and GPS Direction to distinguish taxi driving direction and then establish a distributed model (named EMDN-GRU) on Spark to forecast passenger waiting time based on an Empirical Mode Decomposition (EMD) algorithm with Normalization and a Gated Recurrent Unit (GRU) model. Specifically, in the process of waiting time forecasting, the CA-D method is used to differentiate directions to reduce the data interference caused by different taxi driving directions. Furthermore, the EMD algorithm with normalization is utilized to process large-scale GPS trajectory data. Finally, the GRU model with adjusted parameters is employed to forecast the time-series data obtained in the previous step, and the forecasting results are denormalized and superimposed to produce waiting time for passengers. Compared with LSTM, GRU, EMD-LSTM, EMD-GRU, CNN, and BP, the experimental results from a case study indicate that EMDN-GRU is significantly superior to others. In particular, from three data sets of weekday, weekend, and one week, the MAPE values of EMDN-GRU are reduced by 92.48%, 95.01%, and 90.47% at most.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

A distributed EEMDN-SABiGRU model on Spark for passenger hotspot prediction

Article 22 September 2023

Dawen Xia, Jian Geng, … Huaqing Li

SW-BiLSTM: a Spark-based weighted BiLSTM model for traffic flow forecasting

Article 18 March 2022

Dawen Xia, Nan Yang, … Huaqing Li

Freight Vehicle Travel Time Prediction Using Sparse Gaussian Processes Regression with Trajectory Data

References

Yang C, Chen J (2016) A scalable data chunk similarity based compression approach for efficient big sensing data processing on cloud. IEEE Transactions on Knowledge and Data Engineering 29:1144–1157
Article Google Scholar
Basanta-Val P, Audsley NC, Wellings AJ, Gray I, Fernández-García N (2016) Architecting time-critical big-data systems. IEEE Transactions on Big Data 2:310–324
Article Google Scholar
Asadianfam S, Shamsi M, Kenari AR (2020) Big data platform of traffic violation detection system: identifying the risky behaviors of vehicle drivers. Multimedia Tools and Applications 79:24645–24684
Article Google Scholar
Xia D, Zhang M, Yan X, Bai Y, Zheng Y, Li Y, Li H (2021) A distributed WND-LSTM model on MapReduce for short-term traffic flow prediction. Neural Computing and Applications 33:2393–2410
Article Google Scholar
Xia D, Jiang S, Yang N, Hu Y, Li Y, Li H, Wang L (2021) Discovering spatiotemporal characteristics of passenger travel with mobile trajectory big data. Physica A: Statistical Mechanics and its Applications 578:126056
Article Google Scholar
Lu R, Jin X, Zhang S, Qiu M, Wu X (2018) A study on big knowledge and its engineering issues. IEEE Transactions on Knowledge and Data Engineering 31:1630–1644
Article Google Scholar
Guo S, Yu L, Chen X, Zhang Y (2010) The modeling of waiting time for passengers to transfer from rail to buses based-on passenger classification, Technical Report
Vázquez JJ, Arjona J, Linares M, Casanovas-Garcia J (2020) A comparison of deep learning methods for urban traffic forecasting using floating car data. Transportation Research Procedia 47:195–202
Article Google Scholar
Saâdaoui F, Messaoud OB (2020) Multiscaled neural autoregressive distributed lag: A new empirical mode decomposition model for nonlinear time series forecasting. International Journal of Neural Systems 30:2050039
Article Google Scholar
Liu Z, Liu J (2020) A robust time series prediction method based on empirical mode decomposition and high-order fuzzy cognitive maps. Knowledge-Based Systems 203:1–33
Article Google Scholar
Nunes JC, Bouaoune Y, Delechelle E, Niang O, Bunel P (2003) Image analysis by bidimensional empirical mode decomposition. Image and Vision Computing 21:1019–1026
Article Google Scholar
Wang Z, Tianrui LI, Cheng Y, Wang Y, Xiuwen YI Prediction of probability of hitting vacant taxi and waiting time based on empirical distribution, In: 2015 Computer Engineering and Applications, IEEE, pp. 254–259
Qi G, Pan G, Li S, Wu Z, Zhang D, Sun L, Yang LT How long a passenger waits for a vacant taxi–large-scale taxi trace mining for smart cities, In: 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing, IEEE, pp. 1029–1036
Xu X, Zhou J, Liu Y, Xu Z, Zhao X (2014) Taxi-RS: Taxi-hunting recommendation system based on taxi GPS data. IEEE Transactions on Intelligent Transportation Systems 16:1716–1727
Article Google Scholar
Hwang R-H, Hsueh Y-L, Chen Y-T (2015) An effective taxi recommender system based on a spatio-temporal factor analysis model. Information Sciences 314:28–40
Article Google Scholar
Qiu Z, Li H, Hong S, Lin Y, Fan N, Ou G, Wang T, Fan L Finding vacant taxis using large scale GPS traces, In: 2014 International Conference on Web-Age Information Management, Springer, pp. 793–804
Jing W, Hu L, Shu L, Mukherjee M, Hara T (2016) RPR: recommendation for passengers by roads based on cloud computing and taxis traces data. Personal and Ubiquitous Computing 20:337–347
Article Google Scholar
Qiu J, Du L, Zhang D, Su S, Tian Z (2019) Nei-TTE: intelligent traffic time estimation based on fine-grained time derivation of road segments for smart city. IEEE Transactions on Industrial Informatics 16:2659–2666
Article Google Scholar
Wang D, Zhang J, Cao W, Li J, Zheng Y When will you arrive? estimating travel time based on deep neural networks, In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2500–2507
Fei J, Lu Y, Guo Y, Zhang H (2020) Predicting bus arrival time using BP neural network and dynamic transfer. Procedia Computer Science 174:95–100
Article Google Scholar
Pang J, Huang J, Du Y, Yu H, Huang Q, Yin B (2018) Learning to predict bus arrival time from heterogeneous measurements via recurrent neural network. IEEE Transactions on Intelligent Transportation Systems 20:3283–3293
Article Google Scholar
Chen C, Wang H, Yuan F, Jia H, Yao B (2020) Bus travel time prediction based on Deep Belief Network with back-propagation. Neural Computing and Applications 32:10435–10449
Article Google Scholar
Ma J, Chan J, Ristanoski G, Rajasegarar S, Leckie C (2019) Bus travel time prediction with real-time traffic information. Transportation Research Part C: Emerging Technologies 105:536–549
Article Google Scholar
He P, Jiang G, Lam S-K, Tang D (2018) Travel-time prediction of bus journey with multiple bus trips. IEEE Transactions on Intelligent Transportation Systems 20:4192–4205
Article Google Scholar
Petersen NC, Rodrigues F, Pereira FC (2019) Multi-output bus travel time prediction with convolutional LSTM neural network. Expert Systems with Applications 120:426–435
Article Google Scholar
He P, Jiang G, Lam S-K, Sun Y (2020) Learning heterogeneous traffic patterns for travel time prediction of bus journeys. Information Sciences 512:1394–1406
Article Google Scholar
Achar A, Bharathi D, Kumar BA, Vanajakshi L (2019) Bus arrival time prediction: A spatial kalman filter approach. IEEE Transactions on Intelligent Transportation Systems 21:1298–1307
Article Google Scholar
Huang NE, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, Yen N.-C, Tung C.C, Liu HH (1998) The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences 454 ,903–995
Lee T (2020) EMD and LSTM hybrid deep learning model for predicting sunspot number time series with a cyclic pattern. Solar Physics 295:1–23
Article Google Scholar
Duan WY, Huang LM, Han Y, Zhang YH, Huang S (2015) A hybrid AR-EMD-SVR model for the short-term prediction of nonlinear and non-stationary ship motion. Journal of Zhejiang University-SCIENCE A 16:562–576
Article Google Scholar
Chen Q, Wen D, Li X, Chen D, Lv H, Zhang J, Gao P (2019) Empirical mode decomposition based long short-term memory neural network forecasting model for the short-term metro passenger flow. PloS one 14:e0222365
Article Google Scholar
Rezaei H, Faaljou H, Mansourfar G (2021) Stock price prediction using deep learning and frequency decomposition. Expert Systems with Applications 169:114332
Article Google Scholar
Zhao W, Yang H, Li J, Shang L, Hu L, Fu Q Network traffic prediction in network security based on EMD and LSTM, In: 2021 Proceedings of the 9th International Conference on Computer Engineering and Networks, Springer, pp. 509–518
Jiang T, Zhou C, Zhang H Time series forecasting with an EMD-LSSVM-PSO ensemble adaptive learning paradigm, In: Proceedings of the 2018 International Conference on Computational Intelligence and Intelligent Systems, pp. 44–50
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780
Article Google Scholar
Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y Learning phrase representations using RNN encoder-decoder for statistical machine translation, in: Empirical Methods in Natural Language Processing, pp. 1724–1734
Fukushima K (1980) A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics 36:193–202
Article Google Scholar
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323:533–536
Article Google Scholar

Download references

Acknowledgements

This work described in this paper was supported in part by the National Natural Science Foundation of China (Grant Nos. 62162012, 62173278, and 62072061), the Science and Technology Support Program of Guizhou (Grant No. QKHZC2021YB531), the Youth Science and Technology Talents Development Project of Colleges and Universities in Guizhou (Grant No. QJHKY2022175), and the Scientific Research Platform Project of Guizhou Minzu University (Grant No. GZMUSYS[2021]04). Yu Bai and Jian Geng contributed equally to this work and should be considered as co-second authors.

Author information

Authors and Affiliations

College of Data Science and Information Engineering, Guizhou Minzu University, Guiyang, 550025, China
Dawen Xia, Yu Bai, Jian Geng & Wenyong Zhang
Department of Automotive Engineering, Guizhou Traffic Technician and Transportation College, Guiyang, 550008, China
Yang Hu
College of Computer Science, Chongqing University, Chongqing, 400044, China
Yantao Li
College of Electronic and Information Engineering, Southwest University, Chongqing, 400715, China
Huaqing Li

Authors

Dawen Xia
View author publications
You can also search for this author in PubMed Google Scholar
Yu Bai
View author publications
You can also search for this author in PubMed Google Scholar
Jian Geng
View author publications
You can also search for this author in PubMed Google Scholar
Wenyong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yang Hu
View author publications
You can also search for this author in PubMed Google Scholar
Yantao Li
View author publications
You can also search for this author in PubMed Google Scholar
Huaqing Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Huaqing Li.

Ethics declarations

Conflict of interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xia, D., Bai, Y., Geng, J. et al. A distributed EMDN-GRU model on Spark for passenger waiting time forecasting. Neural Comput & Applic 34, 19035–19050 (2022). https://doi.org/10.1007/s00521-022-07482-0

Download citation

Received: 01 November 2021
Accepted: 26 May 2022
Published: 29 June 2022
Issue Date: November 2022
DOI: https://doi.org/10.1007/s00521-022-07482-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

A distributed EMDN-GRU model on Spark for passenger waiting time forecasting

Abstract

Access this article

Similar content being viewed by others

A distributed EEMDN-SABiGRU model on Spark for passenger hotspot prediction

SW-BiLSTM: a Spark-based weighted BiLSTM model for traffic flow forecasting

Freight Vehicle Travel Time Prediction Using Sparse Gaussian Processes Regression with Trajectory Data

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A distributed EMDN-GRU model on Spark for passenger waiting time forecasting

Abstract

Access this article

Similar content being viewed by others

A distributed EEMDN-SABiGRU model on Spark for passenger hotspot prediction

SW-BiLSTM: a Spark-based weighted BiLSTM model for traffic flow forecasting

Freight Vehicle Travel Time Prediction Using Sparse Gaussian Processes Regression with Trajectory Data

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation