Abstract
Traffic crashes are one of the significant causes of death worldwide, and the prediction of this event is complicated due to many contributing factors. This study used spatial, temporal, and spatiotemporal information to predict crashes in Chicago at 1 km grid levels. A Deep Hybrid Network (DHN) was developed by exploiting inherent unique characteristics of Convolution Neural Network (CNN), Long Short-term Memory (LSTM), and Deep Neural Network (DNN). The hyperparameters of the models were obtained through the Bayesian optimization algorithm. The proposed modeling framework investigated the feature importance, the spatial heterogeneity of predictions, the worst-performing spatial grids, and the spatial distribution of features pertinent to model performance. These analyses transform the proposed DHN into an interpretable and transparent model. The DHN model was compared with Logistic Regression (LR), DNN, CNN, LSTM, and bidirectional LSTM, and it outperformed the baseline models with an accuracy of 0.72, recall of 0.70, false alarm rate of 0.28, and AUC of 0.79. The top three essential features were time, weather, and taxi trips, consecutively. The grid-level distribution of prediction performance investigations revealed a consistent performance of all deep learning models in terms of failed grids (i.e., AUC is 0.5 or less). It was revealed that DHN has the fewest failed grids (i.e., 18 failed from 710 grids) among the experimented models. According to the district level analysis, the O’hare airport area and the central district had the fewest number of failed grids for all methods, while the far south district had the highest number of failed grids. In addition, it was observed that the passed grid had a higher average feature density than the failed grid.









Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
The data of this study are public and open sourced.
Abbreviations
- AUC:
-
Area under curve
- Bi-LSTM:
-
Bidirectional long short-term memory
- CART:
-
Classification and regression tree
- CNN:
-
Convolution neural network
- DHN:
-
Deep hybrid network
- DNN:
-
Deep neural network
- DT:
-
Decision tree
- GPS:
-
Global positioning system
- ITS:
-
Intelligent transportation system
- LSTM:
-
Long short-term memory
- LR:
-
Logistic regression
- PNN:
-
Probabilistic neural network
- RNN:
-
Recurrent neural network
- SVM:
-
Support vector machine
- VMS:
-
Variable message signs
- XGBOOST:
-
EXtreme gradient boosting
- oC:
-
Centigrade
- \({g}_{n}\) :
-
Grid (n)
- k:
-
Timestamp (k)
- K:
-
Total number of timestamps
- L:
-
Vector of model prediction for each spatial grids
- m:
-
Number of timesteps backward
- N:
-
Number of spatial grids
- \({u}_{n}\) :
-
Spatial features located in grid (n)
- U:
-
Vector of spatial features
- V:
-
Vector of input features
- \({w}_{k}\) :
-
Temporal features for timestamp (k)
- W:
-
Vector of temporal features
- \({x}_{t,g}\) :
-
Spatiotemporal feature for grid \({g}_{n}\) and a timestamp \({t}_{k}\)
- X:
-
Vector of spatiotemporal features
References
WHO (2018) WHO | Global status report on road safety 2018. World Health Organization
WISQARS (2020) WISQARS (Web-based Injury Statistics Query and Reporting System)|Injury Center|CDC. https://www.cdc.gov/injury/wisqars/. Accessed 3 Nov 2020
WHO (2004) World report on road traffic injury prevention. https://www.who.int/publications/i/item/world-report-on-road-traffic-injury-prevention. Accessed 3 Nov 2020
Alam A, Besselink B, Turri V et al (2015) Heavy-duty vehicle platooning for sustainable freight transportation: a cooperative method to enhance safety and efficiency. IEEE Control Syst Mag 35:34–56
Parkhun S, Kim SM, Ha YG (2016) Highway traffic accident prediction using VDS big data analysis. J Supercomput 72:2815–2831. https://doi.org/10.1007/s11227-016-1624-z
Tamim Kashifi M, Ahmad I (2022) Efficient histogram-based gradient boosting approach for accident severity prediction with multisource data. Transp Res Record: J Transp Res Board. https://doi.org/10.1177/03611981221074370
Yan X, Wu J (2014) Effectiveness of variable message signs on driving behavior based on a driving simulation experiment. Discret Dyn Nat Soc. https://doi.org/10.1155/2014/206805
Ren H, Song Y, Wang J, et al (2018) A Deep Learning Approach to the Citywide Traffic Accident Risk Prediction. IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC 2018-Novem:3346–3351. https://doi.org/10.1109/ITSC.2018.8569437
Bao J, Liu P, Qin X, Zhou H (2018) Understanding the effects of trip patterns on spatially aggregated crashes with large-scale taxi GPS data. Accid Anal Prev 120:281–294. https://doi.org/10.1016/j.aap.2018.08.014
Chen Q, Song X, Yamada H, Shibasaki R (2016) Learning deep representation from big and heterogeneous data for traffic accident inference. 30th AAAI Conference on Artificial Intelligence, AAAI 2016 338–344
Chen M, Mao S, Liu Y (2014) Big data: a survey. Mobile Netw Appl 19:171–209. https://doi.org/10.1007/s11036-013-0489-0
Huang H, Abdel-Aty MA, Darwiche AL (2010) County-level crash risk analysis in Florida: Bayesian spatial modeling. Transp Res Rec. https://doi.org/10.3141/2148-04
Rhee KA, Kim JK, Lee YI, Ulfarsson GF (2016) Spatial regression analysis of traffic crashes in Seoul. Accid Anal Prev 91:190–199. https://doi.org/10.1016/j.aap.2016.02.023
Bao J, Liu P, Ukkusuri S (2019) A spatiotemporal deep learning approach for citywide short-term crash risk prediction with multi-source data. Accid Anal Prev 122:239–254. https://doi.org/10.1016/j.aap.2018.10.015
Yuan J, Abdel-Aty M, Gong Y, Cai Q (2019) Real-time crash risk prediction using long short-term memory recurrent neural network. Transp Res Rec 2673:314–326. https://doi.org/10.1177/0361198119840611
Abdel-Aty M, Uddin N, Pande A et al (2004) Predicting freeway crashes from loop detector data by matched case-control logistic regression. Transp Res Rec. https://doi.org/10.3141/1897-12
Zheng Z, Ahn S, Monsere CM (2010) Impact of traffic oscillations on freeway crash occurrences. Accid Anal Prev 42:626–636. https://doi.org/10.1016/j.aap.2009.10.009
Xu C, Liu P, Wang W, Li Z (2012) Evaluation of the impacts of traffic states on crash risks on freeways. Accid Anal Prev 47:162–171. https://doi.org/10.1016/j.aap.2012.01.020
Abdel-Aty MA, Hassan HM, Ahmed M, Al-Ghamdi AS (2012) Real-time prediction of visibility related crashes. Transp Res Part C: Emerg Technol 24:288–298. https://doi.org/10.1016/j.trc.2012.04.001
Shi Q, Abdel-Aty M (2015) Big Data applications in real-time traffic operation and safety monitoring and improvement on urban expressways. Transp Res Part C: Emerg Technol 58:380–394. https://doi.org/10.1016/j.trc.2015.02.022
Wang L, Abdel-Aty M, Shi Q, Park J (2015) Real-time crash prediction for expressway weaving segments. Transp Res Part C: Emerg Technol 61:1–10. https://doi.org/10.1016/j.trc.2015.10.008
Michie D, Spiegelhalter DJ, Taylor CC et al (1994) Machine learning. Neural Stat Class 13:1–298
Murphy KP (2012) Machine learning: a probabilistic perspective. MIT press
Jordan MI (1979) Mitchell TM (2015) Machine learning: trends, perspectives, and prospects. Science 349:255–260
Marsland S (2015) Machine learning: an algorithmic perspective. CRC Press
Alpaydin E (2020) Introduction to machine learning. MIT press
Scholkopf B, Sung K-K, Burges CJC et al (1997) Comparing support vector machines with Gaussian kernels to radial basis function classifiers. IEEE Trans Signal Process 45:2758–2765
Yu R, Abdel-Aty M (2013) Utilizing support vector machine in real-time crash risk evaluation. Accid Anal Prev. https://doi.org/10.1016/j.aap.2012.11.027
Parsa AB, Taghipour H, Derrible S, Mohammadian A (2019) Real-time accident detection: Coping with imbalanced data. Accid Anal Prev 129:202–210. https://doi.org/10.1016/j.aap.2019.05.014
Guo M, Zhao X, Yao Y et al (2021) A study of freeway crash risk prediction and interpretation based on risky driving behavior and traffic flow data. Accident Anal Prevention 160:106328
Schmidhuber J (2015) Deep learning in neural networks: an overview. Neural Netw 61:85–117
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444
Zhao Z, Chen W, Wu X et al (2017) LSTM network: a deep learning approach for short-term traffic forecast. IET Image Proc. https://doi.org/10.1049/iet-its.2016.0208
Tian C, Ma J, Zhang C, Zhan P (2018) A deep neural network model for short-term load forecast based on long short-term memory network and convolutional neural network. Energies (Basel). https://doi.org/10.3390/en11123493
Zheng Z, Yang Y, Liu J et al (2019) Deep and embedded learning approach for traffic flow prediction in urban informatics. IEEE Trans Intell Transp Syst. https://doi.org/10.1109/TITS.2019.2909904
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput. https://doi.org/10.1162/neco.1997.9.8.1735
Bengio Y, Simard P, Frasconi P (1994) Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Netw. 5(2):157
Skrobek D, Krzywanski J, Sosnowski M et al (2020) Prediction of sorption processes using the deep learning methods (long short-term memory). Energies (Basel) 13:6601
Alahi A, Goel K, Ramanathan V, et al (2016) Social lstm: Human trajectory prediction in crowded spaces. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 961–971
Cui Z, Ke R, Pu Z, Wang Y (2018) Deep Bidirectional and Unidirectional {LSTM} Recurrent Neural Network for Network-wide Traffic Speed Prediction. CoRR abs/1801.0:1–11
Li P, Abdel-Aty M, Yuan J (2020) Real-time crash risk prediction on arterials based on LSTM-CNN. Accid Anal Prev 135:105371. https://doi.org/10.1016/j.aap.2019.105371
Zhang Z, He Q, Gao J, Ni M (2018) A deep learning approach for detecting traffic accidents from social media data. Transp Res Part C: Emerg Technol 86:580–596. https://doi.org/10.1016/j.trc.2017.11.027
Chen C, Fan X, Zheng C, et al (2018) SDCAE: Stack Denoising Convolutional Autoencoder Model for Accident Risk Prediction Via Traffic Big Data 2018 Sixth International Conference on Advanced Cloud and Big Data SDCAE: Stack Denoising Convolutional Autoencoder Model for Accident Risk Prediction via Traffic Big Data. https://doi.org/10.1109/CBD.2018.00065
Yuan Z, Zhou X, Yang T (2018) Hetero-convlstm: A deep learning approach to traffic accident prediction on heterogeneous spatio-temporal data. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery \& Data Mining. pp 984–992
Data Commons (2020) Chicago - Place Explorer - Data Commons. https://datacommons.org/place/geoId/1714000. Accessed 27 Oct 2020
Continental Motors (2020) How many cars are in Chicago? Commuter Vehicle Numbers. https://www.continentalmotors.com/blog/how-many-cars-are-in-chicago/. Accessed 3 Nov 2020
Arguez A, Durre I, Applequist S, et al (2020) NOAA’s U.S. Climate Normals (1981–2010). Hourly Weather. NOAA National Centers for Environmental Information. https://www.ncdc.noaa.gov/cdo-web/datatools/lcd. Accessed 27 Oct 2020
Chicago Data Portal (2020) Traffic Crashes - Crashes | City of Chicago | Data Portal. https://data.cityofchicago.org/Transportation/Traffic-Crashes-Crashes/85ca-t3if. Accessed 24 Oct 2020
Worldwide Public Holidays (2021) Worldwide Public Holidays Listed by Country - qppstudio.net. https://www.qppstudio.net/worldwide-public-holidays/country-portal.htm. Accessed 31 May 2021
Train KE (2003) Discrete choice methods with simulation. Cambridge University Press
Vapnik V (1999) The nature of statistical learning theory. Springer science and business media
Wang S-C (2003) Artificial neural network. interdisciplinary computing in java programming. Springer
Gers FA, Schmidhuber J, Cummins F (2000) Learning to forget: continual prediction with LSTM. Neural Comput 12:2451–2471. https://doi.org/10.1162/089976600300015015
Xu D, Shi Y, Tsang IW et al (2020) Survey on multi-output learning. IEEE Trans Neural Netw Learn Syst 31:2409–2429. https://doi.org/10.1109/TNNLS.2019.2945133
Núñez JC, Cabido R, Pantrigo JJ et al (2018) Convolutional neural networks and long short-term memory for skeleton-based human activity and hand gesture recognition. Pattern Recogn. https://doi.org/10.1016/j.patcog.2017.10.033
Zheng X, Chen W (2021) An Attention-based Bi-LSTM method for visual object classification via EEG. Biomed Signal Process Control 63:102174. https://doi.org/10.1016/j.bspc.2020.102174
LeCun Y, Boser B, Denker JS et al (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput. https://doi.org/10.1162/neco.1989.1.4.541
Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444. https://doi.org/10.1038/nature14539
Rengasamy D, Jafari M, Rothwell B et al (2020) Deep learning with dynamically weighted loss function for sensor-based prognostics and health management. Sensors (Switzerland) 20:723. https://doi.org/10.3390/s20030723
Snoek J, Larochelle H, Adams RP (2012) Practical Bayesian optimization of machine learning algorithms. Adv Neural Inf Process Syst 4:2951–2959
Ozenne B, Subtil F, Maucort-Boulch D (2015) The precision-recall curve overcame the optimism of the receiver operating characteristic curve in rare diseases. J Clin Epidemiol 68:855–859. https://doi.org/10.1016/j.jclinepi.2015.02.010
Keselman HJ, Rogan JC (1977) The Tukey multiple comparison test: 1953–1976. Psychol Bull 84:1050
Moosavi S, Samavatian MH, Parthasarathy S, et al (2019) Accident risk prediction based on heterogeneous sparse data: New dataset and insights. GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems 33–42. https://doi.org/10.1145/3347146.3359078
Acknowledgements
The authors would like to acknowledge the support of King Fahd University of Petroleum & Minerals in conducting this research.
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors acknowledge that there is no conflict of interest.
Consent for publication
All authors agreed to the submission.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kashifi, M.T., Al-Sghan, I.Y., Rahman, S.M. et al. Spatiotemporal grid-based crash prediction—application of a transparent deep hybrid modeling framework. Neural Comput & Applic 34, 20655–20669 (2022). https://doi.org/10.1007/s00521-022-07511-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-022-07511-y