Abstract
Smart cities came out as highly knowledgeable bio-networks, offering intelligent services and innovative solutions to urban problems. With rapid development, urbanization, and population pressure, traffic congestion and collisions are increasing substantially in road highway zones. Recently, traffic collisions have become one of the hugest national health problems in many cities of the world. Hence, crash injury severity prediction is vital for informing responsible authorities and the public to find alternative ways of dealing with its adverse effects, accordingly improving traffic safety and reducing traffic congestion. In predicting crash injury severity, researchers have explored and applied several techniques to aid in traffic injury management. However, the performance of many techniques suffers from some inherent limitations, including overgeneralization, lack of interpretability for humans, and low-performance accuracy. To address these issues, this paper proposes a novel computational framework based on improved wide and deep learning methods to predict accurately crash injury severity in the context of smart cities. On the crash dataset of New Zealand cities from 2000 to 2020, the proposed model has demonstrated better performance in comparison with the benchmark algorithms. Moreover, SHAP (SHapley Additive exPlanation) is employed to interpret the results and analyze the importance of each determinant of crash severity. The proposed method can help to prevent traffic crashes in smart cities and take proactive measures before the occurrence, also trauma centers can refer to this information to dispatch proper emergency service equipment swiftly and assist injuries to get direct medical care regardless of the crash location.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Abbreviations
- SHAP:
-
SHapley Additive exPlanation
- ICTs:
-
Information and Communication Technologies
- WHO:
-
World Health Organization
- ANN:
-
Artificial Neural Networks
- SVM:
-
Support Vector Machine
- DTs:
-
Decision Trees
- AUC:
-
Area Under the Curve
- ML:
-
Machine Learning
- DL:
-
Deep Learning
- AdaBoost:
-
Adaptive Boosting
- XGBoost:
-
Extreme Gradient Boosting
- KNN:
-
K-Nearest Neighbor
- GB:
-
Gradient Boosting
- OP:
-
Ordered Probit
- LR:
-
Logistic Regression
- RNN:
-
Recurrent Neural Network
- RF:
-
Random Forests
- DNN:
-
Deep Neural Network
- LSTM:
-
Long Short-Term Memory
- STCL-Net:
-
Spatio-Temporal Convolutional Long Short-Term Memory Network
- LSTM-CNN:
-
Long Short-Term Memory Convolutional Neural Network
References
Jensen, M., Gutierrez, J., Pedersen, J.: Location intelligence application in digital data activity dimensioning in smart cities. Procedia Comput. Sci. 36, 418–424 (2014)
Anthopoulos, L.G.: Understanding Smart Cities: A tool for Smart Government or an Industrial Trick? Springer (2017)
Mulligan, C.E., Olsson, M.: Architectural implications of smart city business models: an evolutionary perspective. IEEE Commun. Mag. 51(6), 80–85 (2013)
Madakam, S., Ramaswamy, R.: 100 New smart cities (India's smart vision). In 2015 5th National Symposium on Information Technology: Towards New Smart World (NSITNSW). IEEE. 1–6 (2015)
Yin, C., Xiong, Z., Chen, H., Wang, J., Cooper, D., David, B.: A literature survey on smart cities. Sci. China Inform. Sci. 58(10), 1–18 (2015)
Silva, B.N., Khan, M., Han, K.: Towards sustainable smart cities: a review of trends, architectures, components, and open challenges in smart cities. Sustain. Cities Soc. 38, 697–713 (2018)
Assi, K., Rahman, S.M., Mansoor, U., Ratrout, N.: Predicting crash injury severity with machine learning algorithm synergized with clustering technique: a promising protocol. Int. J. Environ. Res. Public Health 17(15), 5497 (2020)
Shiau, Y. R., Tsai, C. H., Hung, Y. H., Kuo, Y. T.: The application of data mining technology to build a forecasting model for classification of road traffic accidents. Math Probl Eng. (2015)
Iranitalab, A., Khattak, A.: Comparison of four statistical and machine learning methods for crash severity prediction. Accid. Anal. Prev. 108, 27–36 (2017)
World Health Organization: Global Status Report on road Safety 2018: Summary. World Health Organization (2018)
Li, Y., Li, Z., Wang, H., Wang, W., Xing, L.: Evaluating the safety impact of adaptive cruise control in traffic oscillations on freeways. Accid. Anal. Prev. 104, 137–145 (2017)
Cai, Q., Abdel-Aty, M., Yuan, J., Lee, J., Wu, Y.: Real-time crash prediction on expressways using deep generative models. Transp. Res. Part C: Emerg. Technol. 117, 102697 (2020)
Li, Y., Chen, Z., Yin, Y., Peeta, S.: Deployment of roadside units to overcome connectivity gap in transportation networks with mixed traffic. Transp. Res. Part C: Emerg. Technol. 111, 496–512 (2020)
Yu, R., Abdel-Aty, M.: An optimal variable speed limits system to ameliorate traffic safety risk. Transp. Res. part C: Emerg. Technol. 46, 235–246 (2014)
Zong, F., Xu, H., Zhang, H.: Prediction for traffic accident severity: comparing the Bayesian network and regression models. Math Probl Eng (2013)
Hashmienejad, S.H.-A., Hasheminejad, S.M.H.: Traffic accident severity prediction using a novel multi-objective genetic algorithm. Int. J. Crashworthiness 22(4), 425–440 (2017)
Zheng, M., Li, T., Zhu, R., Chen, J., Ma, Z., Tang, M., Cui, Z., Wang, Z.: Traffic accident’s severity prediction: a deep-learning approach-based CNN network. IEEE Access 7, 39897–39910 (2019)
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning: MIT Press (2016)
Gibbons, C., Richards, S., Valderas, J.M., Campbell, J.: Supervised machine learning algorithms can classify open-text feedback of doctor performance with human-level accuracy. J. Med. Internet. Res. 19(3), e65 (2017)
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature. 521(7553), 436–444 (2015)
Minar, M.R., Naher, J.: Recent advances in deep learning: An overview,. arXiv preprint arXiv:1807.08169 (2018)
Nguyen, B.P., Pham, H.N., Tran, H., Nghiem, N., Nguyen, Q.H., Do, T.T., Tran, C.T., Simpson, C.R.: Predicting the onset of type 2 diabetes using wide and deep learning with electronic health records. Comput. Methods Prog. Biomed. 182, 105055 (2019)
Zhang, J., Li, Z., Pu, Z., Xu, C.: Comparing prediction performance for crash injury severity among various machine learning and statistical methods. IEEE Access 6, 60079–60087 (2018)
Nguyen, H., Kieu, L.-M., Wen, T., Cai, C.: Deep learning methods in transportation domain: a review. IET Intel. Transp. Syst. 12(9), 998–1004 (2018)
Karlaftis, M.G., Vlahogianni, E.I.: Statistical methods versus neural networks in transportation research: differences, similarities and some insights. Transp. Res. Part C: Emerg. Technol. 19(3), 387–399 (2011)
Halim, Z., Kalsoom, R., Bashir, S., Abbas, G.: Artificial intelligence techniques for driving safety and vehicle crash prediction. Artif. Intell. Rev. 46(3), 351–387 (2016)
Nassiri, H., Mohamadian Amiri, A.: Prediction of roadway accident frequencies: count regressions versus machine learning models. Sci. Iran. 21(2), 263–275 (2014)
Avelar, R.E., Dixon, K., Ashraf, S.: A comparative analysis on performance of severe crash prediction methods. Transp. Res. Rec. 2672(30), 109–119 (2018)
Li, Z., Liu, P., Wang, W., Xu, C.: Using support vector machine models for crash injury severity analysis. Accid. Anal. Prev. 45, 478–486 (2012)
Wang, L., Abdel-Aty, M., Lee, J., Shi, Q.: Analysis of real-time crash risk for expressway ramps using traffic, geometric, trip generation, and socio-demographic predictors. Accid. Anal. Prev. 122, 378–384 (2019)
Chen, C., Zhang, G., Qian, Z., Tarefder, R.A., Tian, Z.: Investigating driver injury severity patterns in rollover crashes using support vector machine models. Accid. Anal. Prev. 90, 128–139 (2016)
Chen, M.-M., Chen, M.-C.: Modeling road accident severity with comparisons of logistic regression, decision tree and random forest. Information 11(5), 270 (2020)
Sameen, M.I., Pradhan, B.: Severity prediction of traffic accidents with recurrent neural networks. Appl. Sci. 7(6), 476 (2017)
Alkheder, S., Taamneh, M., Taamneh, S.: Severity prediction of traffic accident using an artificial neural network. J. Forecast. 36(1), 100–108 (2017)
Theofilatos, A., Chen, C., Antoniou, C.: Comparing machine learning and deep learning methods for real-time crash prediction. Transp. Res. Rec. 2673(8), 169–178 (2019)
Jiang, F., Yuen, K.K.R., Lee, E.W.M.: A long short-term memory-based framework for crash detection on freeways with traffic data of different temporal resolutions. Accid. Anal. Prev. 141, 105520 (2020)
Bao, J., Liu, P., Ukkusuri, S.V.: A spatiotemporal deep learning approach for citywide short-term crash risk prediction with multi-source data. Accid. Anal. Prev. 122, 239–254 (2019)
Li, P., Abdel-Aty, M., Yuan, J.: Real-time crash risk prediction on arterials based on LSTM-CNN. Accid. Anal. Prev. 135, 105371 (2020)
Zhang, S., Yao, L., Sun, A., Tay, Y.: Deep learning based recommender system: a survey and new perspectives. ACM Comput. Surv. (CSUR) 52(1), 1–38 (2019)
Cheng, H.-T., Koc, L., Harmsen, J., Shaked, T., Chandra, T., Aradhye, H., Anderson, G., Corrado, G., Chai, W., Ispir, M.: Wide & Deep Learning for Recommender Systems. In Proceedings of the 1st workshop on deep learning for recommender systems, 7–10 (2016)
Chatterjee, S.: Learning and Memorization. In International Conference on Machine Learning. PMLR 755–763 (2018)
Arpit, D., Jastrzębski, S., Ballas, N., Krueger, D., Bengio, E., Kanwal, M.S., Maharaj, T., Fischer, A., Courville, A., Bengio, Y.: A Closer Look At Memorization in Deep Networks. In International conference on machine learning. PMLR. 233–242 (2017)
Ramanan, N., Kunapuli, G., Khot, T., Fatemi, B., Kazemi, S.M., Poole, D., Kersting, K., Natarajan, S.: Structure learning for relational logistic regression: an ensemble approach. Data Min. Knowl. Disc. 35(5), 2089–2111 (2021)
Spigler, S., Geiger, M., d’Ascoli, S., Sagun, L., Biroli, G., Wyart, M.: A jamming transition from under-to over-parametrization affects generalization in deep learning. J. Phys. A: Math. Theor. 52(47), 474001 (2019)
Vinayakumar, R., Soman, K., Poornachandran, P., Sachin Kumar, S.: Evaluating deep learning approaches to characterize and classify the DGAs at scale. J. Intell. Fuzzy Syst. 34(3), 1265–1276 (2018)
Bottou, L.: Large-scale machine learning with stochastic gradient descent. Proceedings of COMPSTAT’2010, pp. 177–186. Springer (2010)
Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theory. 13(1), 21–27 (1967)
Zhang, S., Li, X., Zong, M., Zhu, X., Wang, R.: Efficient kNN classification with different numbers of nearest neighbors. IEEE Trans. Neural Netw. Learn. Syst. 29(5), 1774–1785 (2017)
Lim, C., Yu, B.: Estimation stability with cross-validation (ESCV). J. Comput. Graph. Stat. 25(2), 464–492 (2016)
Draisma, J., Horobeţ, E., Ottaviani, G., Sturmfels, B., Thomas, R.R.: The euclidean distance degree of an algebraic variety. Found. Comput. Math. 16(1), 99–149 (2016)
Chen, T., Guestrin, C.: Xgboost: A Scalable Tree Boosting System. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 785–794 (2016)
Freund, Y., Schapire, R.E.: Experiments with a New Boosting Algorithm. In icml, 96, 148–156 (1996)
Wang, R.: AdaBoost for feature selection, classification and its relation with SVM, a review. Phys. Procedia 25, 800–807 (2012)
Friedman, J.H.: Stochastic gradient boosting. Comput. Stat. Data Anal. 38(4), 367–378 (2002)
Friedman, J.H., Meulman, J.J.: Multiple additive regression trees with application in epidemiology. Stat. Med. 22(9), 1365–1381 (2003)
Zheng, Z., Lu, P., Lantz, B.: Commercial truck crash injury severity analysis using gradient boosting data mining model. J. Saf. Res. 65, 115–124 (2018)
Mousa, S.R., Bakhit, P.R., Osman, O.A., Ishak, S.: A comparative analysis of tree-based ensemble methods for detecting imminent lane change maneuvers in connected vehicle environments. Transp. Res. Rec. 2672(42), 268–279 (2018)
Bekkar, M., Djemaa, H.K., Alitouche, T.A.: Evaluation measures for models assessment over imbalanced data sets. J. Inf. Eng. Appl. 3(10) (2013)
Tharwat, A.: Classification assessment methods. Appl Comput Inform. 17(1), 168–192 (2021)
Muschelli, J.: ROC and AUC with a binary predictor: a potentially misleading metric. J. Classif. 37(3), 696–708 (2020)
Koizumi, Y., Murata, S., Harada, N., Saito, S., Uematsu, H.: SNIPER: Few-shot learning for anomaly detection to minimize false-negative rate with ensured true-positive rate. pp. 915–919
Lundberg, S.M., Lee, S.-I.: A unified approach to interpreting model predictions. Advances in neural information processing systems 30 (2017)
Štrumbelj, E., Kononenko, I.: Explaining prediction models and individual predictions with feature contributions. Knowl. Inf. Syst. 41(3), 647–665 (2014)
Ribeiro, M.T., Singh, S., Guestrin, C.: Why should I trust you? Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, 1135–1144 (2016)
Shapley, L.S., Kuhn, H., Tucker, A.: Contributions to the theory of games. Ann. Math. Stud. 28(2), 307–317 (1953)
Yang, F., Wang, X., Ma, H., Li, J.: Transformers-sklearn: a toolkit for medical language understanding with transformer-based models. BMC Med. Inf. Decis. Mak. 21(2), 1–8 (2021)
Nguyen, Q.H., Ly, H.-B., Ho, L.S., Al-Ansari, N., Le, H.V., Tran, V.Q., Prakash, I., Pham, B.T.: Influence of data splitting on performance of machine learning models in prediction of shear strength of soil. Mathematical Problems in Engineering, 1–15 (2021)
Bisong, E.: Introduction to Scikit-learn. In: Building Machine Learning and Deep Learning Models on Google Cloud Platform, pp. 215–229. Springer (2019)
Fischetti, M., Jo, J.: Deep neural networks and mixed integer linear optimization. Constraints 23(3), 296–309 (2018)
Jiang, X., Pang, Y., Li, X., Pan, J.: Speed up deep neural network based pedestrian detection by sharing features across multi-scale models. Neurocomputing 185, 163–170 (2016)
Xie, Z., He, F., Fu, S., Sato, I., Tao, D., Sugiyama, M.: Artificial neural variability for deep learning: on overfitting, noise memorization, and catastrophic forgetting. Neural Comput. 33(8), 2163–2192 (2021)
Schlichtkrull, M., Kipf, T.N., Bloem, P., Van Den Berg, R., Titov, I., Welling, M.: Modeling relational data with graph convolutional networks. In The Semantic Web: 15th International Conference, ESWC 2018, Heraklion, Crete, Greece, June 3–7, 2018, Proceedings 15, Springer International Publishing, 593–607 (2018)
Eldan, R., Shamir, O.: The power of depth for feedforward neural networks. In Conference on learning theory, PMLR, 907–940 (2016)
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)
Takase, T.: Dynamic batch size tuning based on stopping criterion for neural network training. Neurocomputing 429, 1–11 (2021)
Qiao, W., Moayedi, H., Foong, L.K.: Nature-inspired hybrid techniques of IWO, DA, ES, GA, and ICA, validated through a k-fold validation process predicting monthly natural gas consumption. Energy Build. 217, 110023 (2020)
Clark, L.A., Pregibon, D.: Tree-based models. In: Statistical Models in S, pp. 377–419. Routledge (2017)
Cheng, L., Chen, X., De Vos, J., Lai, X., Witlox, F.: Applying a random forest method approach to model travel mode choice behavior. Travel Behav. Soc. 14, 1–10 (2019)
Bentéjac, C., Csörgő, A., Martínez-Muñoz, G.: A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 54(3), 1937–1967 (2021)
Chen, C., Zhang, G., Yang, J., Milton, J.C.: An explanatory analysis of driver injury severity in rear-end crashes using a decision table/Naïve bayes (DTNB) hybrid classifier. Accid. Anal. Prev. 90, 95–107 (2016)
Chen, C., Zhang, G., Tarefder, R., Ma, J., Wei, H., Guan, H.: A multinomial logit model-bayesian network hybrid approach for driver injury severity analyses in rear-end crashes. Accid. Anal. Prev. 80, 76–88 (2015)
Suk, H.-I., Wee, C.-Y., Lee, S.-W., Shen, D.: State-space model with deep learning for functional dynamics estimation in resting-state fMRI. Neuroimage 129, 292–307 (2016)
Zhang, C., Bengio, S., Hardt, M., Recht, B., Vinyals, O.: Understanding deep learning (still) requires rethinking generalization. Commun. ACM 64(3), 107–115 (2021)
Ghiassi, M., Zimbra, D., Lee, S.: Targeted twitter sentiment analysis for brands using supervised feature engineering and the dynamic architecture for artificial neural networks. J. Manage. Inform. Syst. 33(4), 1034–1058 (2016)
Wang, J., Xie, W., Liu, B., Ragland, D.R.: Identification of freeway secondary accidents with traffic shock wave detected by loop detectors. Saf. Sci. 87, 195–201 (2016)
Newnam, S., Lewis, I., Warmerdam, A.: Modifying behaviour to reduce over-speeding in work-related drivers: an objective approach. Accid. Anal. Prev. 64, 23–29 (2014)
Schlögl, M.: A multivariate analysis of environmental effects on road accident occurrence using a balanced bagging approach. Accid. Anal. Prev. 136, 105398 (2020)
Tanishita, M., Van Wee, B.: Impact of vehicle speeds and changes in mean speeds on per vehicle-kilometer traffic accident rates in Japan. IATSS Res. 41(3), 107–112 (2017)
Parsa, A.B., Kamal, K., Taghipour, H., Mohammadian, A.K.: Does security of neighborhoods affect non-mandatory trips? a copula-based joint multinomial-ordinal model of mode and trip distance choices. No. 19–03155 (2019)
Zhang, G., Yau, K.K., Zhang, X., Li, Y.: Traffic accidents involving fatigue driving and their extent of casualties. Accid. Anal. Prev. 87, 34–42 (2016)
Khattak, Z.H., Magalotti, M.J., Fontaine, M.D.: Estimating safety effects of adaptive signal control technology using the empirical Bayes method. J. Saf. Res. 64, 121–128 (2018)
Lundberg, S.M., Nair, B., Vavilala, M.S., Horibe, M., Eisses, M.J., Adams, T., Liston, D.E., Low, D.K.-W., Newman, S.-F., Kim, J.: Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat. Biomed. Eng. 2(10), 749–760 (2018)
Lundberg, S.M., Erion, G., Chen, H., DeGrave, A., Prutkin, J.M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., Lee, S.-I.: From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell 2(1), 56–67 (2020)
Yishui, S., Wei, C., Hongjiang, Z.: Research of highway bottlenecks based on catastrophe theory. In 2015 International Conference on Transportation Information and Safety (ICTIS), IEEE, 138–142 (2015)
Zhu, F., Li, Z., Chen, S., Xiong, G.: Parallel transportation management and control system and its applications in building smart cities. IEEE Trans. Intell. Transp. Syst. 17(6), 1576–1585 (2016)
Acknowledgements
Portions of this research were funded through the projects of the National Science Foundation of China (41971340, 41471333), projects of the Fujian Provincial Department of Science and Technology (2021Y4019, 2020D002, 2020L3014, 2019I0019), and the Foundation of Fujian Provincial Universities Key Laboratory of Industrial Control and Data Analysis (Fujian University of Technology) (KF-X19013).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
None declared.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Niyogisubizo, J., Liao, L., Sun, Q. et al. Predicting Crash Injury Severity in Smart Cities: a Novel Computational Approach with Wide and Deep Learning Model. Int. J. ITS Res. 21, 240–258 (2023). https://doi.org/10.1007/s13177-023-00351-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13177-023-00351-7