Abstract
Crash frequency is probably the most commonly used absolute measure for quantifying traffic safety in transportation research. However, it suffers from high variability, caused both by the randomness of crash incidents and by the density distribution of crash incidents that clustering algorithms observe over a varying number of clusters in a large spatial domain. We call this variability traffic safety sensitivity. This paper presents a quantitative measure, called the fatal severity ratio (FSR), that significantly reduces the traffic safety sensitivity problem. A new fatal-point concept is first introduced and then used to normalize the crash frequency in the proposed FSR measure of traffic safety. An extensive empirical study, using several clustering techniques and the 2015 North Carolina fatal crash data set from the Fatality Analysis Reporting System, is conducted to validate and evaluate the fatal-point concept and the FSR measure. The experimental analysis shows that traffic safety sensitivity can be significantly reduced and that, by managing cluster variability, the FSR measure quantifies traffic safety better than the crash frequency measure.
Acknowledgements
The author sincerely thanks the anonymous referees, the associate editor, and the editor for their excellent comments that helped him improve this paper.
Ethics declarations
Conflict of interest
The author declares that he has no conflict of interest.
About this article
Cite this article
Suthaharan, S. A low-sensitivity quantitative measure for traffic safety data analytics. Int J Data Sci Anal 9, 241–256 (2020). https://doi.org/10.1007/s41060-019-00179-z
DOI: https://doi.org/10.1007/s41060-019-00179-z