Skip to main content
Log in

A low-sensitivity quantitative measure for traffic safety data analytics

  • Applications
  • Published:
International Journal of Data Science and Analytics Aims and scope Submit manuscript

Abstract

Crash frequency is probably the most commonly used absolute measure in transportation research to quantify traffic safety. However, it suffers from high variability that is caused by the randomness of crash incidents and the density distribution of the crash incidents observed by clustering algorithms over a varying number of clusters in a large spatial domain—we can call this variability a traffic safety sensitivity. This paper presents a quantitative measure—called fatal severity ratio (FSR)—that reduces the traffic safety sensitivity problem significantly. A new fatal-point concept is first introduced and used for normalizing the crash frequency in the proposed FSR measure of traffic safety. An extensive empirical study is conducted to validate and evaluate the fatal-point concept and the FSR measure using several clustering techniques. The 2015 North Carolina fatal crash data set of Fatality Analysis Reporting System is also adopted in this study. The experimental analysis shows that the traffic safety sensitivity can be significantly reduced and the FSR measure can quantify traffic safety better than the crash frequency measure by managing cluster variabilities.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Abdel-Aty, M., Pande, A.: Crash data analysis: collective vs. individual crash level approach. J. Saf. Res. 38(5), 581–587 (2007)

    Article  Google Scholar 

  2. Alkheder, S., Taamneh, M., Taamneh, S.: Severity prediction of traffic accident using an artificial neural network. J. Forecast. 36(1), 100–108 (2017)

    Article  MathSciNet  Google Scholar 

  3. Ankerst, M., Breunig, M.M., Kriegel, H.P., Sander, J.: Optics: ordering points to identify the clustering structure. In: ACM Sigmod Record, vol. 28, pp. 49–60. ACM (1999)

  4. Bezdek, J.C.: Objective function clustering. In: Pattern Recognition with Fuzzy Objective Function Algorithms, pp. 43–93. Springer (1981)

  5. Briand, A.S., Côme, E., Mohamed, K., Oukhellou, L.: A mixture model clustering approach for temporal passenger pattern characterization in public transport. Int. J. Data Sci. Anal. 1(1), 37–50 (2016)

    Article  Google Scholar 

  6. Chiou, Y.C., Fu, C.: Modeling crash frequency and severity using multinomial-generalized Poisson model with error components. Accid. Anal. Prev. 50, 73–82 (2013)

    Article  Google Scholar 

  7. Chiou, Y.C., Fu, C.: Modeling crash frequency and severity with spatiotemporal dependence. Anal. Methods Accid. Res. 5, 43–58 (2015)

    Article  Google Scholar 

  8. Claros, B., Sun, C., Edara, P.: Safety effectiveness and crash cost benefit of red light cameras in Missouri. Traffic Inj. Prev. 18(1), 70–76 (2017)

    Article  Google Scholar 

  9. Divaris, K., Vann, W.F., Baker, A.D., Lee, J.Y.: Examining the accuracy of caregivers’ assessments of young children’s oral health status. J. Am. Dent. Assoc. 143(11), 1237–1247 (2012)

    Article  Google Scholar 

  10. Dobrkovic, A., Iacob, M.E., van Hillegersberg, J.: Maritime pattern extraction and route reconstruction from incomplete AIS data. Int. J. Data Sci. Anal. 5(2–3), 111–136 (2018)

    Article  Google Scholar 

  11. DOT: Federal Highway Administration. https://safety.fhwa.dot.gov/local_rural/training/fhwasa14074/sec3.cfm (2013). Accessed 23 Aug 2018

  12. Endo, Y., Toda, H., Nishida, K., Ikedo, J.: Classifying spatial trajectories using representation learning. Int. J. Data Sci. Anal. 2(3–4), 107–117 (2016)

    Article  Google Scholar 

  13. Ester, M., Kriegel, H.P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD-96, vol. 96, pp. 226–231 (1996)

  14. Gitelman, V., Doveh, E., Hakkert, S.: Designing a composite indicator for road safety. Saf. Sci. 48(9), 1212–1224 (2010)

    Article  Google Scholar 

  15. Guo, F., Fang, Y.: Individual driver risk assessment using naturalistic driving data. Accid. Anal. Prev. 61, 3–9 (2013)

    Article  Google Scholar 

  16. Ivan, K., Haidu, I., Benedek, J., Ciobanu, S.: Identification of traffic accident risk-prone areas under low-light conditions. Nat. Hazards Earth Syst. Sci. 15(9), 2059–2068 (2015)

    Article  Google Scholar 

  17. Liu, Y., Li, Z., Liu, J., Patel, H.: Vehicular crash data used to rank intersections by injury crash frequency and severity. Data Brief 8, 930–933 (2016)

    Article  Google Scholar 

  18. Lovegrove, G.R., Sayed, T.: Macro-level collision prediction models for evaluating neighbourhood traffic safety. Can. J. Civ. Eng. 33(5), 609–621 (2006)

    Article  Google Scholar 

  19. Ma, X., Wu, Y.J., Wang, Y., Chen, F., Liu, J.: Mining smart card data for transit riders travel patterns. Transp. Res. Part C Emerg. Technol. 36, 1–12 (2013)

    Article  Google Scholar 

  20. Montella, A.: A comparative analysis of hotspot identification methods. Accid. Anal. Prev. 42(2), 571–581 (2010)

    Article  Google Scholar 

  21. NCDOT: North Carolina Department of Transportation. https://www.ncdot.gov/travel-maps/maps/Pages/state-transportation-map.aspx (2018). Accessed 23 Aug 2018

  22. NHTSA: National Highway Traffic Safety Administration, Fatality Analysis Reporting System. https://www.nhtsa.gov/research-data/fatality-analysis-reporting-system-fars (2016). Accessed 07 Aug 2017

  23. Park, S., Musey, K., Press, J., McFadden, J.: Exploring roundabouts safety and operation in the context of design consistency. Inst. Transp. Eng. ITE J. 85(6), 43 (2015)

    Google Scholar 

  24. Pour-Rouholamin, M., Jalayer, M.: Analyzing the severity of motorcycle crashes in North Carolina using highway safety information systems data. Inst. Transp. Eng. ITE J. 86(10), 45 (2016)

    Google Scholar 

  25. R Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna (2017). https://www.R-project.org/

  26. Ramos, L., Silva, L., Santos, M.Y., Pires, J.M.: Detection of road accident accumulation zones with a visual analytics approach. Procedia Comput. Sci. 64, 969–976 (2015)

    Article  Google Scholar 

  27. Schultz, G.G., Dudley, S.C., Saito, M.: Transportation safety data and analysis. Volume 3: Framework for highway safety mitigation and workforce development. Technical report (2011)

  28. StackExchange: Geographic Information Systems, Measuring Accuracy of Latitude and Longitude? https://gis.stackexchange.com/questions/8650/measuring-accuracy-of-latitude-and-longitude(2017). Accessed 07 Aug 2017

  29. Steinbach, M., Ertöz, L., Kumar, V.: The challenges of clustering high dimensional data. In: New Directions in Statistical Physics, pp. 273–309. Springer (2004)

  30. Suthaharan, S.: Machine Learning Models and Algorithms for Big Data Classification: Thinking with Examples for Effective Learning, vol. 36. Springer, Berlin (2015)

    MATH  Google Scholar 

  31. Suthaharan, S.: A correlation-based subspace analysis for data confidentiality and classification as utility in CPS. In: 2016 IEEE Conference on Communications and Network Security (CNS), pp. 426–431. IEEE (2016)

  32. Yannis, G., Papadimitriou, E., Antoniou, C.: Multilevel modelling for the regional effect of enforcement on road accidents. Accid. Anal. Prev. 39(4), 818–825 (2007)

    Article  Google Scholar 

  33. Zeeb, K., Buchner, A., Schrauf, M.: What determines the take-over time? An integrated model approach of driver take-over after automated driving. Accid. Anal. Prev. 78, 212–221 (2015)

    Article  Google Scholar 

  34. Zhang, Y., Xie, Y., Li, L.: Crash frequency analysis of different types of urban roadway segments using generalized additive model. J. Saf. Res. 43(2), 107–114 (2012)

    Article  Google Scholar 

Download references

Acknowledgements

The author sincerely thanks the anonymous referees, the associate editor, and the editor for their excellent comments that helped him improve this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shan Suthaharan.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Suthaharan, S. A low-sensitivity quantitative measure for traffic safety data analytics. Int J Data Sci Anal 9, 241–256 (2020). https://doi.org/10.1007/s41060-019-00179-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41060-019-00179-z

Navigation