Skip to main content
Log in

Can driving patterns predict identity and gender?

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

The advances in vehicle equipment technology enabled us easy and large-scale collection of high-volume vehicle driving data. This data is an important resource for urban area traffic management and vehicle driving support system applications. It has privacy aspects as well. In this study, we are interested in whether machine learning techniques are a real threat to driver re-identification from published CAN (Controller Area Network) bus driving data. To understand, on Uyanik dataset (Takeda et al. in IEEE Trans Intell Transp Syst 12:1609–1623, 2011), we develop machine learning models for driver gender and identity prediction, after a multi step data preprocessing methods of sampling, feature extraction, feature elimination and discretization. Best gender prediction classifiers reached up to 0.97 accuracy rate; and best driver identity prediction classifiers reached up to 0.1 accuracy rate for 105-class and 0.98 accuracy rate for 2-class driver identification tasks. Those high accuracy results, even on a single dataset, suggest that driving patters may indeed act as quasi-identifiers, and hence they should be treated as sensitive personal data. As a result, dissemination of driving data should be done according to non-trivial data privacy protection procedures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  • Abul O, Bayrak C (2018) From location to location pattern privacy in location-based services. Knowl Inf Syst 56(3):533–557

    Article  Google Scholar 

  • Abul O, Bonchi F, Nanni M (2008) Never walk alone: uncertainty for anonymity in moving objects databases. In: Proceedings of of 24th international conference on data engineering (ICDE)

  • Abut H, Erdogan H, Ercil A, Curuklu B, Koman HC, Tas F, Argunsah AO, Cosar S, Akan B, Karabalkan H, Cokelek E, Sezer V, Danis S, Karaca M, Abbak M, Uzunbas MG, Eritmen K, Kalayci C, Imamoglu M, Karabat C, Peyic M, Arslan B (2007) Data collection with “Uyanik” : too much pain; but gains are coming. In: Proceedings of the biennial on DSP for in-vehicle and mobile systems

  • Aljaafreh A, Alshabatat N, Najim Al-Din M (2012) Driving style recognition using fuzzy logic. In: 2012 IEEE international conference on vehicular electronics and safety, pp 460–463

  • Arbabzadeh N, Jafari M (2018) A data-driven approach for driving safety risk prediction using driver behavior and roadway information data. IEEE Trans Intell Transp Syst 19(2):446–460

    Article  Google Scholar 

  • Castignani G, Derrmann T, Frank R, Engel T (2015) Driver behavior profiling using smartphones: a low-cost platform for driver monitoring. IEEE Intell Transp Syst Mag 7(1):91–102

    Article  Google Scholar 

  • Cheung E, Bera A, Kubin E, Gray K, Manocha D (2018) Identifying driver behaviors using trajectory features for vehicle navigation. arXiv:180300881v2

  • Choi S, Kim J, Kwak D, Angkititrakul P, Hansen J (2007) Analysis and classification of driver behavior using in-vehicle can-bus information. Bienn Workshop DSP In-Veh Mob Syst

  • Choudhary AK, Ingole PK (2014) Smart phone based approach to monitor driving behavior and sharing of statistic. In: 2014 fourth international conference on communication systems and network technologies, pp 279–282

  • Christ M, Braun N, Neuffer J, Kempa-Liehr AW (2018) Time series feature extraction on basis of scalable hypothesis tests (tsfresh—a python package). Neurocomputing 307:72–77

    Article  Google Scholar 

  • Dorr D, Grabengiesser D, Gauterin F (2014) Online driving style recognition using fuzzy logic. In: 17th international IEEE conference on intelligent transportation systems (ITSC), pp 1021–1026

  • Eren H, Makinist S, Akin E, Yilmaz A (2012) Estimating driving behavior by a smartphone. In: IEEE intelligent vehicles symposium, pp 234–239

  • Fugiglando U, Massaro E, Santi P, Milardo S, Abida K, Stahlmann R, Netter F, Ratti C (2019) Driving behavior analysis through CAN bus data in an uncontrolled environment. IEEE Trans Intell Transp Syst 20(2):737–748

    Article  Google Scholar 

  • Gadepally V, Kurt A, Krishnamurthy A, Ozguner U (2011) Driver/vehicle state estimation and detection. In: IEEE conference on intelligent transportation systems, pp 582–587

  • Gadepally V, Krishnamurthy A, Ozguner U (2014) A framework for estimating driver decisions near intersections. IEEE Trans Intell Transp Syst 15(2):637–646

    Article  Google Scholar 

  • Gurung S, Lin D, Jiang W, Hurson A, Zhang R (2014) Traffic information publication with privacy preservation. ACM Trans Intell Syst Technol 5(3):44:1–44:26

    Article  Google Scholar 

  • Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. SIGKDD Explor Newsl 11(1):10–18

    Article  Google Scholar 

  • Hoh B, Iwuchukwu T, Jacobson Q, Work D, Bayen AM, Herring R, Herrera J, Gruteser M, Annavaram M, Ban J (2012) Enhancing privacy and accuracy in probe vehicle-based traffic monitoring via virtual trip lines. IEEE Trans Mob Comput 11(5):849–864

    Article  Google Scholar 

  • Igarashi K, Miyajima C, Itou K, Takeda K, Itakura F, Abut H (2004) Biometric identification using driving behavioral signals. In: 2004 IEEE international conference on multimedia and expo, vol 1, pp 65–68

  • Kalsoom R, Halim Z (2013) Clustering the driving features based on data streams. In: IEEE international multi topic conference (INMIC), pp 89–94

  • Krumm J (2009) A survey of computational location privacy. Pers Ubiquit Comput 13:391–399

    Article  Google Scholar 

  • Ly MV, Martin S, Trivedi MM (2013) Driver classification and driving style recognition using inertial sensors. In: 2013 IEEE intelligent vehicles symposium (IV), pp 1040–1045

  • Machanavajjhala A, Gehrke J, Kifer D, Venkitasubramaniam M (2006) \(l\)-diversity: privacy beyond \(k\)-anonymity. In: Proceedings of the 22nd international conference on data engineering (ICDE’06)

  • Meiring GAM, Myburgh HC (2015) A review of intelligent driving style analysis systems and related artificial intelligence algorithms. Sensors 15(12):30653–30682

    Article  Google Scholar 

  • Mian M, Jaffry W (2019) Modeling of individual differences in driver behavior. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-019-01313-2

    Article  Google Scholar 

  • Miyajima C, Nishiwaki Y, Ozawa K, Wakita T, Itou K, Takeda K (2006) Cepstral analysis of driving behavioral signals for driver identification. In: 2006 IEEE international conference on acoustics speech and signal processing proceedings

  • Mohamad I, Ali MAM, Ismail M (2011) Abnormal driving detection using real time global positioning system data. In: Proceeding of the 2011 IEEE international conference on space science and communication (IconSpace)

  • Nunez-del Prado M, Nin J (2019) Revisiting online anonymization algorithms to ensure location privacy. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-019-01371-6

    Article  Google Scholar 

  • O’Leary DE (1991) Knowledge discovery as a threat to database security. In: Piatetsky-Shapiro G, Frawley WJ (eds) Knowledge discovery in databases. AAAI/MIT Press, Cambridge, pp 507–516

    Google Scholar 

  • Ozcakmak K (2011) Analysis of experimental data collected by drivesafe vehicle uyanik. Master’s thesis, Istanbul Technical University Mechatronics Engineering Department, Istanbul

  • Ozturk E (2010) Driver status identification from driving behavior signals. Master’s thesis, Koc University Electronic and Computer Engineering Department, Istanbul

  • Saleh K, Hossny M, Nahavandi S (2017) Driving behavior classification based on sensor data fusion using LSTM recurrent neural networks. In: IEEE 20th international conference on intelligent transportation systems (ITSC)

  • Sweeney L (2002) k-anonymity: a model for protecting privacy. Int J Uncertain Fuzziness Knowl Based Syst 10(05):557–570

    Article  MathSciNet  Google Scholar 

  • Takeda K, Hansen HLJ, Boyraz P, Malta L, Miyajima C, Abut H (2011) International large-scale vehicle corpora for research on driver behavior on the road. IEEE Trans Intell Transp Syst 12:1609–1623

    Article  Google Scholar 

  • Tango F, Botta M (2013) Real-time detection system of driver distraction using machine learning. IEEE Trans Intell Transp Syst 14(2):894–905

    Article  Google Scholar 

  • Wahlstrom J, Skog I, Handel P (2017) Smartphone-based vehicle telematics: a ten-year anniversary. IEEE Trans Intell Transp Syst 18:2802–2825

    Article  Google Scholar 

  • Wan C, Zhang J (2018) Efficient identity-based data transmission for vanet. J Ambient Intell Humaniz Comput 9(6):1861–1871

    Article  Google Scholar 

  • Wijayagunawardhane NRB, Jinasena SD, Sandaruwan CB, Dharmapriya WANS, Samarasinghe R (2013) SmartV: intelligent vigilance monitoring based on sensor fusion and driving dynamics. In: 2013 IEEE 8th international conference on industrial and information systems, pp 507–512

  • Zardosht M, Beauchemin S, Bauer MA (2018) Identifying driver behavior in preturning maneuvers using in-vehicle CANbus signals. J Adv Transp 2018:1–10

    Article  Google Scholar 

  • Zhang K, Ni J, Yang K, Liang X, Ren J, Shen XS (2017) Security and privacy in smart city applications: challenges and solutions. IEEE Commun Mag 55(1):122–129

    Article  Google Scholar 

  • Zheng Y, Shi X, Sathyanarayana A, Shokouhi N, Hansen JHL (2015) In-vehicle speech recognition and tutorial keywords spotting for novice drivers’ performance evaluation. In: 2015 IEEE intelligent vehicles symposium (IV), pp 168–173

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Osman Abul.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Abul, O., Karatas, B. Can driving patterns predict identity and gender?. J Ambient Intell Human Comput 12, 151–166 (2021). https://doi.org/10.1007/s12652-019-01457-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-019-01457-1

Keywords

Navigation