Abstract
The advances in vehicle equipment technology enabled us easy and large-scale collection of high-volume vehicle driving data. This data is an important resource for urban area traffic management and vehicle driving support system applications. It has privacy aspects as well. In this study, we are interested in whether machine learning techniques are a real threat to driver re-identification from published CAN (Controller Area Network) bus driving data. To understand, on Uyanik dataset (Takeda et al. in IEEE Trans Intell Transp Syst 12:1609–1623, 2011), we develop machine learning models for driver gender and identity prediction, after a multi step data preprocessing methods of sampling, feature extraction, feature elimination and discretization. Best gender prediction classifiers reached up to 0.97 accuracy rate; and best driver identity prediction classifiers reached up to 0.1 accuracy rate for 105-class and 0.98 accuracy rate for 2-class driver identification tasks. Those high accuracy results, even on a single dataset, suggest that driving patters may indeed act as quasi-identifiers, and hence they should be treated as sensitive personal data. As a result, dissemination of driving data should be done according to non-trivial data privacy protection procedures.
Similar content being viewed by others
References
Abul O, Bayrak C (2018) From location to location pattern privacy in location-based services. Knowl Inf Syst 56(3):533–557
Abul O, Bonchi F, Nanni M (2008) Never walk alone: uncertainty for anonymity in moving objects databases. In: Proceedings of of 24th international conference on data engineering (ICDE)
Abut H, Erdogan H, Ercil A, Curuklu B, Koman HC, Tas F, Argunsah AO, Cosar S, Akan B, Karabalkan H, Cokelek E, Sezer V, Danis S, Karaca M, Abbak M, Uzunbas MG, Eritmen K, Kalayci C, Imamoglu M, Karabat C, Peyic M, Arslan B (2007) Data collection with “Uyanik” : too much pain; but gains are coming. In: Proceedings of the biennial on DSP for in-vehicle and mobile systems
Aljaafreh A, Alshabatat N, Najim Al-Din M (2012) Driving style recognition using fuzzy logic. In: 2012 IEEE international conference on vehicular electronics and safety, pp 460–463
Arbabzadeh N, Jafari M (2018) A data-driven approach for driving safety risk prediction using driver behavior and roadway information data. IEEE Trans Intell Transp Syst 19(2):446–460
Castignani G, Derrmann T, Frank R, Engel T (2015) Driver behavior profiling using smartphones: a low-cost platform for driver monitoring. IEEE Intell Transp Syst Mag 7(1):91–102
Cheung E, Bera A, Kubin E, Gray K, Manocha D (2018) Identifying driver behaviors using trajectory features for vehicle navigation. arXiv:180300881v2
Choi S, Kim J, Kwak D, Angkititrakul P, Hansen J (2007) Analysis and classification of driver behavior using in-vehicle can-bus information. Bienn Workshop DSP In-Veh Mob Syst
Choudhary AK, Ingole PK (2014) Smart phone based approach to monitor driving behavior and sharing of statistic. In: 2014 fourth international conference on communication systems and network technologies, pp 279–282
Christ M, Braun N, Neuffer J, Kempa-Liehr AW (2018) Time series feature extraction on basis of scalable hypothesis tests (tsfresh—a python package). Neurocomputing 307:72–77
Dorr D, Grabengiesser D, Gauterin F (2014) Online driving style recognition using fuzzy logic. In: 17th international IEEE conference on intelligent transportation systems (ITSC), pp 1021–1026
Eren H, Makinist S, Akin E, Yilmaz A (2012) Estimating driving behavior by a smartphone. In: IEEE intelligent vehicles symposium, pp 234–239
Fugiglando U, Massaro E, Santi P, Milardo S, Abida K, Stahlmann R, Netter F, Ratti C (2019) Driving behavior analysis through CAN bus data in an uncontrolled environment. IEEE Trans Intell Transp Syst 20(2):737–748
Gadepally V, Kurt A, Krishnamurthy A, Ozguner U (2011) Driver/vehicle state estimation and detection. In: IEEE conference on intelligent transportation systems, pp 582–587
Gadepally V, Krishnamurthy A, Ozguner U (2014) A framework for estimating driver decisions near intersections. IEEE Trans Intell Transp Syst 15(2):637–646
Gurung S, Lin D, Jiang W, Hurson A, Zhang R (2014) Traffic information publication with privacy preservation. ACM Trans Intell Syst Technol 5(3):44:1–44:26
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. SIGKDD Explor Newsl 11(1):10–18
Hoh B, Iwuchukwu T, Jacobson Q, Work D, Bayen AM, Herring R, Herrera J, Gruteser M, Annavaram M, Ban J (2012) Enhancing privacy and accuracy in probe vehicle-based traffic monitoring via virtual trip lines. IEEE Trans Mob Comput 11(5):849–864
Igarashi K, Miyajima C, Itou K, Takeda K, Itakura F, Abut H (2004) Biometric identification using driving behavioral signals. In: 2004 IEEE international conference on multimedia and expo, vol 1, pp 65–68
Kalsoom R, Halim Z (2013) Clustering the driving features based on data streams. In: IEEE international multi topic conference (INMIC), pp 89–94
Krumm J (2009) A survey of computational location privacy. Pers Ubiquit Comput 13:391–399
Ly MV, Martin S, Trivedi MM (2013) Driver classification and driving style recognition using inertial sensors. In: 2013 IEEE intelligent vehicles symposium (IV), pp 1040–1045
Machanavajjhala A, Gehrke J, Kifer D, Venkitasubramaniam M (2006) \(l\)-diversity: privacy beyond \(k\)-anonymity. In: Proceedings of the 22nd international conference on data engineering (ICDE’06)
Meiring GAM, Myburgh HC (2015) A review of intelligent driving style analysis systems and related artificial intelligence algorithms. Sensors 15(12):30653–30682
Mian M, Jaffry W (2019) Modeling of individual differences in driver behavior. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-019-01313-2
Miyajima C, Nishiwaki Y, Ozawa K, Wakita T, Itou K, Takeda K (2006) Cepstral analysis of driving behavioral signals for driver identification. In: 2006 IEEE international conference on acoustics speech and signal processing proceedings
Mohamad I, Ali MAM, Ismail M (2011) Abnormal driving detection using real time global positioning system data. In: Proceeding of the 2011 IEEE international conference on space science and communication (IconSpace)
Nunez-del Prado M, Nin J (2019) Revisiting online anonymization algorithms to ensure location privacy. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-019-01371-6
O’Leary DE (1991) Knowledge discovery as a threat to database security. In: Piatetsky-Shapiro G, Frawley WJ (eds) Knowledge discovery in databases. AAAI/MIT Press, Cambridge, pp 507–516
Ozcakmak K (2011) Analysis of experimental data collected by drivesafe vehicle uyanik. Master’s thesis, Istanbul Technical University Mechatronics Engineering Department, Istanbul
Ozturk E (2010) Driver status identification from driving behavior signals. Master’s thesis, Koc University Electronic and Computer Engineering Department, Istanbul
Saleh K, Hossny M, Nahavandi S (2017) Driving behavior classification based on sensor data fusion using LSTM recurrent neural networks. In: IEEE 20th international conference on intelligent transportation systems (ITSC)
Sweeney L (2002) k-anonymity: a model for protecting privacy. Int J Uncertain Fuzziness Knowl Based Syst 10(05):557–570
Takeda K, Hansen HLJ, Boyraz P, Malta L, Miyajima C, Abut H (2011) International large-scale vehicle corpora for research on driver behavior on the road. IEEE Trans Intell Transp Syst 12:1609–1623
Tango F, Botta M (2013) Real-time detection system of driver distraction using machine learning. IEEE Trans Intell Transp Syst 14(2):894–905
Wahlstrom J, Skog I, Handel P (2017) Smartphone-based vehicle telematics: a ten-year anniversary. IEEE Trans Intell Transp Syst 18:2802–2825
Wan C, Zhang J (2018) Efficient identity-based data transmission for vanet. J Ambient Intell Humaniz Comput 9(6):1861–1871
Wijayagunawardhane NRB, Jinasena SD, Sandaruwan CB, Dharmapriya WANS, Samarasinghe R (2013) SmartV: intelligent vigilance monitoring based on sensor fusion and driving dynamics. In: 2013 IEEE 8th international conference on industrial and information systems, pp 507–512
Zardosht M, Beauchemin S, Bauer MA (2018) Identifying driver behavior in preturning maneuvers using in-vehicle CANbus signals. J Adv Transp 2018:1–10
Zhang K, Ni J, Yang K, Liang X, Ren J, Shen XS (2017) Security and privacy in smart city applications: challenges and solutions. IEEE Commun Mag 55(1):122–129
Zheng Y, Shi X, Sathyanarayana A, Shokouhi N, Hansen JHL (2015) In-vehicle speech recognition and tutorial keywords spotting for novice drivers’ performance evaluation. In: 2015 IEEE intelligent vehicles symposium (IV), pp 168–173
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Abul, O., Karatas, B. Can driving patterns predict identity and gender?. J Ambient Intell Human Comput 12, 151–166 (2021). https://doi.org/10.1007/s12652-019-01457-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-019-01457-1