Abstract
With advances in machine learning algorithms and the pervasiveness of network terminals, online medical primary diagnosis schemes, which can provide primary diagnosis services anywhere and at any time, have attracted considerable interest recently. However, the wide adoption of such schemes still faces many challenges, including information security and privacy preservation. In this paper, we propose an efficient and privacy-preserving medical primary diagnosis scheme, called PDiag, based on naive Bayes classification. With PDiag, sensitive personal health information can be processed without privacy disclosure during the online medical primary diagnosis service. Specifically, based on an improved expression for the naive Bayes classifier, an efficient and privacy-preserving classification scheme is introduced using a lightweight polynomial aggregation technique. The encrypted user query is processed directly at the service provider without decryption, and the diagnosis result can be decrypted only by the user. Through extensive analysis, we show that PDiag keeps both users’ health information and the service provider’s prediction model confidential, and incurs significantly less computation and communication overhead than existing schemes. In addition, performance evaluations of a PDiag implementation on a smartphone and a computer demonstrate its effectiveness in a real environment.
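For readers unfamiliar with the underlying classifier, the following is a minimal plaintext sketch of a categorical naive Bayes classifier in log form. It is not the PDiag protocol: it contains no encryption and no polynomial aggregation, and the function names, the Laplace smoothing parameter `alpha`, and the toy feature values are all assumptions made for illustration. It only shows that a class score reduces to a sum of per-feature table lookups, which is the kind of additive structure an aggregation-based scheme can build on.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(samples, labels, alpha=1.0):
    """Estimate log-priors and per-feature log-likelihood tables.

    samples: list of tuples of categorical feature values (hypothetical data).
    labels:  list of class labels, one per sample.
    alpha:   Laplace smoothing constant (an assumption, not from the paper).
    """
    n = len(labels)
    class_counts = Counter(labels)
    num_features = len(samples[0])
    # Distinct values observed at each feature position (used for smoothing).
    values = [set(row[j] for row in samples) for j in range(num_features)]
    # counts[c][j][v] = number of class-c samples whose feature j equals v.
    counts = defaultdict(lambda: [Counter() for _ in range(num_features)])
    for row, c in zip(samples, labels):
        for j, v in enumerate(row):
            counts[c][j][v] += 1
    log_prior = {c: math.log(k / n) for c, k in class_counts.items()}
    log_like = {
        c: [
            {
                v: math.log(
                    (counts[c][j][v] + alpha)
                    / (class_counts[c] + alpha * len(values[j]))
                )
                for v in values[j]
            }
            for j in range(num_features)
        ]
        for c in class_counts
    }
    return log_prior, log_like


def classify(query, log_prior, log_like):
    """Return (best_class, scores); each score is a sum of table lookups.

    Feature values never seen in training raise KeyError in this sketch.
    """
    scores = {
        c: log_prior[c] + sum(log_like[c][j][v] for j, v in enumerate(query))
        for c in log_prior
    }
    return max(scores, key=scores.get), scores
```

In a privacy-preserving setting, the lookups and the sum would have to be carried out over protected data so that the service provider never sees the query and the user never sees the tables; how PDiag achieves this is described in the body of the paper, not in this sketch.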
Acknowledgments
This work was financially supported by the National Natural Science Foundation of China under Grant 61303218, Grant 6167241 and Grant U1401251, National Key Research and Development Program of China under Grant 2016YFB0800804, Natural Science Basic Research Plan in Shaanxi Province of China under Grant 2016JM6007, Research Foundations for the Central Universities of China under Grant JB161507, and China 111 Project under Grant B16037. We would like to thank the anonymous reviewers for their insightful comments and suggestions.
Cite this article
Liu, X., Zhu, H., Lu, R. et al. Efficient privacy-preserving online medical primary diagnosis scheme on naive bayesian classification. Peer-to-Peer Netw. Appl. 11, 334–347 (2018). https://doi.org/10.1007/s12083-016-0506-8
DOI: https://doi.org/10.1007/s12083-016-0506-8