Abstract
Association rule mining is an effective data mining technique which has been used widely in health informatics research right from its introduction. Since health informatics has received a lot of attention from researchers in last decade, and it has developed various sub-domains, so it is interesting as well as essential to review state of the art health informatics research. As knowledge discovery researchers and practitioners have applied an array of data mining techniques for knowledge extraction from health data, so the application of association rule mining techniques to health informatics domain has been focused and studied in detail in this survey. Through critical analysis of applications of association rule mining literature for health informatics from 2005 to 2014, it has been explored that, instead of the more efficient alternative approaches, the Apriori algorithm is still a widely used frequent itemset generation technique for application of association rule mining for health informatics. Moreover, other limitations related to applications of association rule mining for health informatics have also been identified and recommendations have been made to mitigate those limitations. Furthermore, the algorithms and tools utilized for application of association rule mining have also been identified, conclusions have been drawn from the literature surveyed, and future research directions have been presented.
Similar content being viewed by others
Notes
EuroMISE Project STULONG, http://euromise.vse.cz/stulong-en/index.php. Accessed \(5{\mathrm{th}}\) December 2014.
Cleveland Heart Disease Dataset, https://archive.ics.uci.edu/ml/datasets/Heart+Disease. Accessed \(5{\mathrm{th}}\) December 2014.
TREC 2011 Medical Track Dataset, http://trec.nist.gov/data/medical2011.html. Accessed 5th December 2014.
References
Anwar MA, Ahmed N (2014) Analyzing lifestyle and environmental factors on semen fertility using association rule mining. Inf Knowl Manag 4(2):15–21
Babashzadeh A, Daoud M, Huang J (2013) Using semantic-based association rule mining for improving clinical text retrieval. Health Information Science, Springer, Berlin, pp 186–197
Badrinath N, Gopinath G, Ravichandran KS, Soundhar RG (2016) Estimation of automatic detection of erythemato-squamous diseases through adaboost and its hybrid classifiers. Artificial Intelligence Review, Springer Science+Business Media Dordrecht 45:471–488. doi:10.1007/s10462-015-9436-8
Berka P, Rauch J (2010) Mining and post-processing of association rules in the atherosclerosis risk domain. Information Technology in Bio-and Medical Informatics, Springer, Berlin, pp 110–117
Bouker S, Saidi R, Ben Yahia S, Mephu Nguifo E (2014) Mining undominated association rules through interestingness measures. Int J Artif Intell Tools 23(04):1460011
Bouker S, Saidi R, Ben-Yahia S, Mephu-Nguifo E (2012) Ranking and selecting association rules based on dominance relationship. In: 2012 IEEE 24th international conference on tools with artificial intelligence, pp 658–665
Bouker S, Saidi R, Ben-Yahia S, Mephu-Nguifo E (2013) Towards a semantic and statistical selection of association rules. arXiv preprint arXiv:1305.5824
Buczak AL, Koshute PT, Babin SM, Feighner BH, Lewis SH (2012) A data-driven epidemiological prediction method for dengue outbreaks using local and remote sensing data. BMC Med Inf Decis Making 12:124. doi:10.1186/1472-6947-12-124
Chen Y, Li F, Fan J (2015) Mining association rules in big data with NGEP. Cluster Comput 18(2):577–585. doi:10.1007/s10586-014-0419-3
Coira E (2003) Guide to health informatics. CRC Press, Boca Raton
Concaro S, Sacchi L, Cerra C, Fratino P, Bellazzi P (2011) Mining health care administrative data with temporal association rules on hybrid events. Methods Inf Med 50(2):166
Concaro S, Sacchi L, Cerra C, Bellazzi R (2009a) Mining administrative and clinical diabetes data with temporal association rules. MIE August 2009, pp 574–578
Concaro S, Sacchi L, Cerra C, Fratino P, Bellazzi R (2009b) Mining healthcare data with temporal association rules: Improvements and assessment for a practical use. Artificial Intelligence in Medicine. Springer, Berlin, pp 16–25
Concaro S, Sacchi L, Cerra C, Stefanelli M, Fratino P, Bellazzi R (2009c) Temporal data mining for the assessment of the costs related to diabetes mellitus pharmacological treatment. In: 2009 AMIA annual symposium, San Francisco, pp 119–123
Faghihi U, Fournier-Viger P, Nkambou R (2012) A computational model for causal learning in cognitive agents. Knowl Based Syst 30:48–56
Fournier-Viger P, Faghihi U, Nkambou R, Mephu Nguifo E (2012) CMRules: mining sequential rules common to several sequences. Knowl Based Syst 25(1):63–76
Fürnkranz J, Kliegr T (2015) A brief overview of rule learning. In: Rule technologies: foundations, tools, and applications. Springer International Publishing, New York, pp 54–69
Glott R, Husmann E, Sadeghi AR, Schunter M (2011) Trustworthy clouds underpinning the future internet. Springer, Berlin, pp 209–221. doi:10.1007/978-3-642-20898-0_15
Go E, Lee S, Yoon T (2014) Analysis of Ebolavirus with decision tree and Apriori algorithm. Int J Mach Learn Comput 4(6):543–546. doi:10.7763/IJMLC.2014.V4.470
Gosain A, Kumar A (2009) Analysis of health care data using different data mining techniques. In: International conference on intelligent agent & multi-agent systems, pp 1–6
Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. ACM SIGMOD Record 29(2):1–12
Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann Publishers, Los Altos
Ilayaraja M, Meyyappan T (2013) Mining medical data to identify frequent diseases using Apriori algorithm. In: 2013 International conference on pattern recognition, informatics and mobile engineering (PRIME), pp 194–199
Iqbal S, Altaf W, Aslam M, Mahmood W, Khan MUG (2016) Application of intelligent agents in health-care: review. Artificial Intelligence Review. Springer Science+Business Media, Dordrecht, pp 1–30. doi:10.1007/s10462-016-9457-y
Jabbar MA, Chandra P, Deekshatulu BL (2012) Knowledge discovery from mining association rules for heart disease prediction. J Theor Appl Inf Technol 41(2):45–53
Kamsu-Foguem B, Rigal F, Mauget F (2013) Mining association rules for the quality improvement of the production process. Expert Syst Appl 40(4):1034–1045
Kavipriya A, Gomathy B (2013) Data mining applications in medical image mining: an analysis of breast cancer using weighted rule mining and classifiers. IOSR J Comput Eng 8(4):18–23
Köksal G, Batmaz I, Testik MC (2011) A review of data mining applications for quality improvement in manufacturing industry. Expert Syst Appl 38(10):13448–13467
Kuo YT, Lonie A, Pearce AR, Sonenberg L (2014) Mining surprising patterns and their explanations in clinical data. Appl Artif Intel 28(2):111–138
Lee DG, Ryu KS, Bashir M, Bae JW, Ryu KH (2013) Discovering medical knowledge using association rule mining in young adults with acute myocardial infarction. J Med Syst 37(2):1–10
Li X (2014) An algorithm for mining frequent itemsets from library big data. J Softw 9(9):2361–2365. doi:10.4304/jsw.9.9.2361-2365
Lin YC, Wu CW, Tseng VS (2015) Mining high utility itemsets in big data. Adv Knowl Discov Data Mining 9078:649–661. doi:10.1007/978-3-319-18032-8_51
Li H, Wang Y, Zhang D, Zhang M, Chang EY (2008) Pfp: parallel fp-growth for query recommendation. In: 2008 ACM conference on recommender systems. ACM, New York, pp 107–114. doi:10.1145/1454008.1454027
Mahgoub H, Rösner D, Ismail N, Torkey F (2008) A text mining technique using association rules extraction. International J Comput Intel 4(1):21–28
Mahmood S, Shahbaz M, Rehman ZU (2013) Extraction of positive and negative association rules from text: a temporal approach. Pak J Sci 65(3):407–413
Mahmood S, Shahbaz M, Guergachi A (2014) Negative and positive association rules mining from text using frequent and infrequent itemsets. Sci World J. doi:10.1155/2014/973750
Maquee A, Shojaie AA, Mosaddar D (2012) Clustering and association rules in analyzing the efficiency of maintenance system of an urban bus network. Int J Syst Assur Eng Manag 3(3):175–183
McCormick T, Rudin C, Madigan D (2011) A hierarchical model for association rule mining of sequential events: an approach to automated medical symptom prediction. SSRN eLibrary. doi:10.2139/ssrn.1736062
Mirabadi A, Sharifian S (2010) Application of association rules in iranian railways (rai) accident data analysis. Safety Sci 48(10):1427–1435
Moens S, Aksehirli E, Goethals B (2013) Frequent itemset mining for big data. In: IEEE international conference on big data, pp 111–118. doi:10.1109/BigData.2013.6691742
Moradi M, Keyvanpour MR (2015) An analytical review of XML association rules mining. Artificial Intelligence Review, Springer Science+Business Media, Dordrecht 43(2):277–300. doi:10.1007/s10462-012-9376-5
Nkambou R, Fournier-Viger P, Mephu Nguifo E (2011) Learning task models in ill-defined domain using an hybrid knowledge discovery framework. Knowl Based Syst 24(1):176–185
Ogasawara M, Sugimori H, Iida Y, Yoshida K (2005) Analysis between lifestyle, family medical history and medical abnormalities using data mining method—association rule analysis. Knowledge-Based Intelligent Information and Engineering Systems. Springer, Berlin, pp 161–171
Ohsaki M, Abe H, Tsumoto S, Yokoi H, Yamaguchi T (2007) Evaluation of rule interestingness measures in medical knowledge discovery in databases. Artif Intell Med 41(3):177–196
Oliveira T, Novais P, Neves J (2014) Development and implementation of clinical guidelines: an artificial intelligence perspective. Artificial Intelligence Review, Springer Science+Business Media, Dordrecht 42(4):999–1027. doi:10.1007/s10462-012-9376-5
Ordonez C, Ezquerra N, Santana CA (2006) Constraining and summarizing association rules in medical data. Knowl Inf Syst 9(3):1–2
Pan H, Li J, Wei Z (2005) Mining interesting association rules in medical images. Advanced data mining and applications. Springer, Berlin
Park SH, Jang SY, Kim H, Lee SW (2014) An association rule mining-based framework for understanding lifestyle risk behaviors. PloS one 9(2):e88859
Payus C, Sulaiman N, Shahani M, Bakar AA (2013) Association rules of data mining application for respiratory illness by air pollution database. Int J Basic Appl Sci 13(3):11–16
Radu A, Costan A, Iancu B, Dadarlat V, Peculea A (2015) Intercloud platform for connecting and managing heterogeneous services with applications for e-health. In: 2015 Conference on grid, cloud & high performance computing in science (ROLCG), Cluj-Napoca. doi:10.1109/ROLCG.2015.7367229
Raheja V, Rajan KS (2012) Comparative study of association rule mining and MiSTIC in extracting spatio-temporal disease occurrences patterns. In: 2012 IEEE 12th international conference on data mining workshops (ICDMW), pp 813–820
Rajendran P, Madheswaran M (2010) An improved image mining technique for brain tumour classification using efficient classifier. arXiv preprint arXiv:1001.1988
Rashid MA, Hoque MT, Sattar A (2014) Association rules mining based clinical observations. arXiv preprint arXiv:1401.2571
Ribeiro MX, Bugatti PH, Traina C Jr, Marques P, Rosa NA, Traina AM (2009) Supporting content-based image retrieval and computer-aided diagnosis systems with association rule-based techniques. Data Knowl Eng 68(12):1370–1382
Ruiz PP, Kamsu-Foguem B, Grabot B (2014) Generating knowledge in maintenance from Experience Feedback. Knowl Based Syst. doi:10.1016/j.knosys.2014.02.002
Russell S, Norvig P (2009) Artificial intelligence: a modern approach, 3rd edn. Prentice Hall, Englewood Cliffs
Savel TG, Foldy S (2012) The role of public health informatics in enhancing public health surveillance. MMWR Surveill Summ 61:20–24
Sharma N, Om H (2014) Significant patterns for oral cancer detection: association rule on clinical examination and history data. Netw Model Anal Health Inf Bioinf 3(1):1–13
Soni J, Ansari U, Sharma D, Soni S (2011) Intelligent and effective heart disease prediction system using weighted associative classifiers. Int J Comput Sci Eng 3(6):2385–2392
Soni S, Vyas OP (2010) Using associative classifiers for predictive analysis in health care data mining. Int J Comput Appl 4(5):33–37
Srinivasan S, Ramakrishnan S (2011) Evolutionary multi objective optimization for rule mining: a review. Artificial Intelligence Review, Springer Science+Business Media B.V 36(3):205–248. doi:10.1007/s10462-011-9212-3
Srinivas K, Rao GR, Govardhan A (2012) Mining association rules from large datasets towards disease prediction. Int Proc Comput Sci Inf Technol 27:22–26
Thangam M, Vanniappan B (2015) Mining association rules in dengue gene sequence with latent periodicity. Comput Biol J. doi:10.1155/2015/839692
Vukićević M, Radovanović S, Milovanović M, Minović M (2014) Cloud based metalearning system for predictive modeling of biomedical data. Sci World J. doi:10.1155/2014/859279
Wang C, Guo XJ, Xu JF, Wu C, Sun YL, Ye XF, Qian W, Ma XQ, Du WM, He J (2012) Exploration of the association rules mining technique for the signal detection of adverse drug events in spontaneous reporting systems. PloS ONE 7(7):e40561
Xianhai J, Cunxi X (2009) Home health telemonitoring system based on data mining. Int Forum Inf Technol Appl 2:431–434
Yu L (2009) Association rules based data mining on test data of physical health standard. Int Joint Conf Comput Sci Optimiz 2:322–324
Zaharia M, Chowdhury M, Franklin MJ, Shenker S, Stoica I (2010) Spark: cluster computing with working sets. In: HotCloud’10 2nd USENIX conference on Hot topics in cloud computing, p 10
Zitouni M, Akbarinia R, Yahia SB, Masseglia F (2015) A prime number based approach for closed frequent itemset mining in big data. In: 26th International conference on database and expert systems applications (DEXA’2015), vol 9261, pp 509–516. doi:10.1007/978-3-22849-5_35
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Altaf, W., Shahbaz, M. & Guergachi, A. Applications of association rule mining in health informatics: a survey. Artif Intell Rev 47, 313–340 (2017). https://doi.org/10.1007/s10462-016-9483-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-016-9483-9