Abstract
In this paper, we propose a rule-based classification method that uses artificial missing values to improve the effectiveness and precision of medical data analysis. Artificial missing values are used to avoid the sharp boundary problem that arises when continuous variables are discretized: attribute values near a discretization boundary are treated as missing values. We evaluated the proposed method on medical data, and the experimental results show that it is effective for classification. The method can reduce the number of rules required to build a classifier. It may also make it possible to control the trade-off between false positives and true positives in rule-based classifiers.
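The boundary-handling idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `discretize_with_margin`, the single two-bin boundary, the `margin` parameter, and the sample values are all assumptions made for illustration. The sketch only shows how values close to a cut point might be marked as missing (`None`) instead of being forced into one of the two bins.

```python
from typing import List, Optional


def discretize_with_margin(
    values: List[float], boundary: float, margin: float
) -> List[Optional[str]]:
    """Discretize values into "low"/"high" around a single boundary.

    Values within `margin` of the boundary are returned as None, i.e. they
    are treated as artificial missing values (hypothetical helper, assumed
    here for illustration only).
    """
    labels: List[Optional[str]] = []
    for v in values:
        if abs(v - boundary) <= margin:
            # Too close to the cut point: treat as missing.
            labels.append(None)
        elif v < boundary:
            labels.append("low")
        else:
            labels.append("high")
    return labels


# Hypothetical measurements and cut point, not taken from the paper.
measurements = [95.0, 118.0, 121.0, 124.0, 150.0]
labels = discretize_with_margin(measurements, boundary=120.0, margin=3.0)
print(labels)  # ['low', None, None, 'high', 'high']
```

In this sketch, widening `margin` marks more values as missing, which is one plausible way a rule miner that tolerates incomplete data could be made less sensitive to the exact position of the cut point.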
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Shimada, K., Arahira, T., Hanioka, T. (2017). Association Rule-based Classifier Using Artificial Missing Values. In: Perner, P. (ed.) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2017. Lecture Notes in Computer Science, vol 10357. Springer, Cham. https://doi.org/10.1007/978-3-319-62701-4_5
DOI: https://doi.org/10.1007/978-3-319-62701-4_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62700-7
Online ISBN: 978-3-319-62701-4
eBook Packages: Computer Science (R0)