Synonyms
Definition
When inducing decision trees or decision rules from real-world data, many different aspects must be taken into account. One important aspect, in particular, is the processing of missing (unknown) attribute values. In machine learning (ML), instances (objects, observations) are usually represented by a list of attribute values; such a list commonly has a fixed length (i.e., a fixed number of attributes).
The topic of missing attribute values has been analyzed in the field of ML in many papers (Brazdil and Bruha 1992; Bruha and Franek 1996; Karmaker and Kwer 2005; Long and Zhang 2004; Quinlan 1986, 1989). Grzymala-Busse (2003) and Li and Cercone (2006) discuss the treatment of missing attribute values using the rough set strategies.
There are a few directions in which missing (unknown) attribute values as well as the corresponding routines for their processing may be studied and designed. First, the source of...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Brazdil PB, Bruha I (1992) A note on processing missing attribute values: a modified technique. In: Workshop on machine learning, Canadian conference AI, Vancouver
Bruha I (2003) Unknown attribute value processing by domain-specific external expert knowledge. In: 7th WSEAS international conference on systems, Corfu
Bruha I (2004) Meta-learner for unknown attribute values processing: dealing with inconsistency of meta-databases. J Intell Inf Syst 22(1):71–84
Bruha I, Franek F (1996) Comparison of various routines for unknown attribute value processing: covering paradigm. Int J Pattern Recognit Artif Intell 10(8):939–955
Bruha I, Berka P (2000) Discretization and fuzzification of numerical attributes in attribute-based learning. In: Szczepaniak PS, Lisboa PJG, Kacprzyk J (eds) Fuzzy systems in medicine. Physica/Springer, Heidelberg/New York, pp 112–138
Bruha I, Kockova S (1994) A support for decision making: cost-sensitive learning system. Artif Intell Med 6: 67–82
Clark P, Niblett T (1989) The CN2 induction algorithm. Mach Learn 3:261–283
Fan DW, Chan PK, Stolfo SJ (1996) A comparative evaluation of combiner and stacked generalization. In: Workshop integrating multiple learning models, AAAI, Portland
Grzymala-Busse JW (2003) Rough set strategies to date with missing attribute values. In: IEEE conference on proceedings of workshop on foundations and new directions in data mining, data mining, pp 56–63
Karmaker A, Kwer S (2005) Incorporating an EM-approach for handling missing attribute-values in decision tree induction. In: International conference on hybrid intelligent systems, pp 6–11
Kononenko I (1992) Combining decisions of multiple rules. In: du Boulay B, Sgurev V (eds) Artificial intelligence V: methodology, systems, applications, pp 87–96. Elsevier
Lakshminarayan K et al (1996) Imputation of missing data using machine learning techniques. In: Conference knowledge discovery in databases (KDD-96), pp 140–145
Li J, Cercone N (2006) Assigning missing attribute values based on rough sets theory. In: IEEE international conference on granular computing, Atlanta, pp 31–37
Long WJ, Zhang WX (2004) A novel measure of compatibility and methods of missing attribute values treatment in decision tables. In: International conference on machine learning and cybernetics, pp 2356–2360
Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106
Quinlan JR (1989) Unknown attribute values in ID3. In: Proceedings of international workshop on machine learning, pp 164–168
Ragel A, Cremilleux B (1998) Treatment of missing values for association rules. Lecture Notes in Computer Science, vol 1394, pp 258–270
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media New York
About this entry
Cite this entry
Bruha, I. (2017). Missing Attribute Values. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7687-1_954
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7687-1_954
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7685-7
Online ISBN: 978-1-4899-7687-1
eBook Packages: Computer ScienceReference Module Computer Science and Engineering