Trading off between Misclassification, Recognition and Generalization in Data Mining with Continuous Features

Wang, Dianhui; Dillon, Tharam; Chang, Elizabeth

doi:10.1007/3-540-48035-8_30

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2358)

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

Abstract

This paper develops a data mining approach for representing classification rules and acquiring them automatically from numerical data with continuous attributes. The classification rules are crisp, and each rule is described by an ellipsoidal region defined over attributes that may differ from rule to rule. A regularization model that trades off misclassification rate, recognition rate and generalization ability is presented and applied to rule refinement. A regularizing data mining algorithm is given; it comprises clustering based on self-organizing map networks, feature selection using a breakpoint technique, rule initialization and optimization, and the classifier's structure and usage. An illustrative example demonstrates the applicability and potential of the proposed techniques for domains with continuous attributes.
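
As a rough illustration of what a crisp ellipsoidal rule over a per-rule attribute subset might look like, the following Python sketch tests whether a sample falls inside an ellipsoid and returns the label of the first matching rule. It is not the authors' formulation: the class name `EllipsoidRule`, the function `classify`, the quadratic-form membership test with threshold 1, the first-match policy, and all numeric values are assumptions made for this example only.

```python
import numpy as np


class EllipsoidRule:
    """Hypothetical crisp rule: IF the selected attributes of x lie
    inside an ellipsoid THEN assign `label`."""

    def __init__(self, attrs, center, shape, label):
        self.attrs = np.asarray(attrs, dtype=int)      # indices of the attributes this rule uses
        self.center = np.asarray(center, dtype=float)  # ellipsoid centre in the selected attributes
        self.shape = np.asarray(shape, dtype=float)    # positive-definite matrix (assumed form)
        self.label = label

    def covers(self, x):
        # Crisp membership: (x_S - c)^T A (x_S - c) <= 1
        d = np.asarray(x, dtype=float)[self.attrs] - self.center
        return float(d @ self.shape @ d) <= 1.0


def classify(rules, x):
    """Return the label of the first rule covering x, or None if no rule fires."""
    for rule in rules:
        if rule.covers(x):
            return rule.label
    return None


# Two illustrative rules that use different attributes (made-up values).
rules = [
    EllipsoidRule(attrs=[0, 1], center=[0.0, 0.0], shape=np.diag([1.0, 4.0]), label="A"),
    EllipsoidRule(attrs=[2], center=[5.0], shape=[[1.0]], label="B"),
]

print(classify(rules, [0.5, 0.1, 0.0]))  # covered by the first rule
print(classify(rules, [3.0, 3.0, 5.5]))  # covered only by the second rule
print(classify(rules, [3.0, 3.0, 0.0]))  # covered by neither rule
```

In this sketch a sample that no rule covers is left unclassified, which is one simple way to keep the misclassification rate and the recognition rate as separate quantities; how the paper actually handles uncovered samples is not stated in the abstract.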

Author information

Authors and Affiliations

Department of Computer Science and Computer Engineering, La Trobe University, Melbourne, VIC, 3083, Australia
Dianhui Wang & Tharam Dillon
Department of Computer Science and Software Engineering, Newcastle University, Newcastle, Australia
Elizabeth Chang

Editor information

Editors and Affiliations

Centre for Intelligent Systems and Complex Processes, Swinburne University of Technology, John Street, Hawthorn, Victoria, Australia, 3122
Tim Hendtlass
Department of Computer Science, Southwest Texas State University, 601 University Drive, San Marcos, TX, 78666, USA
Moonis Ali

About this paper

Cite this paper

Wang, D., Dillon, T., Chang, E. (2002). Trading off between Misclassification, Recognition and Generalization in Data Mining with Continuous Features. In: Hendtlass, T., Ali, M. (eds) Developments in Applied Artificial Intelligence. IEA/AIE 2002. Lecture Notes in Computer Science, vol 2358. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48035-8_30

DOI: https://doi.org/10.1007/3-540-48035-8_30
Published: 21 June 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43781-9
Online ISBN: 978-3-540-48035-8
eBook Packages: Springer Book Archive
