Abstract
The performance of a classification process depends heavily on the feature used in it. The traditional features/variables selection schemes are mostly developed from the model fitting point of view, which may not be good or efficient for classification purpose. Here we propose a graphical selection method, which allows us to integrate the information in the test data set, and it is suitable for selection useful features from high dimensional data set. We applied it to the Thrombin data set, which was used in KDD CUP 2001. By using the selected features from our graphical method and a SVM classifier, we obtained the higher classification accuracy than the results reported in KDD Cup 2001.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Burges, Christopher J.C.A (1998) Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery, 2(2), 121–167.
Cheng, Jie (2001), KDD CUP 2001 Task 1:Thrombin.
Vapnik, V. (1995). The nature of Statistical Learning Theory, New York: Springer Verlag.
Chih-Chung Chang and Chih-Jen Lin (2000). LIBSVM — A Library for Support Vector Machines.
Mardia, K.V., Kent, J.T. and Bibby, J.M.(1979). Multivariate Analysis. London: Academic Press.
Zanette, D. H. Entropic analysis of the role of words in literary texts. Preprint.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chang, Yc.I., Hsu, H., Chou, LY. (2002). Graphical Features Selection Method. In: Yin, H., Allinson, N., Freeman, R., Keane, J., Hubbard, S. (eds) Intelligent Data Engineering and Automated Learning — IDEAL 2002. IDEAL 2002. Lecture Notes in Computer Science, vol 2412. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45675-9_71
Download citation
DOI: https://doi.org/10.1007/3-540-45675-9_71
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44025-3
Online ISBN: 978-3-540-45675-9
eBook Packages: Springer Book Archive