Abstract
This paper presents feature selection for the classification of natural disaster texts using testors. A testor is a subset of features that introduces no confusion between classes, and a typical testor is an irreducible testor, that is, one from which no feature can be removed without losing this property. Typical testors can therefore be used to select the words that are relevant for separating the classes, which can improve classification rates. Experiments were carried out with KNN and Naive Bayes classifiers, and the results were compared against the frequency threshold and information gain methods.
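To make the testor definition concrete, the following minimal sketch checks whether a set of word features is a testor or a typical testor on a toy document-term matrix. It is not the authors' algorithm. The function names, the toy vocabulary, the binary word-presence representation, and the use of strict equality as the comparison criterion are all assumptions made for illustration. The exhaustive search is exponential in the number of features, so it is only practical for very small vocabularies.

```python
from itertools import combinations


def is_testor(rows, labels, features):
    """Return True if no two rows from different classes agree on every
    feature in `features` (assumption: features are compared by equality)."""
    for (row_a, label_a), (row_b, label_b) in combinations(zip(rows, labels), 2):
        if label_a != label_b and all(row_a[f] == row_b[f] for f in features):
            return False  # two documents of different classes are confused
    return True


def is_typical_testor(rows, labels, features):
    """Return True if `features` is a testor and removing any single
    feature from it breaks the testor property (irreducibility)."""
    features = list(features)
    if not is_testor(rows, labels, features):
        return False
    return all(
        not is_testor(rows, labels, [g for g in features if g != f])
        for f in features
    )


def typical_testors(rows, labels):
    """Brute-force enumeration of all typical testors (exponential cost)."""
    n_features = len(rows[0])
    return [
        set(subset)
        for size in range(1, n_features + 1)
        for subset in combinations(range(n_features), size)
        if is_typical_testor(rows, labels, subset)
    ]


# Hypothetical toy data: 1 means the word occurs in the document.
vocabulary = ["flood", "earthquake", "rain", "the"]
rows = [
    (1, 0, 1, 1),
    (1, 0, 0, 1),
    (0, 1, 0, 1),
    (0, 1, 1, 1),
]
labels = ["flood", "flood", "earthquake", "earthquake"]

selected = [[vocabulary[i] for i in sorted(t)] for t in typical_testors(rows, labels)]
print(selected)  # [['flood'], ['earthquake']]
```

In this toy example the words "rain" and "the" belong to no typical testor, so a selection based on typical testors would discard them and keep only "flood" and "earthquake".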
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Carrasco-Ochoa, J.A., Martínez-Trinidad, J.F. (2004). Feature Selection for Natural Disaster Texts Classification Using Testors. In: Yang, Z.R., Yin, H., Everson, R.M. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2004. IDEAL 2004. Lecture Notes in Computer Science, vol 3177. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28651-6_62
DOI: https://doi.org/10.1007/978-3-540-28651-6_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22881-3
Online ISBN: 978-3-540-28651-6
eBook Packages: Springer Book Archive