Cascading for Nominal Data

Maudes, Jesús; Rodríguez, Juan J.; García-Osorio, César

doi:10.1007/978-3-540-72523-7_24

Jesús Maudes¹,
Juan J. Rodríguez¹ &
César García-Osorio¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4472))

Included in the following conference series:

International Workshop on Multiple Classifier Systems

1253 Accesses
1 Citations

Abstract

In pattern recognition many methods need numbers as inputs. Using nominal datasets with these methods requires to transform such data into numerical. Usually, this transformation consists in encoding nominal attributes into a group of binary attributes (one for each possible nominal value). This approach, however, can be enhanced for certain methods (e.g., those requiring linear separable data representations). In this paper, different alternatives are evaluated for enhancing SVM (Support Vector Machine) accuracy with nominal data. Some of these approaches convert nominal into continuous attributes using distance metrics (i.e., VDM (Value Difference Metric)). Other approaches combine the SVM with other classifier which could work directly with nominal data (i.e., a Decision Tree). An experimental validation over 27 datasets shows that Cascading with an SVM at Level-2 and a Decision Tree at Level-1 is a very interesting solution in comparison with other combinations of these base classifiers, and when compared to VDM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Blake, C.L., Merz, C.J.: UCI Repository of Machine Learning Databases, http://www.ics.uci.edu/mlearn/MLRepository.html
Demsar, J.: Statistical Comparisons of Classifiers over Multiple Data Sets. Journal of Machine Learning Research 7, 1–30 (2006)
MathSciNet Google Scholar
Duch, W., Grudzinski, K., Stawski, G.: Symbolic Features in Neural Networks. In: Proc. 5th Conference on Neural Networks and Soft Computing, Zakopane, pp. 180–185 (2000)
Google Scholar
Gama, J., Brazdil, P.: Cascade Generalization. Machine Learning 41(3), 315–343 (2000)
Article MATH Google Scholar
Grabczewski, K., Jankowski, N.: Transformations of Symbolic Data for Continuous Data Oriented Models. In: Kaynak, O., et al. (eds.) ICANN 2003 and ICONIP 2003. LNCS, vol. 2714, pp. 359–366. Springer, Heidelberg (2003)
Google Scholar
Kohavi, R., Wolpert, D.H.: Bias Plus Variance Decomposition for Zero-One Loss Functions. In: Saitta, L. (ed.) Machine Learning, Procs 13th International Conference, pp. 275–283. Morgan Kaufmann, San Francisco (1996)
Google Scholar
Nadeau, C., Bengio, Y.: Inference for the Generalization Error. Machine Learning 52, 239–281 (2003)
Article MATH Google Scholar
Platt, J.: Fast Training of Support Vector Machines using Sequential Minimal Optimization. In: Schoelkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods, MIT Press, Cambridge (1998)
Google Scholar
Quinlan, R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo (1993)
Google Scholar
Seewald, A.K., Fuernkranz, J.: An Evaluation of Grading Classifiers. In: Hoffmann, F., et al. (eds.) IDA 2001. LNCS, vol. 2189, pp. 115–124. Springer, Heidelberg (2001)
Chapter Google Scholar
Stanfill, C., Waltz, D.: Toward Memory-Based Reasoning. Communication of the ACM 29, 1213–1229 (1986)
Article Google Scholar
Witten, H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005), http://www.cs.waikato.ac.nz/ml/weka
MATH Google Scholar
Wolpert, D.: Stacked Generalization. Neural networks 5, 241–260 (1992)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Escuela Politécnica Superior – Lenguajes y Sistemas Informáticos, Universidad de Burgos, Av. Cantabria s/n, 09006, Burgos, Spain
Jesús Maudes, Juan J. Rodríguez & César García-Osorio

Authors

Jesús Maudes
View author publications
You can also search for this author in PubMed Google Scholar
Juan J. Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
César García-Osorio
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Michal Haindl Josef Kittler Fabio Roli

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Maudes, J., Rodríguez, J.J., García-Osorio, C. (2007). Cascading for Nominal Data. In: Haindl, M., Kittler, J., Roli, F. (eds) Multiple Classifier Systems. MCS 2007. Lecture Notes in Computer Science, vol 4472. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72523-7_24

Download citation

DOI: https://doi.org/10.1007/978-3-540-72523-7_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72481-0
Online ISBN: 978-3-540-72523-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics