An ensemble-based method for linear feature extraction for two-class problems

Pattern Analysis and Applications · Theoretical Advances

Abstract

In this paper we propose three variants of a linear feature extraction technique based on AdaBoost for two-class classification problems. Unlike other feature extraction techniques, we make no assumptions about the distribution of the data. At each boosting step we select, from a pool of linear projections, the one that minimizes the weighted error; the three variants differ in the way this pool of candidate projections is constructed. Using nine real and two artificial data sets of different original dimensionality and sample size, we compare the performance of the proposed techniques with three classical techniques for linear feature extraction: Fisher linear discriminant analysis (FLD), nonparametric discriminant analysis (NDA), and a recently proposed feature extraction method for heteroscedastic data based on the Chernoff criterion. Our results show that for data sets of relatively low original dimensionality, FLD is both the most accurate and the most economical feature extraction method, yielding just one dimension in the two-class case. The AdaBoost-based techniques fare better than the classical ones on data sets of large original dimensionality.
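To make the idea concrete, the following is a minimal Python sketch of this kind of boosting-driven feature extraction. It is not the authors' exact algorithm: the pool of random unit-norm directions merely stands in for the three pool constructions studied in the paper, and all names and parameters (best_stump, boosted_feature_extraction, pool_size) are illustrative assumptions. The exhaustive stump search is written for clarity, not speed.

import numpy as np

def best_stump(z, y, w):
    # Exhaustive decision stump on the 1-D projection z.
    # y holds labels in {-1, +1}; w holds the current AdaBoost sample weights.
    best = (np.inf, 0.0, 1)
    for thr in np.unique(z):
        for pol in (1, -1):
            pred = pol * np.sign(z - thr)
            pred[pred == 0] = pol
            err = w[pred != y].sum()
            if err < best[0]:
                best = (err, thr, pol)
    return best  # (weighted error, threshold, polarity)

def boosted_feature_extraction(X, y, n_features=10, pool_size=200, seed=0):
    # Hypothetical sketch: at each boosting round, draw a pool of candidate
    # unit-norm directions, keep the one whose stump has the smallest
    # weighted error, and reweight the samples as in standard AdaBoost.
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.full(n, 1.0 / n)                    # uniform initial weights
    directions = []
    for _ in range(n_features):
        pool = rng.standard_normal((pool_size, d))
        pool /= np.linalg.norm(pool, axis=1, keepdims=True)
        scored = [(best_stump(X @ v, y, w), v) for v in pool]
        (err, thr, pol), v = min(scored, key=lambda s: s[0][0])
        err = np.clip(err, 1e-12, 1 - 1e-12)   # guard the log below
        alpha = 0.5 * np.log((1 - err) / err)  # standard AdaBoost step size
        pred = pol * np.sign(X @ v - thr)
        pred[pred == 0] = pol
        w *= np.exp(-alpha * y * pred)         # emphasize misclassified samples
        w /= w.sum()
        directions.append(v)
    return np.array(directions)                # rows are extracted directions

Calling W = boosted_feature_extraction(X, y, n_features=20) and then projecting with X @ W.T yields the reduced representation; under this reading, the paper's three variants differ only in how the candidate pool is generated at each round.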


Notes

  1. We assume that the reader is familiar with AdaBoost, although the feature extraction procedure should be reproducible from Fig. 3.

  2. Functions from the PRTools 3.1.7 toolbox [24] were used for classifiers 1–3. For the SVM classifier we used the OSU SVM Classifier Matlab Toolbox 3.00, which can be downloaded from http://www.ece.osu.edu/~maj/osu_svm/.

  3. Full information about the standard deviations and the calculated confidence intervals can be found at http://www.cvc.uab.es/~davidm/experiments.htm.

References

  1. Hyvärinen A, Karhunen J, Oja E (2001) Independent component analysis. John Wiley & Sons

  2. Friedman JH (1987) Exploratory projection pursuit. J Am Stat Assoc 82:249–266

  3. Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401:788–791

  4. Fisher RA (1936) The use of multiple measurements in taxonomic problems. Ann Eugenics 7:179–188

  5. Fukunaga K, Mantock J (1983) Nonparametric discriminant analysis. IEEE Trans Pattern Anal Mach Intell 5(6):671–678

  6. Loog M, Duin RPW (2004) Linear dimensionality reduction via a heteroscedastic extension of LDA: the Chernoff criterion. IEEE Trans Pattern Anal Mach Intell 26(6):732–739

  7. McLachlan GJ (2004) Discriminant analysis and statistical pattern recognition. John Wiley & Sons, New York

  8. Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500):2323–2326

  9. Tenenbaum JB, de Silva V, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(5500):2319–2323

  10. Long PM, Vega VB (2003) Boosting and microarray data. Mach Learn 52:31–44

  11. Athitsos V, Alon J, Sclaroff S, Kollios G (2004) BoostMap: a method for efficient approximate similarity rankings. In: Proc IEEE conference on computer vision and pattern recognition (CVPR), vol 2, pp 268–275

  12. Sirlantzis K, Hoque S, Fairhurst MC (2002) Trainable multiple classifier schemes for handwritten character recognition. In: Multiple classifier systems (MCS 2002), pp 169–178

  13. Brown G, Yao X, Wyatt J, Wersing H, Sendhoff B (2002) Exploiting ensemble diversity for automatic feature extraction. In: Proc 9th international conference on neural information processing (ICONIP'02), pp 1786–1790

  14. Kuncheva LI (2004) Combining pattern classifiers. John Wiley & Sons

  15. Breiman L (1998) Arcing classifiers. Ann Stat 26(3):801–849

  16. Kirby M, Sirovich L (1990) Application of the Karhunen-Loève procedure for the characterization of human faces. IEEE Trans Pattern Anal Mach Intell 12(1):103–108

  17. Fukunaga K (1990) Introduction to statistical pattern recognition, 2nd edn. Academic Press, Boston

  18. Bressan M, Vitrià J (2003) Nonparametric discriminant analysis and nearest neighbor classification. Pattern Recogn Lett 24(15):2743–2749

  19. Schapire RE (1999) A brief introduction to boosting. In: IJCAI, pp 1401–1406

  20. Freund Y, Schapire RE (1996) Experiments with a new boosting algorithm. In: Proc 13th international conference on machine learning, pp 148–156

  21. Blake C, Merz C (1998) UCI repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html

  22. Skurichina M (2001) Stabilizing weak classifiers. PhD thesis, Delft University of Technology

  23. Martínez A, Benavente R (1998) The AR face database. Tech Rep 24, Computer Vision Center

  24. Duin RPW (2004) PRTools version 3.1.7. Tech rep, Delft University of Technology. http://www.ph.tn.tudelft.nl/~bob/PRTOOLS.html

Acknowledgements

This work was supported by grants TIC2003-00654 and FP2000-4960 from the Ministerio de Ciencia y Tecnología (MCYT), Spain.

Author information

Correspondence to David Masip.

Cite this article

Masip, D., Kuncheva, L.I. & Vitrià, J. An ensemble-based method for linear feature extraction for two-class problems. Pattern Anal Applic 8, 227–237 (2005). https://doi.org/10.1007/s10044-005-0002-x
