Abstract
In research of pattern recognition, we always want to achieve the correct classification rate according to the characteristics required. Feature extraction greatly affects the design and performance of the classifier, and it is one of the core issue of PR research. As an important component of pattern recognition, feature extraction has been paid close attention by many scholars, and currently has become one of the research hot spots in the field of pattern recognition. This article gives a general discussion of feature extraction, includes linear feature extraction and nonlinear feature extraction, and introduces the frontier methods of this field, at last discusses the development tendency of feature extraction.
Similar content being viewed by others
References
Aizerman M, Branverman E, Rozonoer L (1964) Theoretical foundations of the potential foundation method in patten recognitions. Autom emote control 25: 821–837
Bian Z, Zhang (2000) Pattern Recognition. 2. Tsinghua University Press, Beijing, China
Cheng Z, Chen D (2002) Integrated strategy of pattern classification and its application. J Zhejiang Univ (Engineering Science) 36(6): 601–606
Comon P (1994) Independent component analysis-A new concept. Signal Process 36(3): 287–314
Ding S, Jin F, Wang J (2003) Information feature analysis and selection of orthogonal transformation. Acta Geodaetica Et Cartographic Sinica 32(1): 73–77
Ding S, Jin F, Wang J (2004) New PCA feature compression algorithm based on information theory. Mini-micro Syst 25(4): 694–697
Ding S, Jin F (2003) Information characteristics K-L transform based on information entropy. Trans Nonferr Met Soc China 13(3): 729–734
Ding S, Jin F, Shi Z (2005) Information feature compression based on partial least squares. J Comput Aided Des Comput Graph 17(2): 368–371
Ding S, Jin F, Wang X et al (2005) Information feature compression algorithm based on SCEC. Mini-micro Sys 26(7): 1202–1205
Fan Y, Chen Y, Sun W et al (2005) Algorithm for bi-directional reduce feature data based on the principal component analysis and immune clustering. Acta Simulata Systematica Sinica 17(1): 148–153
Fang K (1989) Practical multivariate statistic analysis. East China Normal University Press, Shanghai
Foman G (2003) An exnetsive empirical study of feature selection metrics for text classification. J Mach Learn Res 3: 1289–1305
Friedman JH, Tukey JW (1974) A projection pursuit algorithm for exploratory data analysis. IEEE Trans Comput C- 23(9): 881–890
Fujarewicz K, Wiench M (2003) Selecting differentially expressed genes for colon tumor Classification. Int J Appl Math Comput Sci 13(3): 327–335
Huang R, He M, Yang S (2007) A Margin Based Feature Extraction Algorithm for the Small Sample Size problem. Chin J Comput 30(7): 1173–1178
Jensn D, Cohen P (2000) Multipe comparisons in induction algorithms. Mach Learn 38(3): 309–338
Jin Z, Yang J, Lu J (1999) An optimal set of uncorrelated discriminant features. Chin J Comput 22(10): 1105–1110
Jin Z, Yang J, Hu J et al (2001) Face recognition based on the uncorrelated discriminant transformation. Pattern Recognit 34(7): 1405–1416
Johnson A et al (1998) Applied multivariate satatistical analysis. 4th. Prentice-Hall, Inc
Johnson JL (1994) Pulse-coupled neural nets: translation, rotation, scale, distortion and intensity signal invariance for images. Appl Opt 33(26): 6239–6253
Jolliffe IT (1986) Principal component analysis. Springer, Berlin
Jones MC, Sibson R (1987) What is projection pursuit. J Royal Stat Soc Ser A(General) 150(1): 1–36
Jutten C, Herault J (1988) Independent component analysis versus PCA. In: Proceeding of European signal processing Conference, pp 287-314
Jutten C, Herault J (1991) Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture. Signal Process 24(1): 1–10
Karhunen J, Hyvarinen A, Vigario R (1997) Applications of neural blind separation to signal and image processing. In: Proceedings of the IEEE 1997 international conference on acoustics, speech, and signal processing, pp 131–134
Lewis DD (1992) An evaluation of phrasal and clustered representations on a text categorization task. In: Proceedings of the 15th ACM international conference on research and development in information retrieval, Copenhagen, pp 246–254
Li H (1995) Fuzzy Information eentropy criterion for fault features evaluation. Inf Control 24(5): 301–304
Li S, Ji X, Zhu S (1999) The application study of entropy analysis method in feature extraction. J North China Inst Tech 20(3): 278–281
Liu Z, Sheng S (2004) Research on the method of fault feature extraction. Appl Electron Tech 11(19): 19–21
Mao Y, Xia X, Xia Z (2007) A survey for study of feature selection algorithms. Pattern Recognit Artif Intell 20(2): 211–218
Mcculloch S, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. Bulletin Math Biol 5(4): 115–133
Mercer J (1909) Functions of postitive and negative type and their connection with the theory of integral equations. Philos Trans R Soc Lond 209: 415–446
Mika S, Ratsch G, Weston J et al (1999) Fisher discriminant analysis with kernels. In: Neural networks for signal processing IX. IEEE Press, New York, 41–48
Narendra PM, Fukunaga K (1977) A branch and bound algorithm for feature subset selection. IEEE Trans Comput 26(9): 917–922
Nguyen HS, Nguyen SH, Skowron A (1996) Searching for features defined by hyperplanes. Found Intell Sys 1079: 366–375
Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290: 2323–2326
Scholkopf B, Smola A, Mulle KR (1998) Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation 10(5): 1299–1319
Setiono R (2000) Generating concise and accurate classification rules for breast cancer diagnosis. Artif Intell Med 18(3): 205–219
Seung Sebastian H, Lee D (2000) The manifold ways of perception. Science 290(12): 2268–2269
Shan S, Chen Y, Cheng Y (2003) Data mining-concept, models,methods and algorithms. Tsinghua University Press, Beijing
Shannon CE (1948) A mathematical theory of communication. BSTJ 5(1): 379–423
Shawe-Taylor J, Williams C (2003) The stability of kernel principal components analysis and its relation to the process eigenspectrum. In: Advances in neural information processing systems 15. MIT Press, USA
Shen D (1999) Discriminative wavelet shape descriptors for recognition of 2-D patterns. Pattern Recognit 32(2): 151–165
Shi Z (2002) Knowledge discovery. Tsinghua University Press, Beijing
Simek K, Fujarewicz K, Swierniak A et al (2004) Using SVD and SVM methods for selection, classification, clustering and modeling of DNA microarray data. Eng Appl Artif Intel 17(4): 417–427
Smon H (2001) Neural networks: a comprehensive foundation, 2nd edn. Tsinghua University Press, Beijing
Song F, Gao X, Liu S (2005) Dimensionality reduction in statistical pattern recognition and low loss dimensionality reduction. Chin J Comput 28(11): 1915–1922
Sun J (2002) Modern pattern recognition. Defense University of Science and Technology Publishing House, Changsha
Sun P, Xu Z, Shen J (2004a) Nonlinear canonical correlation analysis for discrimination based on kernel methods. Chin J Comput 27(6): 789–795
Sun Z, Bebis G, Miller R (2004b) Object detection using feature subset selection. Pattern Recognit 37(11): 2165–2176
Tenenbaum J, Silva D, Langford J (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(5500): 2319–2323
Vafaie H, De Jong K (1993) Robust feature selection algorithms. In: Proceedings of IEEE conference on tools with artificial intelligence, pp 356-363
Wang H, Zheng J, Yao Z (2006) Application of dimension reduction on using improved LLE based on clustering. J Comput Res Dev 43(8): 1485–1490
Wang Z, Wang H, Leng L (2005) Face recognition combining the null space approach and the fractional LDA. Comput Appl 25(11): 2586–2588
Wiener E, Pedersen JO, Weigend AS (1995) A neural network approach to topic spotting. In: Proceedings of the 4th annual symposium on document analysis and information retrieval, pp 317-332
Yang J, Frangi AF, Yan J et al (2005) KPCA plus IDA: A complete kernel fisher discriminant framework for feature extraction and recognition. IEEE Trans Pattern Anal Mach Intell 27(2): 230–244
Yang Z, Li Y, Hu D (2002) Independent component analysis: a survey. J Autom 28(5): 762–772
Yuen P, Lai J (2001) Face representation using independent component analysis. Pattern Recognit 34(3): 545–553
Zeng H, Yuan Z (1999) About a new approach of selection and reduction on system feature in pattern recognition. J Sichuan Inst Light Ind Chem Technol 12(4): 1–5
Zhang H, Sun G (1999) Tabu search algorithm for feature selection. J Autom 25(4): 457–466
Zhang X (1998) Dynamic programming method for feature selection. J Autom 24(5): 675–668
Zhang Y, Fang K (1997) Practical multivariate statistic analysis. Science Press, Beijing
Zhou D, Gao W, Zhao D (2003) Face recognition based on singular value decomposition and discriminant KL projection. J Softw 14(4): 783–789
Zhu D, Wu C, Qin W (1999) Multivariate statistic analysis and software SAS. Southeast University Press, Nanjing
Zhu Q, Wu B, Wan N (2006) An interest point detect method to stereo images with good repeatability and information content. Acta Electron Sinica 34(2): 205–209
Zhu X (2000) Extracting geological structure information by multi-principal component analysis. J Remote Sens 4(4): 299–303
Zhuang Z, Zhang A, Li F (2007) Based on an optimized LDA algorithm for face recognition. J Electron Inf Technol 29(9): 2047–2049
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ding, S., Zhu, H., Jia, W. et al. A survey on feature extraction for pattern recognition. Artif Intell Rev 37, 169–180 (2012). https://doi.org/10.1007/s10462-011-9225-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-011-9225-y