Abstract
In this paper, we propose a new method for linear feature extraction and dimensionality reduction in classification problems. The method maximizes the Mutual Information (MI) between the extracted features and the class labels. A Gaussian Mixture model is used to describe the distribution of the data; from this model, the entropy of the data is estimated, and hence the MI at the output. A gradient-based algorithm is provided for the optimization. Experiments are reported in which the method is compared with other popular linear feature extractors.
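To make the idea concrete, the following is a minimal sketch of the approach described above, with several simplifications not taken from the paper: each class-conditional density is modelled by a single Gaussian rather than a full mixture, a one-dimensional projection is learned, and the gradient is approximated numerically instead of derived analytically. The function names (`gaussian_entropy`, `mutual_information`, `extract_feature`) are illustrative, not from the original work.

```python
import numpy as np

def gaussian_entropy(y):
    # Differential entropy of a 1-D Gaussian fitted to the samples y
    # (simplification: the paper models the data with a Gaussian Mixture).
    var = np.var(y) + 1e-12
    return 0.5 * np.log(2 * np.pi * np.e * var)

def mutual_information(w, X, labels):
    # I(y; C) ~= H(y) - sum_c P(c) H(y | C = c), with y = X w.
    y = X @ w
    classes, counts = np.unique(labels, return_counts=True)
    priors = counts / len(labels)
    h_cond = sum(p * gaussian_entropy(y[labels == c])
                 for c, p in zip(classes, priors))
    return gaussian_entropy(y) - h_cond

def extract_feature(X, labels, steps=200, lr=0.5, eps=1e-4, seed=0):
    # Gradient ascent on the MI estimate; the gradient is approximated
    # by central differences rather than the paper's analytic expression.
    rng = np.random.default_rng(seed)
    w = rng.normal(size=X.shape[1])
    w /= np.linalg.norm(w)
    for _ in range(steps):
        grad = np.zeros_like(w)
        for i in range(len(w)):
            d = np.zeros_like(w)
            d[i] = eps
            grad[i] = (mutual_information(w + d, X, labels)
                       - mutual_information(w - d, X, labels)) / (2 * eps)
        w += lr * grad
        w /= np.linalg.norm(w)  # the MI estimate is scale-invariant in w
    return w
```

On two well-separated Gaussian classes, the learned projection should align with the discriminative direction, since that direction maximizes the gap between the marginal and class-conditional entropies.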
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
Cite this paper
Leiva-Murillo, J.M., Artés-Rodríguez, A. (2004). A Gaussian Mixture Based Maximization of Mutual Information for Supervised Feature Extraction. In: Puntonet, C.G., Prieto, A. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2004. Lecture Notes in Computer Science, vol 3195. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30110-3_35
Print ISBN: 978-3-540-23056-4
Online ISBN: 978-3-540-30110-3