Statistical Image Object Recognition using Mixture Densities

Journal of Mathematical Imaging and Vision

Abstract

In this paper, we present a mixture-density-based approach to invariant image object recognition. To allow for a reliable estimation of the mixture parameters, the dimensionality of the feature space is optionally reduced by applying a robust variant of linear discriminant analysis. Invariance to affine transformations is achieved by incorporating invariant distance measures such as tangent distance. We propose an approach to estimating covariance matrices with respect to image variabilities, as well as a new approach to combined classification, called the virtual test sample method. Applying the proposed classifier to the well-known US Postal Service handwritten digit recognition task (USPS) yields an excellent error rate of 2.2%. We also propose a simple but effective approach to compensating for local image transformations, which significantly increases the performance of tangent distance on a database of 1,617 medical radiographs taken from daily clinical routine.
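The one-sided tangent distance used here can be illustrated with a minimal numpy sketch (not the authors' implementation): the distance between an observation and a reference is minimised over small transformations of the observation, approximated by its first-order tangent plane. For simplicity, only horizontal and vertical image shifts are used as variabilities; the function names are illustrative.

```python
import numpy as np

def shift_tangents(img):
    """Finite-difference tangent vectors for horizontal and vertical shifts."""
    dx = np.zeros_like(img)
    dx[:, 1:-1] = (img[:, 2:] - img[:, :-2]) / 2.0  # d(img)/d(horizontal shift)
    dy = np.zeros_like(img)
    dy[1:-1, :] = (img[2:, :] - img[:-2, :]) / 2.0  # d(img)/d(vertical shift)
    return np.stack([dx.ravel(), dy.ravel()], axis=1)  # (pixels, 2)

def tangent_distance(x, y):
    """One-sided tangent distance: min over a of ||x + T a - y||^2,
    i.e. the least-squares residual of y - x projected off span(T)."""
    T = shift_tangents(x)
    r = y.ravel() - x.ravel()
    a, *_ = np.linalg.lstsq(T, r, rcond=None)
    return float(np.sum((r - T @ a) ** 2))
```

Because the minimisation includes the untransformed image (a = 0), the tangent distance is never larger than the squared Euclidean distance, which is what makes it a useful drop-in replacement in distance-based classifiers.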



Cite this article

Dahmen, J., Keysers, D., Ney, H. et al. Statistical Image Object Recognition using Mixture Densities. Journal of Mathematical Imaging and Vision 14, 285–296 (2001). https://doi.org/10.1023/A:1011242314266
