skip to main content
10.1145/2493525.2493531acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmlisConference Proceedingsconference-collections
research-article

Expectation propagation learning of finite Beta-Liouville mixtures for spatio-temporal object recognition

Published:04 August 2013Publication History

ABSTRACT

In this paper, we develop an efficient approach for the learning of finite Beta-Liouville mixture models. Unlike existing approaches, our is based on expectation propagation for parameters estimation and can select automatically the appropriate number of mixture components. We provide a coherent and unified learning framework to learn the complexity of the deployed mixture models and all the involved model parameters. We illustrate the performance of our learning algorithm with artificial data and a real application namely spatio-temporal objects (or dynamic events) recognition which has significant potential to be used in interactive systems or robotics. In particular, we highlight three of the most common spatio-temporal objects which involving facial expression, human activities and hand gesture. Our experiments results show the merits of the proposed approach.

References

  1. K. Anderson and P. W. McOwan. A real-time automated system for the recognition of human facial expressions. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 36(1):96--105, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. N. Bouguila. Bayesian hybrid generative discriminative learning based on finite liouville mixture models. Pattern Recognition, 44(6):1183--1200, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. N. Bouguila. Hybrid generative/discriminative approaches for proportional data modeling and classification. IEEE Transactions on Knowledge and Data Engineering, 24(12):2184--2202, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. F. Cheng, J. Yu, and H. Xiong. Facial expression recognition in jaffe dataset based on gaussian process classification. IEEE Transactions on Neural Networks, 21(10):1685--1690, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. F. De La Torre, J. Campoy, Z. Ambadar, and J. Cohn. Temporal segmentation of facial behavior. In Proc. of the IEEE 11th International Conference on Computer Vision (ICCV), pages 1--8, 2007.Google ScholarGoogle Scholar
  6. W. Fan and N. Bouguila. Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection. Pattern Recognition, 46(10):2754--2769, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. W. Fan, N. Bouguila, and D. Ziou. Variational learning for finite Dirichlet mixture models and applications. IEEE Transactions on Neural Networks and Learning Systems, 23(5):762--774, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  8. W. Fan, N. Bouguila, and D. Ziou. Unsupervised hybrid feature extraction selection for high-dimensional non-Gaussian data clustering with variational inference. IEEE Transactions on Knowledge and Data Engineering, 25(7):1670--1685, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. Hofmann. Unsupervised learning by probabilistic latent semantic analysis. Machine Learning, 42(1/2):177--196, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. H. Jhuang, T. Serre, L. Wolf, and T. Poggio. A biologically inspired system for action recognition. In Proc. of the IEEE 11th International Conference on Computer Vision (ICCV), pages 1--8, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  11. Y. Ke, R. Sukthankar, and M. Hebert. Efficient visual event detection using volumetric features. In Proc. of the 10th IEEE International Conference on Computer Vision (ICCV), pages 166--173, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. Kim, S. Wong, and R. Cipolla. Tensor canonical correlation analysis for action classification. In Proc. of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1--8, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  13. I. Laptev and T. Lindeberg. Space-time interest points. In Proc. of the IEEE 9th International Conference on Computer Vision (ICCV), pages 432--439, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Z. Li, J. ichi Imai, and M. Kaneko. Facial-component-based bag of words and phog descriptor for facial expression recognition. In Proc. of the IEEE International Conference on Systems, Man and Cybernetics (SMC), pages 1353--1358, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Liu, J. Luo, and M. Shah. Recognizing realistic actions from videos "in the wild". In Proc. of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1996--2003, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  16. Z. Ma and A. Leijon. Expectation propagation for estimating the parameters of the beta distribution. In Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 2082--2085, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  17. G. McLachlan and D. Peel. Finite Mixture Models. New York: Wiley, 2000.Google ScholarGoogle Scholar
  18. T. Minka. Expectation propagation for approximate bayesian inference. In Proc. of the Conference on Uncertainty in Artificial Intelligence (UAI), pages 362--369, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. N. Bouguila and D. Ziou. Unsupervised selection of a finite Dirichlet mixture model: An mml-based approach. IEEE Transactions on Knowledge and Data Engineering, 18(8):993--1009, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. N. Bouguila and D. Ziou. High-dimensional unsupervised selection and estimation of a finite generalized Dirichlet mixture model based on minimum message length. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(10):1716--1731, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. N. Bouguila and D. Ziou. A Dirichlet process mixture of generalized Dirichlet distributions for proportional data modeling. IEEE Transactions on Neural Networks, 21(1):107--122, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. M. Pantic, M. F. Valstar, R. Rademaker, and L. Maat. Web-based database for facial expression analysis. In Proc. of the IEEE International Conference on Multimedia and Expo (ICME), pages 317--321, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  23. V. Parameswaran and R. Chellappa. Quasi-invariants for human action representation and recognition. In Proc. of the 16th IEEE/IAPR International Conference on Pattern Recognition (ICPR), pages 307--310, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Lost in quantization: Improving particular object retrieval in large scale image databases. In Proc. of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pages 1--8, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  25. M. B. H. Rhouma and H. Frigui. Self-organization of pulse-coupled oscillators with application to clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(2):180--195, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. M. F. Valstar and M. Pantic. Combined support vector machines and hidden markov models for modeling facial action temporal dynamics. In Proc. of the IEEE international conference on Human-computer interaction (HCI), pages 118--127, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. F. Valstar and M. Pantic. Induced disgust, happiness and surprise: an addition to the MMI facial expression database. In Proc. of International Conference on Language Resources and Evaluation, Workshop on EMOTION, pages 65--70, 2010.Google ScholarGoogle Scholar
  28. G. Willems, T. Tuytelaars, and L. J. V. Gool. An efficient dense and scale-invariant spatio-temporal interest point detector. In Proc. of the 10th European Conference on Computer Vision (ECCV), pages 650--663, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. S.-F. Wong and R. Cipolla. Extracting spatiotemporal interest points using global information. In Proc. of the IEEE 11th International Conference on Computer Vision (ICCV), pages 1--8, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  30. L. Zelnik-Manor and M. Irani. Event-based analysis of video. In Proc. of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pages II--123--II--130, 2001.Google ScholarGoogle Scholar

Index Terms

  1. Expectation propagation learning of finite Beta-Liouville mixtures for spatio-temporal object recognition

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      MLIS '13: Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
      August 2013
      70 pages
      ISBN:9781450320191
      DOI:10.1145/2493525

      Copyright © 2013 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 4 August 2013

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      MLIS '13 Paper Acceptance Rate10of14submissions,71%Overall Acceptance Rate10of14submissions,71%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader