ABSTRACT
In this paper, we develop an efficient approach for the learning of finite Beta-Liouville mixture models. Unlike existing approaches, our is based on expectation propagation for parameters estimation and can select automatically the appropriate number of mixture components. We provide a coherent and unified learning framework to learn the complexity of the deployed mixture models and all the involved model parameters. We illustrate the performance of our learning algorithm with artificial data and a real application namely spatio-temporal objects (or dynamic events) recognition which has significant potential to be used in interactive systems or robotics. In particular, we highlight three of the most common spatio-temporal objects which involving facial expression, human activities and hand gesture. Our experiments results show the merits of the proposed approach.
- K. Anderson and P. W. McOwan. A real-time automated system for the recognition of human facial expressions. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 36(1):96--105, 2006. Google ScholarDigital Library
- N. Bouguila. Bayesian hybrid generative discriminative learning based on finite liouville mixture models. Pattern Recognition, 44(6):1183--1200, 2011. Google ScholarDigital Library
- N. Bouguila. Hybrid generative/discriminative approaches for proportional data modeling and classification. IEEE Transactions on Knowledge and Data Engineering, 24(12):2184--2202, 2012. Google ScholarDigital Library
- F. Cheng, J. Yu, and H. Xiong. Facial expression recognition in jaffe dataset based on gaussian process classification. IEEE Transactions on Neural Networks, 21(10):1685--1690, 2010. Google ScholarDigital Library
- F. De La Torre, J. Campoy, Z. Ambadar, and J. Cohn. Temporal segmentation of facial behavior. In Proc. of the IEEE 11th International Conference on Computer Vision (ICCV), pages 1--8, 2007.Google Scholar
- W. Fan and N. Bouguila. Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection. Pattern Recognition, 46(10):2754--2769, 2013. Google ScholarDigital Library
- W. Fan, N. Bouguila, and D. Ziou. Variational learning for finite Dirichlet mixture models and applications. IEEE Transactions on Neural Networks and Learning Systems, 23(5):762--774, 2012.Google ScholarCross Ref
- W. Fan, N. Bouguila, and D. Ziou. Unsupervised hybrid feature extraction selection for high-dimensional non-Gaussian data clustering with variational inference. IEEE Transactions on Knowledge and Data Engineering, 25(7):1670--1685, 2013. Google ScholarDigital Library
- T. Hofmann. Unsupervised learning by probabilistic latent semantic analysis. Machine Learning, 42(1/2):177--196, 2001. Google ScholarDigital Library
- H. Jhuang, T. Serre, L. Wolf, and T. Poggio. A biologically inspired system for action recognition. In Proc. of the IEEE 11th International Conference on Computer Vision (ICCV), pages 1--8, 2007.Google ScholarCross Ref
- Y. Ke, R. Sukthankar, and M. Hebert. Efficient visual event detection using volumetric features. In Proc. of the 10th IEEE International Conference on Computer Vision (ICCV), pages 166--173, 2005. Google ScholarDigital Library
- T. Kim, S. Wong, and R. Cipolla. Tensor canonical correlation analysis for action classification. In Proc. of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1--8, 2007.Google ScholarCross Ref
- I. Laptev and T. Lindeberg. Space-time interest points. In Proc. of the IEEE 9th International Conference on Computer Vision (ICCV), pages 432--439, 2003. Google ScholarDigital Library
- Z. Li, J. ichi Imai, and M. Kaneko. Facial-component-based bag of words and phog descriptor for facial expression recognition. In Proc. of the IEEE International Conference on Systems, Man and Cybernetics (SMC), pages 1353--1358, 2009. Google ScholarDigital Library
- J. Liu, J. Luo, and M. Shah. Recognizing realistic actions from videos "in the wild". In Proc. of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1996--2003, 2009.Google ScholarCross Ref
- Z. Ma and A. Leijon. Expectation propagation for estimating the parameters of the beta distribution. In Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 2082--2085, 2010.Google ScholarCross Ref
- G. McLachlan and D. Peel. Finite Mixture Models. New York: Wiley, 2000.Google Scholar
- T. Minka. Expectation propagation for approximate bayesian inference. In Proc. of the Conference on Uncertainty in Artificial Intelligence (UAI), pages 362--369, 2001. Google ScholarDigital Library
- N. Bouguila and D. Ziou. Unsupervised selection of a finite Dirichlet mixture model: An mml-based approach. IEEE Transactions on Knowledge and Data Engineering, 18(8):993--1009, 2006. Google ScholarDigital Library
- N. Bouguila and D. Ziou. High-dimensional unsupervised selection and estimation of a finite generalized Dirichlet mixture model based on minimum message length. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(10):1716--1731, 2007. Google ScholarDigital Library
- N. Bouguila and D. Ziou. A Dirichlet process mixture of generalized Dirichlet distributions for proportional data modeling. IEEE Transactions on Neural Networks, 21(1):107--122, 2010. Google ScholarDigital Library
- M. Pantic, M. F. Valstar, R. Rademaker, and L. Maat. Web-based database for facial expression analysis. In Proc. of the IEEE International Conference on Multimedia and Expo (ICME), pages 317--321, 2005.Google ScholarCross Ref
- V. Parameswaran and R. Chellappa. Quasi-invariants for human action representation and recognition. In Proc. of the 16th IEEE/IAPR International Conference on Pattern Recognition (ICPR), pages 307--310, 2002. Google ScholarDigital Library
- J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Lost in quantization: Improving particular object retrieval in large scale image databases. In Proc. of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pages 1--8, 2008.Google ScholarCross Ref
- M. B. H. Rhouma and H. Frigui. Self-organization of pulse-coupled oscillators with application to clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(2):180--195, 2001. Google ScholarDigital Library
- M. F. Valstar and M. Pantic. Combined support vector machines and hidden markov models for modeling facial action temporal dynamics. In Proc. of the IEEE international conference on Human-computer interaction (HCI), pages 118--127, 2007. Google ScholarDigital Library
- M. F. Valstar and M. Pantic. Induced disgust, happiness and surprise: an addition to the MMI facial expression database. In Proc. of International Conference on Language Resources and Evaluation, Workshop on EMOTION, pages 65--70, 2010.Google Scholar
- G. Willems, T. Tuytelaars, and L. J. V. Gool. An efficient dense and scale-invariant spatio-temporal interest point detector. In Proc. of the 10th European Conference on Computer Vision (ECCV), pages 650--663, 2008. Google ScholarDigital Library
- S.-F. Wong and R. Cipolla. Extracting spatiotemporal interest points using global information. In Proc. of the IEEE 11th International Conference on Computer Vision (ICCV), pages 1--8, 2007.Google ScholarCross Ref
- L. Zelnik-Manor and M. Irani. Event-based analysis of video. In Proc. of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pages II--123--II--130, 2001.Google Scholar
Index Terms
- Expectation propagation learning of finite Beta-Liouville mixtures for spatio-temporal object recognition
Recommendations
Expectation propagation learning of a Dirichlet process mixture of Beta-Liouville distributions for proportional data clustering
We propose a nonparametric Bayesian model for the clustering of proportional data. Our model is based on an infinite mixture of Beta-Liouville distributions and allows a compact description of complex data. The choice of the Beta-Liouville as the basis ...
An Expectation-Maximization Algorithm for Blind Separation of Noisy Mixtures Using Gaussian Mixture Model
In this paper, we propose a new expectation-maximization (EM) algorithm, named GMM-EM, to blind separation of noisy instantaneous mixtures, in which the non-Gaussianity of independent sources is exploited by modeling their distribution using the ...
TRUST-TECH-Based Expectation Maximization for Learning Finite Mixture Models
In spite of the initialization problem, the Expectation-Maximization (EM) algorithm is widely used for estimating the parameters of finite mixture models. Most popular model-based clustering techniques might yield poor clusters if the parameters are not ...
Comments