Abstract
In this paper, we propose an efficient learning framework for both finite and infinite Gamma mixture models. Unlike existing learning methods such as maximum-likelihood method, we propose here to tackle the problem of learning Gamma mixtures within a coherent and unified framework based on an expectation-propagation inference method. In addition, we introduce an effective Bayesian technique in order to generalize the finite Gamma (EP-GaMM) to the infinite mixture (EP-inGaMM). The developed framework offers accurate approximations to the full posterior and takes into account the prior knowledge in the statistical model. In particular, the model’s parameters are estimated accurately and the optimal number of components is determined automatically. The choice of Gamma mixture is motivated by its flexibility and its modeling capabilities in solving many real-life applications. We highlight the importance of the proposed framework by solving main common spatio-temporal objects recognition challenges which might be used especially in interactive systems or robotics. The effectiveness of our approach is demonstrated through several real challenging applications. Our experiments shows comparable to superior results to other methods from the state-of-the-art. Results show that the average recognition accuracy of the proposed framework can achieve 78% which is better than many other related methods.
Similar content being viewed by others
References
Alharithi F, Almulihi A, Bourouis S, Alroobaea R, Bouguila N (2021) Discriminative learning approach based on flexible mixture model for medical data categorization and recognition. Sensors 21(7):2450
Almulihi A, Alharithi F, Bourouis S, Alroobaea R, Pawar Y, Bouguila N (2021) Oil spill detection in sar images using online extended variational learning of dirichlet process mixtures of gamma distributions. Remote Sens 13 (15):2991
Andrews JL, McNicholas PD, Subedi S (2011) Model-based classification via mixtures of multivariate t-distributions. Comput Stat Data Anal 55 (1):520–529
Bhatti UA, Huang M, Wu D, Zhang Y, Mehmood A, Han H (2019) Recommendation system using feature extraction and pattern recognition in clinical care systems. Enterprise Inform Syst 13(3):329–351
Bhatti UA, Yu Z, Chanussot J, Zeeshan Z, Yuan L, Luo W, Nawaz SA, Bhatti MA, Ain QU, Mehmood A (2022) Local similarity-based spatial-spectral fusion hyperspectral image classification with deep CNN and gabor filtering. IEEE Trans Geosci Remote Sens 60:1–15
Bhatti UA, Yu Z, Li J, Nawaz SA, Mehmood A, Zhang K, Yuan L (2020) Hybrid watermarking algorithm using clifford algebra with arnold scrambling and chaotic encryption. IEEE Access 8:76386–76398
Bishop CM (2006) Pattern recognition and machine learning. Springer
Bourouis S, Alroobaea R, Rubaiee S, Andejany M, Almansour FM, Bouguila N (2021) Markov chain monte carlo-based bayesian inference for learning finite and infinite inverted beta-liouville mixture models. IEEE Access 9:71170–71183
Bourouis S, Alroobaea R, Rubaiee S, Andejany M, Bouguila N (2021) Nonparametric bayesian learning of infinite multivariate generalized normal mixture models and its applications. Appl Sci 11(13):5798
Bourouis S, Bouguila N (2021) Nonparametric learning approach based on infinite flexible mixture model and its application to medical data analysis. Int J Imaging Syst Technol 31(4):1989–2002
Bourouis S, Bouguila N (2022) Unsupervised learning using expectation propagation inference of inverted beta-liouville mixture models for pattern recognition applications. Cybern Syst, pp 1–25
Bourouis S, Sallay H, Bouguila N (2021) A competitive generalized gamma mixture model for medical image diagnosis. IEEE Access 9:13727–13736
Channoufi I, Bourouis S, Bouguila N, Hamrouni K (2018) Spatially constrained mixture model with feature selection for image and video segmentation. In: Image and signal processing - 8th international conference, ICISP 2018, cherbourg, france, july 2-4, 2018, proceedings, pp 36–44
Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision, ECCV, vol 1, pp 1–2
Fan GF, Zhang LZ, Yu M, Hong WC, Dong SQ (2022) Applications of random forest in multivariable response surface for short-term load forecasting. International Journal of Electrical Power and Energy Systems 139:108073
Fan W, Bouguila N (2014) Non-gaussian data clustering via expectation propagation learning of finite dirichlet mixture models and applications. Neural Process Lett 39(2):115–135
Fan W, Bouguila N (2015) Expectation propagation learning of a dirichlet process mixture of beta-liouville distributions for proportional data clustering. Eng Appl Artif Intell 43:1–14
Fang H, Parthaláin NM, Aubrey AJ, Tam GKL, Borgo R, Rosin PL, Grant PW, Marshall AD, Chen M (2014) Facial expression recognition in dynamic sequences: an integrated approach. Pattern Recognit 47(3):1271–1281
Ferguson TS (1983) Bayesian density estimation by mixtures of normal distributions. In: Recent advances in statistics. Academic Press, pp 287–302
Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42(1/2):177–196
Kim T, Wong S, Cipolla R (2007) Tensor canonical correlation analysis for action classification. In: 2007 IEEE Computer society conference on computer vision and pattern recognition (CVPR 2007), 18-23 june. IEEE Computer Society, Minneapolis, Minnesota, USA, p 2007
Laptev I, Marszalek M, Schmid C, Rozenfeld B (2008) Learning realistic human actions from movies. In: 2008 IEEE Computer society conference on computer vision and pattern recognition (CVPR 2008), 24-26 june, 2008. IEEE Computer Society, Anchorage, Alaska, USA
Li J, Huang S, Zhang X, Fu X, Chang CC, Tang Z, Luo Z (2018) Facial expression recognition by transfer learning for small datasets. In: International conference on security with intelligent computing and big-data services. Springer, pp 756–770
Li J, Jin K, Zhou D, Kubota N, Ju Z (2020) Attention mechanism-based CNN for facial expression recognition. Neurocomputing 411:340–350
Liu X, Fu H, Jia Y (2008) Gaussian mixture modeling and learning of neighboring characters for multilingual text extraction in images. Pattern Recognit 41(2):484–493
Liu X, Jin L, Han X, You J (2021) Mutual information regularized identity-aware facial expression recognition in compressed video. Pattern Recogn 119:108105
Lucey P, Cohn JF, Kanade T, Saragih JM, Ambadar Z, Matthews IA (2010) The extended cohn-kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression. In: IEEE Conference on computer vision and pattern recognition, CVPR workshops 2010, san francisco, CA, USA, 13-18 June, 2010. IEEE Computer Society, pp 94–101
Ma Z, Leijon A (2010) Expectation propagation for estimating the parameters of the beta distribution. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, ICASSP 2010, 14-19 March 2010, Sheraton Dallas Hotel, Dallas, Texas, USA. pp 2082–2085. IEEE
Mahmoudi MA, Chetouani A, Boufera F, Tabia H (2020) Learnable pooling weights for facial expression recognition. Pattern Recognit Lett, 138
Marcolin F, Vezzetti E, Monaci MG (2021) Face perception foundations for pattern recognition algorithms. Neurocomputing 443:302–319
Maybeck PS (1982) Stochastic models, estimation, and control. Academic Press
McLachlan GJ, Peel D (2004) Finite mixture models. John Wiley & Sons
McNicholas PD (2016) Model-based clustering. J Classif 33(3):331–373
Minka TP (2001) Expectation propagation for approximate bayesian inference. In: Breese JS, Koller D (eds) UAI ’01: Proceedings of the 17th conference in uncertainty in artificial intelligence, University of Washington, Seattle, Washington, USA, August 2-5, 2001. Morgan Kaufmann, pp 362–369
Najar F, Bourouis S, Bouguila N, Belghith S (2019) Unsupervised learning of finite full covariance multivariate generalized gaussian mixture models for human activity recognition. Multim Tools Appl 78(13):18669–18691
Najar F, Bourouis S, Bouguila N, Belghith S (2020) A new hybrid discriminative/generative model using the full-covariance multivariate generalized gaussian mixture models. Soft Comput 24(14):10611–10628
Najar F, Bourouis S, Zaguia A, Bouguila N, Belghith S (2018) Unsupervised human action categorization using a riemannian averaged fixed-point learning of multivariate GGMM. In: Image analysis and recognition - 15th international conference, ICIAR 2018, póvoa de varzim, portugal, june 27-29, 2018, proceedings. pp 408–415
Niebles JC, Wang H, Li F (2008) Unsupervised learning of human action categories using spatial-temporal words. Int J Comput Vis 79(3):299–318
Peng X, Xia Z, Li L, Feng X (2016) Towards facial expression recognition in the wild: a new database and deep recognition system. In: 2016 IEEE Conference on computer vision and pattern recognition workshops, CVPR workshops 2016, las vegas, NV, USA, June 26 - July 1, 2016. IEEE Computer Society, pp 1544–1550
Rodriguez MD, Ahmed J, Shah M (2008) Action MACH a spatio-temporal maximum average correlation height filter for action recognition. In: 2008 IEEE Computer society conference on computer vision and pattern recognition (CVPR 2008), 24-26 june, 2008. Anchorage, Alaska, USA, IEEE Computer Society
Sallay H, Bourouis S, Bouguila N (2021) Online learning of finite and infinite gamma mixture models for COVID-19 detection in medical images. Comput 10(1):6
Sariyanidi E, Gunes H, Cavallaro A (2017) Learning bases of activity for facial expression recognition. IEEE Trans Image Process 26(4):1965–1978
Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: a local svm approach. In: Proceedings of the 17th international conference on pattern recognition, 2004. ICPR 2004, vol 3. IEEE, pp 32–36
Schu̇ldt C., Laptev I, Caputo B (2004) Recognizing human actions: a local SVM approach. In: 17Th international conference on pattern recognition, ICPR 2004, cambridge, UK, August 23-26, 2004. IEEE Computer Society, pp 32–36
Scovanner P, Ali S, Shah M (2007) A 3-dimensional sift descriptor and its application to action recognition. In: Proceedings of the 15th ACM international conference on Multimedia. ACM, pp 357–360
Sethuraman J (1994) A constructive definition of dirichlet priors. Statistica sinica, pp 639–650
Shan C, Gong S, McOwan PW (2009) Facial expression recognition based on local binary patterns: A comprehensive study. Image Vis Comput 27 (6):803–816
Sikka K, Wu T, Susskind J, Bartlett MS (2012) Exploring bag of words architectures in the facial expression domain. In: Fusiello A, Murino V, Cucchiara R (eds) Computer vision - ECCV 2012. Workshops and demonstrations - florence, italy, october 7-13, 2012, proceedings, Part II. Lecture notes in computer science, vol 7584. Springer, pp 250–259
Song Z, Ali S, Bouguila N, Fan W (2020) Nonparametric hierarchical mixture models based on asymmetric gaussian distribution. Digit Signal Process 106:102829
Teh YW (2010) Dirichlet Process. Springer, US, Boston, MA, pp 280–287
Vedaldi A, Fulkerson B (2010) Vlfeat: an open and portable library of computer vision algorithms. In: Bimbo AD, Chang S, Smeulders AWM (eds) Proceedings of the 18th international conference on multimedia 2010, Firenze, Italy, October 25-29, 2010. ACM, pp 1469–1472
Vrigkas M, Nikou C, Kakadiaris IA (2015) A review of human activity recognition methods. Frontiers in Robotics and AI 2:28
Wong S, Cipolla R (2007) Extracting spatiotemporal interest points using global information. In: IEEE 11Th international conference on computer vision, ICCV 2007, rio de janeiro, brazil, october 14-20, 2007. IEEE Computer Society, pp 1–8
Wu H, Lu Z, Zhang J, Li X, Zhao M, Ding X (2021) Facial expression recognition based on multi-features cooperative deep convolutional network. Appl Sci 11(4):1428
Zeng C, Liu J, Li J, Cheng J, Zhou J, Nawaz SA, Xiao X, Bhatti UA (2022) Multi-watermarking algorithm for medical image based on kaze-dct. Journal of Ambient Intelligence and Humanized Computing, PP 1–9
Zhang Z, Lai C, Liu H, Li Y (2020) Infrared facial expression recognition via gaussian-based label distribution learning in the dark illumination environment for human emotion detection. Neurocomputing 409:341–350
Zhao L, Wang Z, Zhang G (2017) Facial expression recognition from video sequences based on spatial-temporal motion local binary pattern and gabor multiorientation fusion histogram. Math Probl Eng 2017
Zhong D, Zhang H, Chang S (1996) Clustering methods for video browsing and annotation. In: Sethi IK, Jain RC (eds) Storage and retrieval for still image and video databases IV, San Diego/La Jolla, CA, USA, January 28 - February 2, 1996. SPIE Proceedings, vol 2670. SPIE, pp 239–246
Zhou H, Zhou S (2019) Scene categorization towards urban tunnel traffic by image quality assessment. J Vis Commun Image Represent, pp 65
Zhu R, Dornaika F, Ruichek Y (2019) Learning a discriminant graph-based embedding with feature selection for image categorization. Neural Netw 111:35–46
Acknowledgement
The researchers would like to acknowledge Deanship of Scientific Research, Taif University for funding this work.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Bourouis, S., Bouguila, N. Expectation propagation learning of finite and infinite Gamma mixture models and its applications. Multimed Tools Appl 82, 33267–33284 (2023). https://doi.org/10.1007/s11042-023-14666-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-14666-w