Skip to main content
Log in

Expectation propagation learning of finite and infinite Gamma mixture models and its applications

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In this paper, we propose an efficient learning framework for both finite and infinite Gamma mixture models. Unlike existing learning methods such as maximum-likelihood method, we propose here to tackle the problem of learning Gamma mixtures within a coherent and unified framework based on an expectation-propagation inference method. In addition, we introduce an effective Bayesian technique in order to generalize the finite Gamma (EP-GaMM) to the infinite mixture (EP-inGaMM). The developed framework offers accurate approximations to the full posterior and takes into account the prior knowledge in the statistical model. In particular, the model’s parameters are estimated accurately and the optimal number of components is determined automatically. The choice of Gamma mixture is motivated by its flexibility and its modeling capabilities in solving many real-life applications. We highlight the importance of the proposed framework by solving main common spatio-temporal objects recognition challenges which might be used especially in interactive systems or robotics. The effectiveness of our approach is demonstrated through several real challenging applications. Our experiments shows comparable to superior results to other methods from the state-of-the-art. Results show that the average recognition accuracy of the proposed framework can achieve 78% which is better than many other related methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Notes

  1. https://labicvl.github.io/gesdb.htm

  2. https://labicvl.github.io/gesdb.htm

References

  1. Alharithi F, Almulihi A, Bourouis S, Alroobaea R, Bouguila N (2021) Discriminative learning approach based on flexible mixture model for medical data categorization and recognition. Sensors 21(7):2450

    Article  Google Scholar 

  2. Almulihi A, Alharithi F, Bourouis S, Alroobaea R, Pawar Y, Bouguila N (2021) Oil spill detection in sar images using online extended variational learning of dirichlet process mixtures of gamma distributions. Remote Sens 13 (15):2991

    Article  Google Scholar 

  3. Andrews JL, McNicholas PD, Subedi S (2011) Model-based classification via mixtures of multivariate t-distributions. Comput Stat Data Anal 55 (1):520–529

    Article  MathSciNet  MATH  Google Scholar 

  4. Bhatti UA, Huang M, Wu D, Zhang Y, Mehmood A, Han H (2019) Recommendation system using feature extraction and pattern recognition in clinical care systems. Enterprise Inform Syst 13(3):329–351

    Article  Google Scholar 

  5. Bhatti UA, Yu Z, Chanussot J, Zeeshan Z, Yuan L, Luo W, Nawaz SA, Bhatti MA, Ain QU, Mehmood A (2022) Local similarity-based spatial-spectral fusion hyperspectral image classification with deep CNN and gabor filtering. IEEE Trans Geosci Remote Sens 60:1–15

    Article  Google Scholar 

  6. Bhatti UA, Yu Z, Li J, Nawaz SA, Mehmood A, Zhang K, Yuan L (2020) Hybrid watermarking algorithm using clifford algebra with arnold scrambling and chaotic encryption. IEEE Access 8:76386–76398

    Article  Google Scholar 

  7. Bishop CM (2006) Pattern recognition and machine learning. Springer

  8. Bourouis S, Alroobaea R, Rubaiee S, Andejany M, Almansour FM, Bouguila N (2021) Markov chain monte carlo-based bayesian inference for learning finite and infinite inverted beta-liouville mixture models. IEEE Access 9:71170–71183

    Article  Google Scholar 

  9. Bourouis S, Alroobaea R, Rubaiee S, Andejany M, Bouguila N (2021) Nonparametric bayesian learning of infinite multivariate generalized normal mixture models and its applications. Appl Sci 11(13):5798

    Article  Google Scholar 

  10. Bourouis S, Bouguila N (2021) Nonparametric learning approach based on infinite flexible mixture model and its application to medical data analysis. Int J Imaging Syst Technol 31(4):1989–2002

    Article  Google Scholar 

  11. Bourouis S, Bouguila N (2022) Unsupervised learning using expectation propagation inference of inverted beta-liouville mixture models for pattern recognition applications. Cybern Syst, pp 1–25

  12. Bourouis S, Sallay H, Bouguila N (2021) A competitive generalized gamma mixture model for medical image diagnosis. IEEE Access 9:13727–13736

    Article  Google Scholar 

  13. Channoufi I, Bourouis S, Bouguila N, Hamrouni K (2018) Spatially constrained mixture model with feature selection for image and video segmentation. In: Image and signal processing - 8th international conference, ICISP 2018, cherbourg, france, july 2-4, 2018, proceedings, pp 36–44

  14. Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision, ECCV, vol 1, pp 1–2

  15. Fan GF, Zhang LZ, Yu M, Hong WC, Dong SQ (2022) Applications of random forest in multivariable response surface for short-term load forecasting. International Journal of Electrical Power and Energy Systems 139:108073

    Article  Google Scholar 

  16. Fan W, Bouguila N (2014) Non-gaussian data clustering via expectation propagation learning of finite dirichlet mixture models and applications. Neural Process Lett 39(2):115–135

    Article  Google Scholar 

  17. Fan W, Bouguila N (2015) Expectation propagation learning of a dirichlet process mixture of beta-liouville distributions for proportional data clustering. Eng Appl Artif Intell 43:1–14

    Article  Google Scholar 

  18. Fang H, Parthaláin NM, Aubrey AJ, Tam GKL, Borgo R, Rosin PL, Grant PW, Marshall AD, Chen M (2014) Facial expression recognition in dynamic sequences: an integrated approach. Pattern Recognit 47(3):1271–1281

    Article  Google Scholar 

  19. Ferguson TS (1983) Bayesian density estimation by mixtures of normal distributions. In: Recent advances in statistics. Academic Press, pp 287–302

  20. Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42(1/2):177–196

    Article  MATH  Google Scholar 

  21. Kim T, Wong S, Cipolla R (2007) Tensor canonical correlation analysis for action classification. In: 2007 IEEE Computer society conference on computer vision and pattern recognition (CVPR 2007), 18-23 june. IEEE Computer Society, Minneapolis, Minnesota, USA, p 2007

  22. Laptev I, Marszalek M, Schmid C, Rozenfeld B (2008) Learning realistic human actions from movies. In: 2008 IEEE Computer society conference on computer vision and pattern recognition (CVPR 2008), 24-26 june, 2008. IEEE Computer Society, Anchorage, Alaska, USA

  23. Li J, Huang S, Zhang X, Fu X, Chang CC, Tang Z, Luo Z (2018) Facial expression recognition by transfer learning for small datasets. In: International conference on security with intelligent computing and big-data services. Springer, pp 756–770

  24. Li J, Jin K, Zhou D, Kubota N, Ju Z (2020) Attention mechanism-based CNN for facial expression recognition. Neurocomputing 411:340–350

    Article  Google Scholar 

  25. Liu X, Fu H, Jia Y (2008) Gaussian mixture modeling and learning of neighboring characters for multilingual text extraction in images. Pattern Recognit 41(2):484–493

    Article  MATH  Google Scholar 

  26. Liu X, Jin L, Han X, You J (2021) Mutual information regularized identity-aware facial expression recognition in compressed video. Pattern Recogn 119:108105

    Article  Google Scholar 

  27. Lucey P, Cohn JF, Kanade T, Saragih JM, Ambadar Z, Matthews IA (2010) The extended cohn-kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression. In: IEEE Conference on computer vision and pattern recognition, CVPR workshops 2010, san francisco, CA, USA, 13-18 June, 2010. IEEE Computer Society, pp 94–101

  28. Ma Z, Leijon A (2010) Expectation propagation for estimating the parameters of the beta distribution. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, ICASSP 2010, 14-19 March 2010, Sheraton Dallas Hotel, Dallas, Texas, USA. pp 2082–2085. IEEE

  29. Mahmoudi MA, Chetouani A, Boufera F, Tabia H (2020) Learnable pooling weights for facial expression recognition. Pattern Recognit Lett, 138

  30. Marcolin F, Vezzetti E, Monaci MG (2021) Face perception foundations for pattern recognition algorithms. Neurocomputing 443:302–319

    Article  Google Scholar 

  31. Maybeck PS (1982) Stochastic models, estimation, and control. Academic Press

  32. McLachlan GJ, Peel D (2004) Finite mixture models. John Wiley & Sons

  33. McNicholas PD (2016) Model-based clustering. J Classif 33(3):331–373

    Article  MathSciNet  MATH  Google Scholar 

  34. Minka TP (2001) Expectation propagation for approximate bayesian inference. In: Breese JS, Koller D (eds) UAI ’01: Proceedings of the 17th conference in uncertainty in artificial intelligence, University of Washington, Seattle, Washington, USA, August 2-5, 2001. Morgan Kaufmann, pp 362–369

  35. Najar F, Bourouis S, Bouguila N, Belghith S (2019) Unsupervised learning of finite full covariance multivariate generalized gaussian mixture models for human activity recognition. Multim Tools Appl 78(13):18669–18691

    Article  Google Scholar 

  36. Najar F, Bourouis S, Bouguila N, Belghith S (2020) A new hybrid discriminative/generative model using the full-covariance multivariate generalized gaussian mixture models. Soft Comput 24(14):10611–10628

    Article  Google Scholar 

  37. Najar F, Bourouis S, Zaguia A, Bouguila N, Belghith S (2018) Unsupervised human action categorization using a riemannian averaged fixed-point learning of multivariate GGMM. In: Image analysis and recognition - 15th international conference, ICIAR 2018, póvoa de varzim, portugal, june 27-29, 2018, proceedings. pp 408–415

  38. Niebles JC, Wang H, Li F (2008) Unsupervised learning of human action categories using spatial-temporal words. Int J Comput Vis 79(3):299–318

    Article  Google Scholar 

  39. Peng X, Xia Z, Li L, Feng X (2016) Towards facial expression recognition in the wild: a new database and deep recognition system. In: 2016 IEEE Conference on computer vision and pattern recognition workshops, CVPR workshops 2016, las vegas, NV, USA, June 26 - July 1, 2016. IEEE Computer Society, pp 1544–1550

  40. Rodriguez MD, Ahmed J, Shah M (2008) Action MACH a spatio-temporal maximum average correlation height filter for action recognition. In: 2008 IEEE Computer society conference on computer vision and pattern recognition (CVPR 2008), 24-26 june, 2008. Anchorage, Alaska, USA, IEEE Computer Society

  41. Sallay H, Bourouis S, Bouguila N (2021) Online learning of finite and infinite gamma mixture models for COVID-19 detection in medical images. Comput 10(1):6

    Article  Google Scholar 

  42. Sariyanidi E, Gunes H, Cavallaro A (2017) Learning bases of activity for facial expression recognition. IEEE Trans Image Process 26(4):1965–1978

    Article  MathSciNet  MATH  Google Scholar 

  43. Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: a local svm approach. In: Proceedings of the 17th international conference on pattern recognition, 2004. ICPR 2004, vol 3. IEEE, pp 32–36

  44. Schu̇ldt C., Laptev I, Caputo B (2004) Recognizing human actions: a local SVM approach. In: 17Th international conference on pattern recognition, ICPR 2004, cambridge, UK, August 23-26, 2004. IEEE Computer Society, pp 32–36

  45. Scovanner P, Ali S, Shah M (2007) A 3-dimensional sift descriptor and its application to action recognition. In: Proceedings of the 15th ACM international conference on Multimedia. ACM, pp 357–360

  46. Sethuraman J (1994) A constructive definition of dirichlet priors. Statistica sinica, pp 639–650

  47. Shan C, Gong S, McOwan PW (2009) Facial expression recognition based on local binary patterns: A comprehensive study. Image Vis Comput 27 (6):803–816

    Article  Google Scholar 

  48. Sikka K, Wu T, Susskind J, Bartlett MS (2012) Exploring bag of words architectures in the facial expression domain. In: Fusiello A, Murino V, Cucchiara R (eds) Computer vision - ECCV 2012. Workshops and demonstrations - florence, italy, october 7-13, 2012, proceedings, Part II. Lecture notes in computer science, vol 7584. Springer, pp 250–259

  49. Song Z, Ali S, Bouguila N, Fan W (2020) Nonparametric hierarchical mixture models based on asymmetric gaussian distribution. Digit Signal Process 106:102829

    Article  Google Scholar 

  50. Teh YW (2010) Dirichlet Process. Springer, US, Boston, MA, pp 280–287

    Google Scholar 

  51. Vedaldi A, Fulkerson B (2010) Vlfeat: an open and portable library of computer vision algorithms. In: Bimbo AD, Chang S, Smeulders AWM (eds) Proceedings of the 18th international conference on multimedia 2010, Firenze, Italy, October 25-29, 2010. ACM, pp 1469–1472

  52. Vrigkas M, Nikou C, Kakadiaris IA (2015) A review of human activity recognition methods. Frontiers in Robotics and AI 2:28

    Article  Google Scholar 

  53. Wong S, Cipolla R (2007) Extracting spatiotemporal interest points using global information. In: IEEE 11Th international conference on computer vision, ICCV 2007, rio de janeiro, brazil, october 14-20, 2007. IEEE Computer Society, pp 1–8

  54. Wu H, Lu Z, Zhang J, Li X, Zhao M, Ding X (2021) Facial expression recognition based on multi-features cooperative deep convolutional network. Appl Sci 11(4):1428

    Article  Google Scholar 

  55. Zeng C, Liu J, Li J, Cheng J, Zhou J, Nawaz SA, Xiao X, Bhatti UA (2022) Multi-watermarking algorithm for medical image based on kaze-dct. Journal of Ambient Intelligence and Humanized Computing, PP 1–9

  56. Zhang Z, Lai C, Liu H, Li Y (2020) Infrared facial expression recognition via gaussian-based label distribution learning in the dark illumination environment for human emotion detection. Neurocomputing 409:341–350

    Article  Google Scholar 

  57. Zhao L, Wang Z, Zhang G (2017) Facial expression recognition from video sequences based on spatial-temporal motion local binary pattern and gabor multiorientation fusion histogram. Math Probl Eng 2017

  58. Zhong D, Zhang H, Chang S (1996) Clustering methods for video browsing and annotation. In: Sethi IK, Jain RC (eds) Storage and retrieval for still image and video databases IV, San Diego/La Jolla, CA, USA, January 28 - February 2, 1996. SPIE Proceedings, vol 2670. SPIE, pp 239–246

  59. Zhou H, Zhou S (2019) Scene categorization towards urban tunnel traffic by image quality assessment. J Vis Commun Image Represent, pp 65

  60. Zhu R, Dornaika F, Ruichek Y (2019) Learning a discriminant graph-based embedding with feature selection for image categorization. Neural Netw 111:35–46

    Article  MATH  Google Scholar 

Download references

Acknowledgement

The researchers would like to acknowledge Deanship of Scientific Research, Taif University for funding this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sami Bourouis.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bourouis, S., Bouguila, N. Expectation propagation learning of finite and infinite Gamma mixture models and its applications. Multimed Tools Appl 82, 33267–33284 (2023). https://doi.org/10.1007/s11042-023-14666-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-14666-w

Keywords

Navigation