Skip to main content
Log in

Semisupervised online learning of hierarchical structures for visual object classification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

One of the main challenges in hierarchical object classification is the derivation of the correct hierarchical structure. The classic way around the problem is assuming prior knowledge about the hierarchical structure itself. Two major drawbacks result from the former assumption. Firstly it has been shown that the hierarchies tend to reduce the differences between adjacent nodes. It has been observed that this trait of hierarchical models results in a less accurate classification. Secondly the mere assumption of prior knowledge about the form of the hierarchy requires an extra amount of information about the dataset that in many real world scenarios may not be available. In this work we address the mentioned problems by introducing online learning of hierarchical models. Our models start from a crude guess of the hierarchy and proceed to figure out the detailed version progressively. We show the merits of the proposed work via extensive simulations and experiments on a real objects database.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Bakhtiari AS, Bouguila N (2010) A hierarchical statistical model for object classification. In: Proc. of the IEEE international workshop on multimedia signal processing (MMSP), pp 493–498

  2. Bakhtiari AS, Bouguila N (2011) An expandable hierarchical statistical framework for count data modeling and its application to object classification. In: Proc. of the IEEE international conference on tools with artificial intelligence (ICTAI), pp 817–824

  3. Bakhtiari AS, Bouguila N (2012) A novel hierarchical statistical model for count data modeling and its application in image classification. In: Proc. of the international conference on neural information processing (ICONIP). LNCS 7664, vol 2, pp 332–340

  4. Bar M (2004) Visual objects in context. Nat Rev Neurosci 5:617–629

    Article  Google Scholar 

  5. Bishop CM (2006) Pattern recognition and machine learning. Springer, New York

    Google Scholar 

  6. Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  7. Bouguila N (2007) Spatial color image databases summarization. In: Proc. of the IEEE conference on acoustics, speech, and signal processing (ICASSP), vol 1, pp 953–956

  8. Bouguila N, ElGuebaly W (2008) On discrete data clustering. In: Proc. of the Pacific-Asia conference on knowledge discovery and data mining (PAKDD), LNCS 5012, pp 503–510

  9. Bouguila N, ElGuebaly W (2008) A generative model for spatial color image databases categorization. In: Proc. of the IEEE conference on acoustics, speech and signal processing (ICASSP), pp 821–824

  10. Bouguila N, ElGuebaly W (2009) Discrete data clustering using finite mixture models. Pattern Recogn 42:33–42

    Article  MATH  Google Scholar 

  11. Bouguila N, Ziou D (2004) Improving content based image retrieval systems using finite multinomial Dirichlet mixture. In: Proc. of the 14th IEEE workshop on machine learning for signal processing (MLSP), pp 23–32

  12. Bouguila N, Ziou D (2006) Unsupervised selection of a finite Dirichlet mixture model: an MML-based approach. IEEE Trans Knowl Data Eng 18:993–1009

    Article  Google Scholar 

  13. Bouguila N, Ziou D (2006) A hybrid SEM algorithm for high-dimensional unsupervised learning using a finite generalized Dirichlet mixture. IEEE Trans Image Process 15:2657–2668

    Article  Google Scholar 

  14. Bouguila N, Ziou D (2007) Unsupervised learning of a finite discrete mixture: applications to texture modeling and image databases summarization. J Vis Commun Image Represent 15:295–309

    Article  Google Scholar 

  15. Bouguila N, Ziou D (2010) A Dirichlet process mixture of generalized Dirichlet distributions for proportional data modeling. IEEE Trans Neural Netw 21:107–122

    Article  Google Scholar 

  16. Bouguila N, Ziou D, Vaillancourt J (2004) Unsupervised learning of a finite mixture model based on the Dirichlet distribution and its application. IEEE Trans Image Process 13:1533–1543

    Article  Google Scholar 

  17. Brunelli R, Mich O (2000) Image retrieval by examples. IEEE Trans Multimedia 2:164–171

    Article  Google Scholar 

  18. Chapelle O, Haffner P, Vapnik VN (1999) Support vector machines for histogram-based image classification. IEEE Trans Neural Netw 10:1055–1064

    Article  Google Scholar 

  19. Connor R, Mosimann J (1969) Concepts of independence for proportions with a generalization of the dirichlet distribution. J Am Stat Assoc 64:194–206

    Article  MATH  MathSciNet  Google Scholar 

  20. Csurka G, Dance CR, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Proc. of the workshop on statistical learning in computer vision, ECCV, pp 1–22

  21. Durett R (2008) Probability models for DNA sequence evolution. Springer, New York

    Book  Google Scholar 

  22. Fang KT, Kotz S, Ng KW (1990) Symmetric multivariate and related distributions. Chapman & Hall/CRC

  23. Fei-Fei L, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: Proc. of the IEEE computer society conference on computer vision and pattern recognition (CVPR), pp 524–531

  24. Fei-Fei L, VanRullen R, Koch C, Perona P (2002) Rapid natural scene categorization in the near absence of attention. Proc Natl Acad Sci 99:9596–9601

    Article  Google Scholar 

  25. Greenspan H, Pinhas AT (2007) Medical image categorization and retrieval for PACS using the GMM-KL framework. IEEE Trans Inf Technol Biomed 11:190–202

    Article  Google Scholar 

  26. Hartigan JA (1975) Clustering algorithms. Wiley, New York

    MATH  Google Scholar 

  27. Hofmann T (1998) Learning and representing topic. A hierarchical mixture for word occurrences in document databases. In: Proc. of the conference for automated learning and discovery (CONALD)

  28. Lee JJ (2008) LIBPMK: a pyramid match toolkit. MIT Computer Science and Artificial Intelligence Laboratory

  29. Lopez-Rubio E, Palomo EJ (2011) Growing hierarchical probabilistic self-organizing graphs. IEEE Trans Neural Netw 22:997–1008

    Article  Google Scholar 

  30. Lowe DG (1999) Object recognition from local scale-invariant features. In: Proc. of the international conference on computer vision (ICCV), vol 2, pp 1150–1157

  31. Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27:1615–1630

    Article  Google Scholar 

  32. Peelen MV, Fei-Fei L, Kastner S (2009) Neural mechanisms of rapid natural scene categorization in human visual cortex. Nature 460:94–97

    Article  Google Scholar 

  33. Sivic J, Russell BC, Zisserman A, Freeman WT, Efros AA (2008) Unsupervised discovery of visual object class hierarchies. In: Proc. of the IEEE computer society conference on computer vision and pattern recognition (CVPR), pp 1–8

  34. Veeramachaneni S, Sona D, Avesani P (2005) Hierarchical Dirichlet model for document classification. In: Proc. of the international conference on machine learning (ICML), pp 928–935

  35. Yu KL, Lam W (1998) A new on-line learning algorithm for adaptive text filtering. In: Proc. of the international conference on information and knowledge management (CIKM), pp 156–160

Download references

Acknowledgements

The completion of this research was made possible thanks to the Natural Sciences and Engineering Research Council of Canada (NSERC). The authors would like to thank the anonymous referees and the associate editor for their helpful comments. The complete source code of this work is available upon request.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nizar Bouguila.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bakhtiari, A.S., Bouguila, N. Semisupervised online learning of hierarchical structures for visual object classification. Multimed Tools Appl 74, 1805–1822 (2015). https://doi.org/10.1007/s11042-013-1719-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-013-1719-y

Keywords

Navigation