Skip to main content

Image Annotation Using a Semantic Hierarchy

  • Conference paper
  • First Online:
Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR 2018)

Abstract

With the fast development of smartphones and social media image sharing, automatic image annotation has become a research area of great interest. It enables indexing, extracting and searching in large collections of images in an easier and faster way. In this paper, we propose a model for the annotation extension of images using a semantic hierarchy. This latter is built from vocabulary keyword annotations combining a mixture of Bernoulli distributions with mixtures of Gaussians.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://s3.amazonaws.com/dl4j-distribution/GoogleNews-vectors-negative300.bin.gz.

References

  1. Bannour, H., Hudelot, C.: Building and using fuzzy multimedia ontologies for semantic image annotation. Multimed. Tools Appl. 72, 2107–2141 (2014)

    Article  Google Scholar 

  2. Barrat, S., Tabbone, S.: Classification and automatic annotation extension of images using Bayesian network. In: da Vitoria Lobo, N., et al. (eds.) SSPR/SPR 2008. LNCS, vol. 5342, pp. 937–946. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-89689-0_97

    Chapter  Google Scholar 

  3. Bart, E., Porteous, I., Perona, P., Welling, M.: Unsupervised learning of visual taxonomies. In: CVPR, pp. 1–8. IEEE (2008)

    Google Scholar 

  4. Blundell, C., Teh, Y.W., Heller, K.A.: Bayesian rose trees. arXiv preprint arXiv:1203.3468 (2012)

  5. Cao, X., Zhang, H., Guo, X., Liu, S., Meng, D.: SLED: semantic label embedding dictionary representation for multilabel image annotation. IEEE IP 24(9), 2746–2759 (2015)

    MathSciNet  Google Scholar 

  6. Chong, W., Blei, D., Li, F.F.: Simultaneous image classification and annotation. In: CVPR, pp. 1903–1910. IEEE (2009)

    Google Scholar 

  7. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. JRSS Ser. B 39(1), 1–38 (1977)

    MathSciNet  MATH  Google Scholar 

  8. El-Bendary, N., Kim, T.H., Hassanien, A.E., Sami, M.: Automatic image annotation approach based on optimization of classes scores. Computing 96(5), 381–402 (2014)

    Article  Google Scholar 

  9. Feng, S., Manmatha, R., Lavrenko, V.: Multiple Bernoulli relevance models for image and video annotation. In: CVPR, vol. 2, pp. 1002–1009. IEEE (2004)

    Google Scholar 

  10. Fountain, T., Lapata, M.: Taxonomy induction using hierarchical random graphs. In: ACL, pp. 466–476 (2012)

    Google Scholar 

  11. Fu, H., Zhang, Q., Qiu, G.: Random forest for image annotation. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7577, pp. 86–99. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33783-3_7

    Chapter  Google Scholar 

  12. Griffin, G., Perona, P.: Learning and using taxonomies for fast visual categorization. In: CVPR, pp. 1–8. IEEE (2008)

    Google Scholar 

  13. Ji, P., Gao, X., Hu, X.: Automatic image annotation by combining generative and discriminant models. Neurocomputing 236, 48–55 (2017)

    Article  Google Scholar 

  14. Jing, X.Y., Wu, F., Li, Z., Hu, R., Zhang, D.: Multi-label dictionary learning for image annotation. IEEE Trans. Image Process. 25(6), 2712–2725 (2016)

    Article  MathSciNet  Google Scholar 

  15. Kalayeh, M.M., Idrees, H., Shah, M.: NMF-KNN: image annotation using weighted multi-view non-negative matrix factorization. In: CVPR, pp. 184–191 (2014)

    Google Scholar 

  16. Lauritzen, S.L., Spiegelhalter, D.J.: Local computations with probabilities on graphical structures and their application to expert systems. JRSS Ser. B 50(2), 157–224 (1988)

    MathSciNet  MATH  Google Scholar 

  17. Li, L.J., Socher, R., Fei-Fei, L.: Towards total scene understanding: classification, annotation and segmentation in an automatic framework. In: CVPR, pp. 2036–2043. IEEE (2009)

    Google Scholar 

  18. Liu, X., Song, Y., Liu, S., Wang, H.: Automatic taxonomy construction from keywords. In: ACM SIGKDD, pp. 1433–1441. ACM (2012)

    Google Scholar 

  19. Low, D.G.: Object recognition from local scale-invariant features. In: Proceedings of the International Conference on Computer Vision, vol. 2, pp. 1150–1157 (1999)

    Google Scholar 

  20. Maihami, V., Yaghmaee, F.: Fuzzy neighbor voting for automatic image annotation. JECEI 4(1), 1–8 (2016)

    Google Scholar 

  21. Marszalek, M., Schmid, C.: Semantic hierarchies for visual object recognition. In: CVPR (2007)

    Google Scholar 

  22. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

  23. Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)

    Article  Google Scholar 

  24. Murthy, V.N., Can, E.F., Manmatha, R.: A hybrid model for automatic image annotation. In: ICMR, pp. 369–376. ACM (2014)

    Google Scholar 

  25. Murthy, V.N., Maji, S., Manmatha, R.: Automatic image annotation using deep learning representations. In: ICMR, pp. 603–606. ACM (2015)

    Google Scholar 

  26. Murthy, V.N., Sharma, A., Chari, V., Manmatha, R.: Image annotation using multi-scale hypergraph heat diffusion framework. In: ICMR. ACM (2016)

    Google Scholar 

  27. Ojala, T., Pietikäinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. PR 29(1), 51–59 (1996)

    Google Scholar 

  28. Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. J. Comput. Vis. 42(3), 145–175 (2001)

    Article  Google Scholar 

  29. Qian, Z., Zhong, P., Chen, J.: Integrating global and local visual features with semantic hierarchies for two-level image annotation. Neurocomputing 171, 1167–1174 (2016)

    Article  Google Scholar 

  30. Swain, M.J., Ballard, D.H.: Color indexing. IJCV 7(1), 11–32 (1991)

    Article  Google Scholar 

  31. Tousch, A.M., Herbin, S., Audibert, J.Y.: Semantic hierarchies for image annotation: a survey. PR 45(1), 333–345 (2012)

    Google Scholar 

  32. Uricchio, T., Ballan, L., Seidenari, L., Bimbo, A.D.: Automatic image annotation via label transfer in the semantic space. PR 71, 144–157 (2017)

    Google Scholar 

  33. Verma, Y., Jawahar, C.V.: Image annotation using metric learning in semantic neighbourhoods. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 836–849. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33712-3_60

    Chapter  Google Scholar 

  34. Wu, L., Hua, X.S., Yu, N., Ma, W.Y., Li, S.: Flickr distance: a relationship measure for visual concepts. TPAMI 34(5), 863–875 (2012)

    Article  Google Scholar 

  35. Zhang, D., Islam, M.M., Lu, G.: A review on automatic image annotation techniques. PR 45(1), 346–362 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Salvatore Tabbone .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bouzaieni, A., Tabbone, S. (2018). Image Annotation Using a Semantic Hierarchy. In: Bai, X., Hancock, E., Ho, T., Wilson, R., Biggio, B., Robles-Kelly, A. (eds) Structural, Syntactic, and Statistical Pattern Recognition. S+SSPR 2018. Lecture Notes in Computer Science(), vol 11004. Springer, Cham. https://doi.org/10.1007/978-3-319-97785-0_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-97785-0_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-97784-3

  • Online ISBN: 978-3-319-97785-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics