Pattern graph-based image retrieval system combining semantic and visual features

Allani, Olfa; Zghal, Hajer Baazaoui; Mellouli, Nedra; Akdag, Herman

doi:10.1007/s11042-017-4716-8

Pattern graph-based image retrieval system combining semantic and visual features

Published: 20 May 2017

Volume 76, pages 20287–20316, (2017)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Olfa Allani^1,2,
Hajer Baazaoui Zghal ORCID: orcid.org/0000-0002-2151-7397¹,
Nedra Mellouli^2,3 &
…
Herman Akdag²

395 Accesses
Explore all metrics

Abstract

In the literature, several image retrieval approaches that allow mapping between low-level features and high-level semantics have been proposed. Among these one can cite object recognition, ontologies, and relevance feedback. However, their main limitations concern their high dependence on reliable external resources (existing ontologies, learning sets, etc.) and lack of capacity to combine semantic and visual information and provide relevant results. This paper proposes a system aiming to improve image retrieval results. The proposed system is based on a pattern graph combining semantic and visual features. The idea is (1) to automatically build a modular ontology based on a learning step from textual corpus and terminological resource, (2) to organize visual features in a graph-based model where the combined module and graph represent a unique component called “pattern,” and (3) to build a pattern graph. To this end our system has been implemented. The obtained experimental results show that the pattern graph that we propose enables an improvement of retrieval task.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Semantic-Based Image Retrieval Using RS-Tree and Knowledge Graph

Graph-Based Image Retrieval: State of the Art

Search-Based Image Annotation: Extracting Semantics from Similar Images

Notes

References

Allani O, Mellouli N, Baazaoui H, Akdag H, Ben Ghezala H (2015) A relevant visual feature selection approach for image retrieval The international conference on computer vision theory and applications. VISAPP, p 2015
Google Scholar
Arni T, Clough P, Sanderson M, Grubinger M (2008) Overview of the ImageCLEFphoto 2008 photographic retrieval task Workshop of the cross-language evaluation forum for European languages. Springer, Berlin Heidelberg, pp 500–511
Google Scholar
Baker LD, Mccallum AK (1998) Distributional clustering of words for text classification Proceedings of the 21st annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 96–103
Google Scholar
Bannour H, Hudelot C (2014) Building And using fuzzy multimedia ontologies for semantic image annotation. Multimedia Tools Appl 72(3):2107–2141
Article Google Scholar
Barbu T (2013) A Novel Image Similarity Metric using SIFT-based Characteristics Mathematical models in engineering and computer science: proceedings of the 2nd international conference on computers, digital communications and computing, ICDCC’13, pp 15–18
Google Scholar
Besbes G, Baazaoui-Zghal H (2014) Modular ontologies and CBR-based hybrid system for web information retrieval. Multimedia Tools Appl 1–25
Cheng Z, Shen J, Xie L, Zhu L (2017) Unsupervised visual hashing with semantic assistant for content-based image retrieval. IEEE Trans Knowl Data Eng 29:472–486
Article Google Scholar
Choi D, Kim J, Kim H, et al. (2012) A method for enhancing image retrieval based on annotation using modified wup similarity in wordnet Proceedings of the 11th WSEAS international conference on artificial intelligence, knowledge engineering and data bases AIKED, vol 2012, pp 83–87
Crucianu M, Ferecatu M, Boujemaa N (2004) Relevance feedback for image retrieval: a short survey. Report of the DELOS2 European Network of Excellence (FP6)
Cui J, Liu Y, Xu Y, Zhao H, Zha H (2013) Tracking generic human motion via fusion of low-and high-dimensional approaches. IEEE Trans Syst Man Cybern Syst 43(4):996–1002
Article Google Scholar
Dao MS, Boato G, DeNatale FG (2012) Discovering inherent event taxonomies from social media collections Proceedings of the 2nd ACM international conference on multimedia retrieval. ACM, p 48
Demir B, Bruzzone L (2015) A novel active learning method in relevance feedback for content-based remote sensing image retrieval. IEEE Trans Geosci Remote Sens 53 (5):2323–2334
Article Google Scholar
D’aquin M, Sabou M, et Motta E (2006) Modularization: a key for the dynamic selection of relevant knowledge components
Escalante HJ, Hernández CA, Gonzalez JA (2010) The segmented and annotated IAPR TC-12 benchmark. Comput Vis Image Underst 114(4):419–428
Article Google Scholar
Feng D., Siu W. C., Zhang H. J. (eds.) (2013) Multimedia information retrieval and management: technological fundamentals and applications. Springer Science and Business Media
Fundel K, Küffner R., Zimmer R (2007) RelEx—Relation extraction using dependency parse trees. Bioinformatics 23(3):365–371
Article Google Scholar
Hammiche S, Benbernou S, et Vakali A (2005) A logic based approach for the multimedia data representation and retrieval 7th IEEE International symposium on multimedia. IEEE, p 8
Google Scholar
Hernández-Gracidas CA, Sucar LE, Montes-Y-Gómez M (2013) Improving image retrieval by using spatial relations. Multimedia Tools Appl 62(2):479–505
Article Google Scholar
Khalid YIA, Noah SA (2011) A Framework for integrating DBpedia in a multi-modality ontology news image retrieval system 2011 International conference on semantic technology and information retrieval (STAIR). IEEE, pp 144–149
Liqiang N, Meng W, Zheng-Jun Z, Tat-Seng C (2012) Oracle in image search: a content-based approach to performance prediction. ACM Trans Inf Syst
Lin D (1998) Automatic retrieval and clustering of similar words Proceedings of the 17th international conference on computational linguistics-Volume 2. Association for Computational Linguistics, vol 1998, pp 768–774
Liu AA, Nie WZ, Gao Y, Su YT (2016) Multi-modal clique-graph matching for view-based 3D model retrieval. IEEE Trans Image Process 25(5):2103–2116
Article MathSciNet Google Scholar
Liu L, Cheng L, Liu Y, Jia Y, Rosenblum DS (2016) Recognizing complex activities by a probabilistic interval-based model AAAI, pp 1266–1272
Google Scholar
Liu Y, Cui J, Zhao H, Zha H (2012) Fusion of low-and high-dimensional approaches by trackers sampling for generic human motion tracking 2012 21st International conference on pattern recognition (ICPR). IEEE, pp 898–901
Liu Y, Liang Y, Liu S, Rosenblum DS, Zheng Y (2016) Predicting urban water quality with ubiquitous data. arXiv:1610.09462
Liu Y, Nie L, Han L, Zhang L, Rosenblum DS (2016) Action2activity: recognizing complex activities from sensor data. arXiv:1611.01872
Liu Y, Nie L, Liu L, Rosenblum DS (2016) From action to activity: sensor-based activity recognition. Neurocomputing 181:108–115
Article Google Scholar
Liu Y, Zhang D, Lu G, Ma W-Y (2007) A survey of content-based image retrieval with high-level semantics. Pattern Recogn 40(1):262–282
Article MATH Google Scholar
Liu Y, Zhang L, Nie L, Yan Y, Rosenblum DS (2016) Fortune teller: predicting your career path AAAI, pp 201–207
Google Scholar
Liu Y, Zhang X, Cui J, Wu C, Aghajan H, Zha H (2010) Visual analysis of child-adult interactive behaviors in video sequences 2010 16th International conference on virtual systems and multimedia (VSMM). IEEE, pp 26–33
Liu Y, Zheng Y, Liang Y, Liu S, Rosenblum DS (2016) Urban water quality prediction based on multi-task multi-view learning Proceedings of the international joint conference on artificial intelligence
Google Scholar
Lu Y, Wei Y, Liu L, Zhong J, Sun L, Liu Y (2016) Towards unsupervised physical activity recognition using smartphone accelerometers. Multimedia Tools Appl
Meghini C, Sebastiani F, Straccia U (2001) A model of multimedia information retrieval. J ACM (JACM) 48(5):909–970
Article MathSciNet MATH Google Scholar
Mezaris V, Kompatsiaris I, Strintzis MG (2004) Region-based image retrieval using an object ontology and relevance feedback. Eurasip J Appl Signal Process, 2004 2004:886–901
Article Google Scholar
Minu RI, Thyagharajan KK (2012) Multimodal ontology search for semantic image retrieval. Submitted to International Journal of Computer System Science and Engineering for February, no 2012
Moehrmann J, Heidemann G (2013) Semi-automatic image annotation International conference on computer analysis of images and patterns. Springer, Berlin Heidelberg, pp 266–273
Chapter Google Scholar
Moro A, Raganato A, Navigli R (2014) 2014 Entity linking meets word sense disambiguation: a unified approach. Transactions of the Association for Computational Linguistics (TACL) 2:231–244
Google Scholar
Mustapha NB, Aufaure MA, Zghal HB, Ghezala HB (2012) Modular ontological warehouse for adaptative information search Model and Data Engineering. Springer, Berlin Heidelberg, pp 79–90
Chapter Google Scholar
Navigli R, Ponzetto SP (2012) BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artif Intell 193:217–250
Article MathSciNet MATH Google Scholar
Nie L, Wang M, Gao Y, Zha ZJ, Chua TS (2013) Beyond text QA: multimedia answer generation by harvesting web information. IEEE Trans Multimedia 15(2):426–441
Article Google Scholar
Nie W, Liu A, Su Y (2016) Cross-domain semantic transfer from large-scale social media. Multimedia Systems 22(1):75–85
Article Google Scholar
Nie W, Liu A, Zhu X, Su Y (2016) Quality models for venue recommendation in location-based social network. Multimedia Tools and Appl 75(20):12521–12534
Article Google Scholar
Pham T-T, Maillot NE, Lim J-H, Chevallet J-P (2007) Latent semantic fusion model for image retrieval and annotation Proceedings of the 16th ACM conference on conference on information and knowledge management, CIKM’07. ACM, New York, pp 439–444
Chapter Google Scholar
Poslad S, Kesorn K (2014) A multi-modal incompleteness ontology model (MMIO) to enhance information fusion for image retrieval. Information Fusion 20:225–241
Article Google Scholar
Raoui Y, Bouyakhf EH, Devy M, Regragui F (2011) Global and local image descriptors for content based image retrieval and object recognition. Appl Math Sci 5(42):2109–2136
MATH Google Scholar
Rokach L, Oded M (2005) Clustering methods. Data Mining and Knowledge Discovery Handbook. Springer, USA, pp 321–352
Book MATH Google Scholar
Salton G, McGill MJ (1986) Introduction to modern information retrieval. McGraw-Hill, Inc., New York
MATH Google Scholar
Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380
Article Google Scholar
Smucker MD, Allan J, Carterette B (2007) A comparison of statistical significance tests for information retrieval evaluation. CIKM 2007:623–632
Google Scholar
Straccia U, Visco G (2007) DLMedia: an ontology mediated multimedia information retrieval system Description logics
Google Scholar
Yang Y, Nie F, Xu D, Luo J, Zhuang Y, Pan Y (2012) A multimedia retrieval framework based on semi-supervised ranking and relevance feedback. IEEE Trans Pattern Anal Mach Intell 34(4):723–742
Article Google Scholar
Zhang H, Shang X, Luan H, Wang M, Chua TS (2016) Learning from collective intelligence: feature learning using social images and tags. ACM Trans Multimed Comput Commun Appl

Download references

Author information

Authors and Affiliations

RIADI Laboratory, ENSI, University of Manouba, Manouba, Tunisia
Olfa Allani & Hajer Baazaoui Zghal
LIASD Laboratory, Paris 8 University, Saint-Denis, France
Olfa Allani, Nedra Mellouli & Herman Akdag
IUT of Montreuil, Paris 8 University, Saint-Denis, France
Nedra Mellouli

Authors

Olfa Allani
View author publications
You can also search for this author in PubMed Google Scholar
Hajer Baazaoui Zghal
View author publications
You can also search for this author in PubMed Google Scholar
Nedra Mellouli
View author publications
You can also search for this author in PubMed Google Scholar
Herman Akdag
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hajer Baazaoui Zghal.

Appendix: Detailed and summarized algorithms

Concerning the optimization steps of the algorithm, some elements which allowed the improvement of the computing time and memory use are provided below:

During the intra-pattern search (figure1 step VI, algorithm 3 line 13), if the first obtained similarity measures on the randomly selected regions are judged weak (similarity measure less than 0.4), the pattern is left and we enchain with computing similarity to the next pattern. In fact this operation decreases the required search time. On one single pattern, the similarity decision time is reduced from 35214 s to 21539 s (+63.5%)
During the inter-pattern and intra-pattern search, treatments are parallelized using a multi-thread approach. A preliminary study has been conducted concerning the thread number which we have varied between 2 and 6 threads. We noticed that the most relevant value is obtained for 4 threads. These threads allow a reduction of the retrieval time from 2602514 s (for a sequential search) to 1865325 s (for a parallelized search).
Some elements are present in an important number of images such as sky, clouds, sun, etc. These elements are stored in a list and considered as visual stop words that do not bring interesting information to the retrieval process. That is why they are not considered during the retrieval process. Indeed, the application of this step reduces the number of compared regions to 2/3 in 65% of the query images and thus gradually improve the retrieval process cost.

The retrieval results are usually stored in a cash for future use. However if they are not used after a certain time, they are automatically deleted in order to constantly keep sufficient memory space. The memory space provided for this storage step is equal to 10 Mo and its use could reduce the retrieval time if the query image was treated before.

These steps contribute to the improvement of the computing time and the memory use.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Allani, O., Zghal, H.B., Mellouli, N. et al. Pattern graph-based image retrieval system combining semantic and visual features. Multimed Tools Appl 76, 20287–20316 (2017). https://doi.org/10.1007/s11042-017-4716-8

Download citation

Received: 18 October 2016
Revised: 24 March 2017
Accepted: 13 April 2017
Published: 20 May 2017
Issue Date: October 2017
DOI: https://doi.org/10.1007/s11042-017-4716-8

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Pattern graph-based image retrieval system combining semantic and visual features

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Semantic-Based Image Retrieval Using RS-Tree and Knowledge Graph

Graph-Based Image Retrieval: State of the Art

Search-Based Image Annotation: Extracting Semantics from Similar Images

Notes

References