Abstract
Multimedia search and retrieval are considerably improved by providing explicit meaning to visual content by the help of ontologies. Several multimedia ontologies have been proposed recently as suitable knowledge models to narrow the well known semantic gap and to enable the semantic interpretation of images. Since these ontologies have been created in different application contexts, establishing links between them, a task known as ontology matching, promises to fully unlock their potential in support of multimedia search and retrieval. This paper proposes and compares empirically two extensional ontology matching techniques applied to an important semantic image retrieval issue: automatically associating common-sense knowledge to multimedia concepts. First, we extend a previously introduced matching approach to use both textual and visual knowledge. In addition, a novel matching technique based on a multimodal graph is proposed. We argue that the textual and visual modalities have to be seen as complementary rather than as exclusive means to improve the efficiency of the application of an ontology matching procedure in the multimedia domain. An experimental evaluation is included.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Athanasiadis, T., Tzouvaras, V., Petridis, K., Precioso, F., Avrithis, Y., Kompatsiaris, Y.: Using a multimedia ontology infrastructure for semantic annotation of multimedia content. In: SemAnnot 2005 (2005)
Dasiopoulou, S., Kompatsiaris, I., Strintzis, M.: Using fuzzy dls to enhance semantic image analysis. In: Semantic Multimedia, pp. 31–46. Springer, Heidelberg (2008)
Dasiopoulou, S., Tzouvaras, V., Kompatsiaris, I., Strintzis, M.: Enquiring MPEG-7 based multimedia ontologies. In: MM Tools and Appls., pp. 1–40 (2010)
Deng, J., Dong, W., Socher, R., Li, L., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: CVPR, pp. 710–719 (2009)
Euzenat, J., Shvaiko, P.: Ontology Matching, 1st edn. Springer, Heidelberg (2007)
Fan, J., Luo, H., Shen, Y., Yang, C.: Integrating visual and semantic contexts for topic network generation and word sense disambiguation. In: ACM CIVR 2009, pp. 1–8 (2009)
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. JMLR 3(1), 1157–1182 (2003)
Haveliwala, T.: Topic-sensitive pagerank: A context-sensitive ranking algorithm for web search. IEEE Transactions on Knowledge and Data Engineering, 784–796 (2003)
Hudelot, C., Atif, J., Bloch, I.: Fuzzy Spatial Relation Ontology for Image Interpretation. Fuzzy Sets and Systems 159, 1929–1951 (2008)
Hudelot, C., Maillot, N., Thonnat, M.: Symbol grounding for semantic image interpretation: from image data to semantics. In: SKCV-Workshop, ICCV (2005)
Inoue, M.: On the need for annotation-based image retrieval. In: Proceedings of the Workshop on Information Retrieval in Context (IRiX), Sheffield, UK, pp. 44–46 (2004)
James, N., Todorov, K., Hudelot, C.: Ontology matching for the semantic annotation of images. In: FUZZ-IEEE. IEEE Computer Society Press, Los Alamitos (2010)
Koskela, M., Smeaton, A.: An empirical study of inter-concept similarities in multimedia ontologies. In: CIVR 2007, pp. 464–471. ACM, New York (2007)
Mihalcea, R., Tarau, P., Figa, E.: Pagerank on semantic networks, with application to word sense disambiguation. In: ICCL, p. 1126. Association for Computational Linguistics (2004)
Miller, G.: WordNet: a lexical database for English. Communications of the ACM 38(11), 39–41 (1995)
Pan, J., Yang, H., Faloutsos, C., Duygulu, P.: Automatic multimedia cross-modal correlation discovery. In: ACM SIGKDD, p. 658. ACM, New York (2004)
Peraldi, I.S.E., Kaya, A., Möller, R.: Formalizing multimedia interpretation based on abduction over description logic aboxes. In: Description Logics (2009)
Russell, B., Torralba, A., Murphy, K., Freeman, W.: LabelMe: a database and web-based tool for image annotation. IJCV 77(1), 157–173 (2008)
Smeulders, A., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Patt. An. Mach. Intell., 1349–1380 (2000)
Smith, J., Chang, S.: Large-scale concept ontology for multimedia. IEEE Multimedia 13(3), 86–91 (2006)
Snoek, C., Huurnink, B., Hollink, L., De Rijke, M., Schreiber, G., Worring, M.: Adding semantics to detectors for video retrieval. IEEE Trans. on Mult. 9(5), 975–986 (2007)
Tansley, R.: The multimedia thesaurus: An aid for multimedia information retrieval and navigation. Master’s thesis (1998)
Todorov, K., Geibel, P., Kühnberger, K.-U.: Extensional ontology matching with variable selection for support vector machines. In: CISIS, pp. 962–968. IEEE Computer Society Press, Los Alamitos (2010)
Tong, H., Faloutsos, C., Pan, J.-Y.: Fast random walk with restart and its applications. In: ICDM 2006, pp. 613–622. IEEE Computer Society, Washington, DC (2006)
Wang, C., Jing, F., Zhang, L., Zhang, H.: Image annotation refinement using random walk with restarts. In: ACM MM, p. 650 (2006)
Wu, L., Hua, X.-S., Yu, N., Ma, W.-Y., Li, S.: Flickr distance. In: MM 2008, pp. 31–40. ACM, New York (2008)
Yang, Y., Pedersen, J.O.: A comparative study on feature selection in text categorization. In: Fourteenth ICML, pp. 412–420. Morgan Kaufmann Publishers, San Francisco (1997)
Yao, B., Yang, X., Lin, L., Lee, M., Zhu, S.: I2t: Image parsing to text description. IEEE Proc. Special Issue on Internet Vision (to appear)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
James, N., Todorov, K., Hudelot, C. (2011). Combining Visual and Textual Modalities for Multimedia Ontology Matching. In: Declerck, T., Granitzer, M., Grzegorzek, M., Romanelli, M., Rüger, S., Sintek, M. (eds) Semantic Multimedia. SAMT 2010. Lecture Notes in Computer Science, vol 6725. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23017-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-23017-2_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23016-5
Online ISBN: 978-3-642-23017-2
eBook Packages: Computer ScienceComputer Science (R0)