Abstract
The huge number of available documents on the Web makes finding relevant ones a challenging task. The quality of results that traditional full-text search engines provide is still not optimal for many types of user queries. Especially the vagueness of natural languages, abstract concepts, semantic relations and temporal issues are handled inadequately by full-text search. Ontologies and semantic metadata can provide a solution for these problems. This work examines how ontologies can be optimally exploited during the information retrieval process, and proposes a general framework which is based on ontology-supported semantic metadata generation and ontology-based query expansion. The framework can handle imperfect ontologies and metadata by combining results of simple heuristics, instead of relying on a “perfect” ontology. This allows integrating results from traditional full-text engines, and thus supports a gradual transition from classical full-text search engines to ontology-based ones.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This work was partially funded by the VICODI (EU-IST-2001-37534) and DIP (no. FP6 – 507483) EU IST projects.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Baeza-Yates, R.A., Ribeiro-Neto, B.A.: Modern Information Retrieval. ACM Press / Addison-Wesley (1999)
Kuropka, D.: Uselessness of simple co-occurrence measures for IF&IR – a linguistic point of view. In: Proceedings of the 8th International Conference on Business Information Systems, Poznan, Poland (2005)
Salton, G.: Another look at automatic text-retrieval systems. Commun. ACM 29, 648–656 (1986)
Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American 284, 34–43 (2001)
Nagypál, G., Deswarte, R., Oosthoek, J.: Applying the Semantic Web – the VICODI experience in creating visual contextualization for history. In: Literary and Linguistic Computing (2005) (to appear)
Lenz, M., Bartsch-Spörl, B., Burkhard, H.D., Wess, S. (eds.): Case-Based Reasoning Technology. LNCS (LNAI), vol. 1400. Springer, Heidelberg (1998)
Kalczynski, P.J., Chou, A.: Temporal document retrieval model for business news archives. Information Processing & Management 41, 635–650 (2005)
Nagypál, G., Motik, B.: A fuzzy model for representing uncertain, subjective, and vague temporal knowledge in ontologies. In: Meersman, R., Tari, Z., Schmidt, D.C. (eds.) CoopIS 2003, DOA 2003, and ODBASE 2003. LNCS, vol. 2888, pp. 906–923. Springer, Heidelberg (2003)
Vallet, D., Fernández, M., Castells, P.: An ontology-based information retrieval model. In: Gómez-Pérez, A., Euzenat, J. (eds.) ESWC 2005. LNCS, vol. 3532, pp. 455–470. Springer, Heidelberg (2005)
Rocha, C., Schwabe, D., Aragao, M.P.: A hybrid approach for searching in the semantic web. In: WWW 2004: Proceedings of the 13th international conference on World Wide Web, pp. 374–383. ACM Press, New York (2004)
Silveira, M.L., Ribeiro-Neto, B.: Concept-based ranking: a case study in the juridical domain. Information Processing & Management 40, 791–805 (2004)
Ribeiro, B.A.N., Muntz, R.: A belief network model for IR. In: SIGIR 1996: Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 253–260. ACM Press, New York (1996)
Voorhees, E.M., Buckland, L.P. (eds.): NIST Special Publication 500-261: The Thirteenth Text REtrieval Conference Proceedings, TREC 2004 (2004)
Surányi, G.M., Nagypál, G., Schmidt, A.: Intelligent retrieval of digital resources by exploiting their semantic context. In: Meersman, R., Tari, Z. (eds.) OTM 2004. LNCS, vol. 3290, pp. 705–723. Springer, Heidelberg (2004)
Motik, B., Maedche, A., Volz, R.: A conceptual modeling approach for semantics-driven enterprise applications. In: Meersman, R., Tari, Z., et al. (eds.) CoopIS 2002, DOA 2002, and ODBASE 2002. LNCS, vol. 2519, Springer, Heidelberg (2002)
Vlach, R., Kazakos, W.: Using common schemas for information extraction from heterogeneous web catalogs. In: Kalinichenko, L.A., Manthey, R., Thalheim, B., Wloka, U. (eds.) ADBIS 2003. LNCS, vol. 2798, pp. 118–132. Springer, Heidelberg (2003)
Stojanovic, N., Studer, R., Stojanovic, L.: An approach for the ranking of query results in the semantic web. In: Fensel, D., Sycara, K., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 500–516. Springer, Heidelberg (2003)
Kiryakov, A., Popov, B., Terziev, I., Manov, D., Ognyanoff, D.: Semantic annotation, indexing, and retrieval. Journal of Web Semantics 2 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nagypál, G. (2005). Improving Information Retrieval Effectiveness by Using Domain Knowledge Stored in Ontologies. In: Meersman, R., Tari, Z., Herrero, P. (eds) On the Move to Meaningful Internet Systems 2005: OTM 2005 Workshops. OTM 2005. Lecture Notes in Computer Science, vol 3762. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11575863_98
Download citation
DOI: https://doi.org/10.1007/11575863_98
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29739-0
Online ISBN: 978-3-540-32132-3
eBook Packages: Computer ScienceComputer Science (R0)