Abstract
Many XML retrieval applications require relevance-oriented ranking of retrieved elements in order to capture the vagueness inherent to the information retrieval process. This relevance-oriented ranking should not only support vagueness at the content level, but also at the structural level. In this paper, we use a probabilistic object-relational framework to model representation and retrieval strategies that take into account vagueness at both content and structure level. Our approach makes use of established database technology combined with sound probability theory, thus allowing for fast and flexible prototyping of various representation and retrieval strategies.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Abiteboul, S., Quass, D., McHugh, J., Widom, J., Wiener, J., Widom, J.: The Lorel Query Language for Semistructured Data. International Journal on Digital Libraries 1(1), 68–88 (1997)
Amer-Yahia, S., Cho, S., Srivasta, D.: Tree pattern relaxation. In: Jensen, C.S., Jeffery, K., Pokorný, J., Šaltenis, S., Bertino, E., Böhm, K., Jarke, M. (eds.) EDBT 2002. LNCS, vol. 2287, p. 496. Springer, Heidelberg (2002)
Berglund, A., Boag, S., Chamberlin, D., Fernandez, M.F., Kay, M., Robie, J., Simeon, J.: XML Path Language (XPath) 2.0. W3C Working Draft (November 2002), http://www.w3.org/TR/xpath20
Blanken, H., Schenkel, R., Grabs, T., Weikum, G.: Intelligent Search on XML. Springer, Heidelberg (2003)
Boag, S., Chamberlin, D., Fernandez, M.F., Florescu, D., Robie, J., Simeon, J.: XQuery: An XML query language. W3C Working Draft (2002), http://www.w3.org/TR/XQuery
Carmel, D., Maarek, Y.S., Mandelbrod, M., Mass, Y., Soffer, A.: Searching XML documents via XML fragments. In: Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada, July 2003, pp. 151–158 (2003)
Chamberlin, D., Robie, J., Florescu, D.: Quilt: An XML query language for heterogeneous data sources. In: International Workshop on the Web and Databases (WebDB), Texas, USA, May 2000, pp. 53–62 (2000)
Fuhr, N., Goevert, N., Kazai, G., Lalmas, M. (eds.): INEX: Evaluation Initiative for XML retrieval - INEX 2002 Workshop Proceedings, DELOS Workshop (2003)
Fuhr, N., Grossjohann, K.: XIRQL: A query language for information retrieval in XML documents. In: Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, USA (August 2001)
Fuhr, N., Rölleke, T.: A probabilistic relational algebra for the integration of information retrieval and database systems. ACM Transactions on Information Systems 14(1), 32–66 (1997)
Jansen, B.J., Spink, A., Saracevic, T.: Real life, real users and real needs: A study and analysis of user queries on the web. Information Processing & Management 36(2), 207–227 (2000)
Kazai, G., Lalmas, M., Fuhr, N., Goevert, N.: A report on the first year of the INitiative for the Evaluation of XML Retrieval (INEX02). In: Journal of the American Society for Information Science and Technology (2004) (in press)
Lalmas, M., Rölleke, T., Fuhr, N.: Intelligent hypermedia retrieval. In: Szczepaniak, P.S., Segovia, F., Zadeh, L.A. (eds.) Intelligent Exploration of the Web, Springer, Heidelberg (2002)
Roelleke, T.: A frequency-based and a poisson-based probability of being informative. In: Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada, July 2003, pp. 227–234 (2003)
Rölleke, T.: POOL: Probabilistic Object-Oriented Logical Representation and Retrieval of Complex Objects. Shaker Verlag, Aachen (1999) ,Dissertation
Rölleke, T., Lalmas, M., Kazai, G., Ruthven, I., Quicker, S.: The accessibility dimension for structured document retrieval. In: Crestani, F., Girolami, M., van Rijsbergen, C.J.K. (eds.) ECIR 2002. LNCS, vol. 2291, p. 284. Springer, Heidelberg (2002)
Schlieder, T., Meuss, M.: Result ranking for structured queries against xml documents. In: DELOS Workshop: Information Seeking, Searching and Querying in Digital Libraries, Zurich, Switzerland (2000)
Theobald, A., Weikum, G.: The index-based XXL search engine for querying XML data with relevance ranking. In: Jensen, C.S., Jeffery, K., Pokorný, J., Šaltenis, S., Bertino, E., Böhm, K., Jarke, M. (eds.) EDBT 2002. LNCS, vol. 2287, pp. 477–495. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lalmas, M., Rölleke, T. (2004). Modelling Vague Content and Structure Querying in XML Retrieval with a Probabilistic Object-Relational Framework. In: Christiansen, H., Hacid, MS., Andreasen, T., Larsen, H.L. (eds) Flexible Query Answering Systems. FQAS 2004. Lecture Notes in Computer Science(), vol 3055. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25957-2_34
Download citation
DOI: https://doi.org/10.1007/978-3-540-25957-2_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22160-9
Online ISBN: 978-3-540-25957-2
eBook Packages: Springer Book Archive