Abstract
Expressive power is a potential source of benefits for Information Retrieval. Indeed, a number of works have been traditionally devoting their efforts to defining models able to manage structured documents. Similarly, many researchers have looked at query formulation and proposed different methods to generate structured queries. Nevertheless few attempts have addressed the combination of both expressive documents and expressive queries and its effects on retrieval performance. This is mostly due to the lack of a coherent and expressive framework in which both documents and queries can be handled in an homogeneous and efficient way. In this work we aim at filling this gap. We test the impact of logical representations for documents and queries under a large-scale evaluation. The experiments show clearly that, under the same conditions, the use of logical representations for both documents and queries leads to significant improvements in retrieval performance. Moreover, the overall performance results make evident that logic-based approaches can be competitive in the field of Information Retrieval.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
N. J. Belkin, C. Cool, W. B. Croft, and J. P. Callan. The effect of multiple query representations on information retrieval system performance. In Proc. of SIGIR-93, the 16th ACM Conference on Research and Development in Information Retrieval, pages 339–346, Pittsburgh, PA, June 1993.
J. P. Callan. Passage-level evidence in document retrieval. In Proc. SIGIR-94, the 17th ACM Conference on Research and Development in Information Retrieval, pages 302–310, Dublin, UK, July 1994.
F. Crestani, M. Lalmas, and C. J. van Rijsbergen (editors). Information Retrieval, Uncertainty and Logics: advanced models for the representation and retrieval of information. Kluwer Academic, Norwell, MA., 1998.
M. Dalal. Investigations into a theory of knowledge base revision:preliminary report. In Proc. AAAI-88, the 7th National Conference on Artificial Intelligence, pages 475–479, Saint Paul, USA, 1988.
D. Harman. Overview of the third text retrieval conference. In Proc. TREC-3, the 3rd text retrieval conference, 1994.
M. Hearst and C. Plaunt. Subtopic structuring for full-length document access. In Proc. SIGIR-93, the 16th ACM Conference on Research and Development in Information Retrieval, pages 59–68, Pittsburgh, USA, June 1993.
D. A. Hull. Using structured queries for disambiguation in cross-language information retrieval. In Proc. of AAAI spring symposium on cross-language text and speech retrieval, Stanford, CA, March 1997.
J. Kekäläinen and K. Jarvelin. The impact of query structure and query expansion on retrieval performance. In Proc. of SIGIR-98, the 21st ACM Conference on Research and Development in Information Retrieval, pages 130–137, Melbourne, Australia, August 1998.
The lemur toolkit. http://www.cs.cmu.edu/lemur.
D. E. Losada. A logical model of information retrieval based on propositional logic and belief revision. PhD thesis, University of A Corunna, 2001.
D. E. Losada and A. Barreiro. Using a belief revision operator for document ranking in extended boolean models. In Proc. SIGIR-99, the 22nd ACM Conference on Research and Development in Information Retrieval, pages 66–73, Berkeley, USA, August 1999.
D. E. Losada and A. Barreiro. Efficient algorithms for ranking documents represented as DNF formulas. In Proc. SIGIR-2000 Workshop on Mathematical and Formal Methods in Information Retrieval, pages 16–24, Athens, Greece, July 2000.
D. E. Losada and A. Barreiro. Implementing document ranking within a logical framework. In Proc. SPIRE-2000, the 7th Symposium on String Processing and Information Retrieval, pages 188–198, A Coruña, Spain, September 2000.
D. E. Losada and A. Barreiro. An homogeneous framework to model relevance feedback. In Proc. of SIGIR-2001, the 24th ACM Conference on Research and Development in Information Retrieval (poster session), pages 422–423, New Orleans, USA, September 2001.
D. E. Losada and A. Barreiro. A logical model for information retrieval based on propositional logic and belief revision. The Computer Journal, 44(5):410–424, 2001.
D. E. Losada and A. Barreiro. Rating the impact of logical representations on retrieval performance. In Proc. DEXA-2001 Workshop on Logical and Uncertainty Models for Information Systems, LUMIS-2001, pages 247–253, Munich, Germany, September 2001.
D. E. Losada and A. Barreiro. Embedding term similarity and inverse document frequency into a logical model of information retrieval. Journal of the American Society for Information Science and Technology, 2003 (to appear).
GNU mifluz. http://www.gnu.org/software/mifluz.
M. F. Porter. An algorithm for suffix stripping. In K. Sparck Jones and P. Willet, editors, Readings in Information Retrieval, pages 313–316. Morgan Kaufmann Publishers, 1997.
S. E. Robertson, S. Walker, S. Jones, M.M. HancockBeaulieu, and M. Gatford. Okapi at TREC-3. In D. Harman, editor, Proc. TREC-3, the 3rd Text Retrieval Conference, pages 109–127. NIST, 1995.
G. Salton, J. Allan, and C. Buckley. Approaches to passage retrieval in full text information systems. In Proc. SIGIR-93, the 16th ACM Conference on Research and Development in Information Retrieval, pages 49–58, Pittsburgh, USA, June 1993.
Senga: Information retrieval software. http://www.senga.org.
C. J. van Rijsbergen. A non-classical logic for information retrieval. The Computer Journal, 29:481–485, 1986.
R. Wilkinson. Effective retrieval of structured documents. In Proc. SIGIR-94, the 17th ACM Conference on Research and Development in Information Retrieval, pages 311–317, Dublin, UK, July 1994.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Losada, D.E., Barreiro, A. (2003). Propositional Logic Representations for Documents and Queries: A Large-Scale Evaluation. In: Sebastiani, F. (eds) Advances in Information Retrieval. ECIR 2003. Lecture Notes in Computer Science, vol 2633. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36618-0_16
Download citation
DOI: https://doi.org/10.1007/3-540-36618-0_16
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-01274-0
Online ISBN: 978-3-540-36618-8
eBook Packages: Springer Book Archive