Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4518))

Abstract

XML information retrieval (XML-IR) systems aim to better fulfil users’ information needs than traditional IR systems by returning results lower than the document level. In order to use XML-IR systems users must encapsulate their structural and content information needs in a structured query. Historically, these structured queries have been formatted using formal languages such as NEXI. Unfortunately, formal query languages are very complex and too difficult to be used by experienced - let alone casual - users and are too closely bound to the underlying physical structure of the collection. INEX’s NLP task investigates the potential of using natural language to specify structured queries. QUT has participated in the NLP task with our system NLPX since its inception. Here, we discuss the changes we’ve made to NLPX since last year, including our efforts to port NLPX to Wikipedia. Second, we present the results from the 2006 INEX track where NLPX was the best performing participant in the Thorough and Focused tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abney, S.: Parsing by Chunks. In: Principle-Based Parsing, Kluwer Academic Publishers, Boston, MA (1991)

    Google Scholar 

  2. Brill, E.: A Simple Rule-Based Part of Speech Tagger. In: Proceedings of the Third Conference on Applied Computational Linguistics (ACL), Trento, Italy, pp. 152–155 (1992)

    Google Scholar 

  3. Clark J., DeRose, S.: XML Path Language XPath Version 1.0. W3C Recommendation, The World Wide Web Consortium (November 1999), available at http://www.w3.org/TR/xpath

  4. Fox, C.: Lexical Analysis and Stoplists. In: Frankes, W.B., Baeza-Yates, R. (eds.) Information Retrieval: Data Structures and Algorithms, ch. 7, pp. 102–130. Prentice-Hall, Upper Saddle River (1992)

    Google Scholar 

  5. Geva, S.: GPX - Gardens Point XML Information Retrieval INEX 2004. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 221–222. Springer, Heidelberg (2005)

    Google Scholar 

  6. Larson, R.: XML Element Retrieval and Heterogenous Retrieval. In Pursuit of the Impossible? In: Proceedings of INEX 2005 Workshop on Element Retrieval Methodology, Glasgow, Scotland, pp. 38–41 (2005)

    Google Scholar 

  7. O’Keefe, R., Trotman, A.: The Simplest Query Language That Could Possibly Work. In: Fuhr N., Malik, S. (eds.) INEX 2003 Workshop Proceedings. Dagstuhl, Germany, pp. 167–174 (2003)

    Google Scholar 

  8. Ramshaw, L., Marcus, M.: Text Chunking Using Transformation-Based Learning. In: Proceedings of the Third Workshop on Very Large Corpora, pp. 82–94 (1995)

    Google Scholar 

  9. Trotman, A., Lamas, M.: Why structural hints in queries do not help XML-retrieval. In: Efthimiadis E. N., Dumais S. T., Hawking, D., Järvelin, K. (eds.) SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, Washington, USA, August 6-11, 2006, pp. 711–712 (2006)

    Google Scholar 

  10. Trotman, A., Sigurbjörnsson, B.: NEXI: Now and Next. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 410–423. Springer, Heidelberg (2005)

    Google Scholar 

  11. Woodley, A., Geva, S.: NLPX: An XML-IR System with a Natural Language Interface, In: Bruza, P., Moffat, A., Turpin, A (eds.) Proceedings of the Australasian Document Computing Symposium, Melbourne, Australia, pp. 71–74 (2004)

    Google Scholar 

  12. Woodley, A., Geva, S.: NLPX at INEX 2004. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 393–406. Springer, Heidelberg (2005)

    Google Scholar 

  13. Woodley, A., Geva, S.: NLPX at INEX 2005. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 358–372. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  14. van Zowl, R., Bass, J., van Oostendorp, H., Wiering, F.: Query Formulation for XML Retrieval with Bricks. In: Fuhr, N., Lamas, M., Trotman, A. (eds.) Proceedings of INEX 2005 Workshop on Element Retrieval Methodology, Glasgow, Scotland, pp. 75-83 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Norbert Fuhr Mounia Lalmas Andrew Trotman

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Woodley, A., Geva, S. (2007). NLPX at INEX 2006. In: Fuhr, N., Lalmas, M., Trotman, A. (eds) Comparative Evaluation of XML Information Retrieval Systems. INEX 2006. Lecture Notes in Computer Science, vol 4518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73888-6_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-73888-6_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73887-9

  • Online ISBN: 978-3-540-73888-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics