Abstract
Effective information retrieval in XML documents requires the user to have good knowledge of document structure and of some formal query language. XML query languages like XPath and XQuery are too complex to be considered for use by end users. We present an approach to XML query processing that supports the specification of both textual and structural constraints in natural language. We implemented a system that supports the evaluation of both formal XPath-like queries and natural language XML queries. We present comparative test results that were performed with the INEX 2004 topics and XML collection. Our results quantify the trade-off in performance of natural language XML queries vs formal queries with favourable results.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Smeaton, A.F.: Information Retrieval: Still Butting Heads with Natural Language Processing? In: Pazienza, M.T. (ed.) SCIE 1997. LNCS, vol. 1299, pp. 115–138. Springer, Heidelberg (1997)
Smeaton, A.F.: Using NLP or NLP Resources for Information Retrieval Tasks. [12], pp. 99–111
Arampatzis, A., van der Weide, T., Koster, C., van Bommel, P.: Linguistically-motivated Information Retrieval. In: Kent, A. (ed.) Encyclopedia of Library and Information Science, vol. 69, pp. 201–222. Marcel Dekker, Inc., New York (2000)
Sparck Jones, K.: What is the role of NLP in text retrieval? [12], pp. 1–24
Androutsopoulos, I., Ritchie, G.D., Thanisch, P.: Natural Language Interfaces to Databases – An Introduction. Journal of Natural Language Engineering 1, 29–81 (1995)
Copestake, A., Jones, K.S.: Natural Language Interfaces to Databases. The Knowledge Engineering Review 5, 225–249 (1990)
Perrault, C., Grosz, B.: Natural Language Interfaces. Exploring Articial Intelligence, 133–172 (1988)
Fuhr, N., Lalmas, M., Malik, S., Szlàvik, Z. (eds.): Advances in XML Information Retrieval. Third Workshop of the Initiative for the Evaluation of XML retrieval (INEX). LNCS, vol. 3493. Springer, Heidelberg (2005)
Trotman, A., Sigurbjörnsson, B.: Narrowed Extended XPath I (NEXI). [8]
Tannier, X., Girardot, J.J., Mathieu, M.: Analysing Natural Language Queries at INEX 2004, [8], pp. 395–409 (2004)
Geva, S.: GPX - Gardens Point XML Information Retrieval at INEX 2004. [8]
Strzalkowski, T. (ed.): Natural Language Information Retrieval. Kluwer Academic Publisher, Dordrecht (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tannier, X., Geva, S. (2005). XML Retrieval with a Natural Language Interface. In: Consens, M., Navarro, G. (eds) String Processing and Information Retrieval. SPIRE 2005. Lecture Notes in Computer Science, vol 3772. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11575832_4
Download citation
DOI: https://doi.org/10.1007/11575832_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29740-6
Online ISBN: 978-3-540-32241-2
eBook Packages: Computer ScienceComputer Science (R0)