COLE Experiments at QA@CLEF 2004 Spanish Monolingual Track

  • Conference paper
Multilingual Information Access for Text, Speech and Images (CLEF 2004)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3491)

Abstract

This paper reports on our third participation in CLEF. This year we took part in the Spanish Monolingual Question Answering Track for the first time and, as a result, developed a prototype QA system. The prototype continues to apply the Natural Language Processing techniques we had already developed for single-word conflation. In addition, question analysis is based on complex pattern matching over the forms, part-of-speech tags, or lemmas of the words involved. To locate the parts of documents likely to contain the required answer we use conventional IR techniques, whilst the extraction of the answer from those parts is again based on pattern matching.
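
To make the idea of pattern matching over forms, part-of-speech tags, or lemmas concrete, the following is a minimal sketch and not the prototype's actual implementation. The `Token` type, the tag names, the answer-type labels, and the patterns themselves are all hypothetical and chosen only for illustration; each pattern element constrains exactly one of the three fields of a token.

```python
from typing import List, NamedTuple, Optional, Tuple


class Token(NamedTuple):
    form: str   # surface word as it appears in the question
    tag: str    # part-of-speech tag (hypothetical tagset)
    lemma: str  # dictionary form of the word


# One pattern element constrains a single token field: (field name, value).
Constraint = Tuple[str, str]

# Hypothetical patterns mapping a token sequence to an expected answer type.
# They are illustrative assumptions, not the patterns used in the paper.
PATTERNS: List[Tuple[str, List[Constraint]]] = [
    ("PERSON", [("lemma", "quién")]),
    ("DATE", [("lemma", "cuándo")]),
    ("DEFINITION", [("lemma", "qué"), ("form", "es")]),
    ("QUANTITY", [("lemma", "cuánto"), ("tag", "NOUN")]),
]


def matches_at(tokens: List[Token], start: int, pattern: List[Constraint]) -> bool:
    """Check whether every constraint holds on consecutive tokens from start."""
    if start + len(pattern) > len(tokens):
        return False
    return all(
        getattr(tokens[start + i], field) == value
        for i, (field, value) in enumerate(pattern)
    )


def classify(tokens: List[Token]) -> Optional[str]:
    """Return the label of the first pattern that matches anywhere, else None."""
    for label, pattern in PATTERNS:
        for start in range(len(tokens)):
            if matches_at(tokens, start, pattern):
                return label
    return None


# "¿Cuántos habitantes tiene Madrid?" after (hypothetical) tagging and lemmatisation.
question = [
    Token("Cuántos", "DET", "cuánto"),
    Token("habitantes", "NOUN", "habitante"),
    Token("tiene", "VERB", "tener"),
    Token("Madrid", "PROPN", "Madrid"),
]
print(classify(question))  # QUANTITY
```

Matching on lemmas rather than forms lets a single pattern cover inflected variants such as "cuánto", "cuánta", "cuántos" and "cuántas", while a tag constraint generalises over any noun that follows.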



Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Díaz, E.M., Ferro, J.V., Souto, D.C. (2005). COLE Experiments at QA@CLEF 2004 Spanish Monolingual Track. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_53

  • DOI: https://doi.org/10.1007/11519645_53

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27420-9

  • Online ISBN: 978-3-540-32051-7

  • eBook Packages: Computer Science, Computer Science (R0)