Abstract
This paper is a report on our third participation in CLEF. More precisely, this year we have participated in the Spanish Monolingual Question Answering Track for the first time. As a result we have developed a prototype of a QA system. Our prototype continues to apply the Natural Language Processing techniques we had already developed for single word conflation. In addition, the question analysis is based on complex pattern matching either over forms, part-of-speech tags or lemmas of the words involved. Regarding the search for relevant parts of documents containing the required answer, we use conventional IR techniques, whilst the extraction of the answer from the relevant parts of documents is again based on pattern matching.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Vilares, J., Alonso, M.A., Ribadas, F.J., Vilares, M.: COLE experiments at CLEF 2002 Spanish monolingual track. In: Peters, C., Braschler, M., Gonzalo, J. (eds.) CLEF 2002. LNCS, vol. 2785, pp. 265–278. Springer, Heidelberg (2003)
Vilares, J., Alonso, M.A., Ribadas, F.J.: COLE experiments at CLEF 2003 Spanish monolingual track. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 345–357. Springer, Heidelberg (2004)
Arampatzis, A., van der Weide, T.P., van Bommel, P., Koster, C.: Linguistically-motivated information retrieval. In: Encyclopedia of Library and Information Science, vol. 69, pp. 201–222. Marcel Dekker, Inc., New York-Basel (2000)
Figuerola, C.G., Gómez, R., Zazo Rodríguez, A.F., Alonso Berrocal, J.L.: Stemming in Spanish: A first approach to its impact on information retrieval. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406. Springer, Heidelberg (2002)
Vilares, M., Graña, J., Alvariño, P.: Finite-state morphology and formal verification. Journal of Natural Language Engineering, special issue on Extended Finite State Models of Language 3, 303–304 (1997)
Kowalski, G.: Information Retrieval Systems: Theory and Implementation. The Kluwer international series on Information Retrieval. Kluwer Academic Publishers, Boston-Dordrecht-London (1997)
Palmer, D.D.: Tokenisation and Sentence Segmentation. In: Dale, R., Moisi, H., Somers, H. (eds.) Handbook of Natural Language Processing. Marcel Dekker, Inc., New York (2000)
Graña, J., Barcala, F.M., Vilares, J.: Formal methods of tokenization for part-of-speech tagging. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 240–249. Springer, Heidelberg (2002)
Barcala, F.M., Vilares, J., Alonso, M.A., Graña, J., Vilares, M.: Tokenization and proper noun recognition for information retrieval. In: 3rd International Workshop on Natural Language and Information Systems (NLIS 2002), September 2-3. IEEE Computer Society Press, Los Alamitos (2002)
Graña, J.: Técnicas de Análisis Sintáctico Robusto para la Etiquetación del Lenguaje Natural. PhD thesis, Departamento de Computación, Universidade da Coruña, A Coruña, Spain (2000)
Brants, T.: TnT - a statistical part-of-speech tagger. In: Proceedings of the Sixth Applied Natural Language Processing Conference (ANLP 2000), Seattle, WA (2000)
Graña, J., Barcala, F.M., Alonso, M.: Compilation methods of minimal acyclic automata for large dictionaries. In: Watson, B.W., Wood, D. (eds.) CIAA 2001. LNCS, vol. 2494, pp. 116–129. Springer, Heidelberg (2003)
Graña, J., Chappelier, J.C., Vilares, M.: Integrating external dictionaries into stochastic part-of-speech taggers. In: Angelova, G., Bontcheva, K., Mitkov, R., Nocolov, N., Nikolov, N. (eds.) EuroConference Recent Advances in Natural Language Processing, Proceedings, Tzigov Chark, Bulgaria, pp. 122–128 (2001)
Graña Gil, J., Alonso Pardo, M.A., Vilares Ferro, M.: A common solution for tokenization and part-of-speech tagging. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2002. LNCS (LNAI), vol. 2448, pp. 3–10. Springer, Heidelberg (2002)
Abney, S.: Partial parsing via finite-state cascades. Natural Language Engineering 2, 337–344 (1997)
Kaszkiel, M., Zobel, J.: Effective ranking with arbitrary passages. Journal of the American Society of Information Science 52, 344–364 (2001)
Llopis, F., Vicedo, J.L., Ferrández, A.: IR-n system at CLEF-2002. In: Peters, C., Braschler, M., Gonzalo, J. (eds.) CLEF 2002. LNCS, vol. 2785, pp. 291–300. Springer, Heidelberg (2003)
Jacquemin, C., Tzoukermann, E.: NLP for term variant extraction: synergy between morphology, lexicon and syntax. In: Strzalkowski, T. (ed.) Natural Language Information Retrieval, Text, Speech and Language Technology, vol. 7, pp. 25–74. Kluwer Academic Publishers, Dordrecht (1999)
Koster, C.H.A.: Head/Modifier frames for information retrieval. In: Gelbukh, A. (ed.) CICLing 2004. LNCS, vol. 2945, pp. 420–432. Springer, Heidelberg (2004)
http://www.itl.nist.gov/iaui/894.02/works/papers/zp2/zp2.html (site visited August 2004)
Robertson, S.E., Walker, S.: Okapi/Keenbow at TREC-8. In: Voorhees, E.M., Harman, D.K. (eds.) NIST Special Publication 500-246: The Eighth Text REtrieval Conference (TREC 8), Gaithersburg, MD, USA, Department of Commerce, National Institute of Standards and Technology, pp. 151–162 (2000)
Savoy, J.: Report on CLEF-2002 Experiments: Combining Multiple Sources of Evidence. In: Peters, C., Braschler, M., Gonzalo, J. (eds.) CLEF 2002. LNCS, vol. 2785, pp. 31–46. Springer, Heidelberg (2003)
ftp://ftp.cs.cornell.edu/pub/smart (site visited August 2004)
Vilares, J., Alonso, M.A.: Dealing with syntactic variation through a locality-based approach. In: Apostolico, A., Melucci, M. (eds.) SPIRE 2004. LNCS, vol. 3246, pp. 255–266. Springer, Heidelberg (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Díaz, E.M., Ferro, J.V., Souto, D.C. (2005). COLE Experiments at QA@CLEF 2004 Spanish Monolingual Track. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_53
Download citation
DOI: https://doi.org/10.1007/11519645_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer ScienceComputer Science (R0)