Abstract
This paper presents the ITC-irst Multilingual Question Answering system Diogene. The system was used successfully on the CLEF-2003, TREC-2003, TREC-2002 and TREC-2001 QA tracks. Diogene relies on a classical three-layer architecture: question processing, document retrieval, answer extraction and validation. Diogene uses MultiWordNet [8] (http:// multiwordnet.itc.it) which facilitates the transfer of knowledge between languages. For answer validation we used the Web. This year we also used a set of linguistic templates for answering specific questions like definition questions, location questions, and a subset of who-is and what-is questions. Diogene participated in both the monolingual Italian-Italian task and in the cross-language Italian-English task. We also collaborated with the Bulgarian Academy of Sciences in the cross-language Bulgarian-English QA task.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Burger, J., Cardie, C., Chaudhri, V., Gaizauskas, R., Harabagiu, S., Israel, D., Jacquemin, C., Lin, C.-Y., Maiorano, S., Miller, G., Moldovan, D., Ogden, B., Prager, J., Riloff, E., Singhal, A., Shrihari, R., Strzalkowski, T., Voorhees, E., Weishedel, R.: Issues, Tasks and Program Structures to Roadmap Research in Question & Answering, Q&A (2001), http://www.nlpir.nist.gov/projects/duc/papers/qa.Roadmap-paper_v2.doc
Magnini, B., Negri, M., Prevete, R., Tanev, H.: Mining Knowledge from Repeated Co-occurrences: Diogene at TREC 2002 Proceedings of the Eleventh Text Retrieval Conference (TREC 2002), Gaithersburg, MD (2002)
Magnini, B., Negri, M., Prevete, R., Tanev, H.: Comparing Statistical and Content-Based Techniques for Answer Validation on the Web. In: Proceedings of the VIII Convegno AI*IA, Siena, Italy (2002)
Magnini, B., Negri, M., Prevete, R., Tanev, H.: A WordNet-Based Approach to Named Entities Recognition. In: Proceedings of SemaNet 2002, COLING Workshop on Building and Using Semantic Networks, Taipei, Taiwan (2002)
Magnini, B., Negri, M., Prevete, R., Tanev, H.: Is It the Right Answer? Exploiting Web Redundancy for Answer Validation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), Philadelphia, PA (2002)
Manning, C., Shutze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)
Negri, M., Tanev, H., Magnini, B.: Bridging Languages for Question Answering: Diogene at CLEF-2003. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 501–513. Springer, Heidelberg (2004)
Pianta, E., Bentivogli, L., Girardi, C.: MultiWordNet: Developing an Aligned Multilingual Database. In: Proceedings of the 1stInternational Global WordNet Conference, Mysore, India (2002)
Ravichandran, D., Hovy, E.: Learning Surface Text Patterns for a Question Answering System. In: Proceedings of the 40th ACL Conference, University of Pennsylvania, Philadelphia (2002)
Tanev, H., Kouylekov, M., Negri, M., Coppola, B., Magnini, B.: Multilingual Pattern Libraries for Question Answering: a Case Study for Definition Questions. In: Fourth International Conference on Language Resources and Evaluation (LREC 2004) Proceedings, Lisbon, Portugal, May 26-28 (2004)
Voorhees, E.: Overview of the TREC 2003 Question Answering Track. In: Proceedings of the Sixth Retrieval Conference (TREC 2003), Gaithersburg, MD (2004)
Witten, I.H., Moffat, A., Bell, T.: Managing Gigabytes: Compressing and Indexing Documents and Images, 2nd edn. Morgan Kaufmann Publishers, New York (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tanev, H., Negri, M., Magnini, B., Kouylekov, M. (2005). The DIOGENE Question Answering System at CLEF-2004. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_43
Download citation
DOI: https://doi.org/10.1007/11519645_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer ScienceComputer Science (R0)