Abstract
This paper describes the Bulgarian part of a Bulgarian–English question answering system. The Bulgarian modules are implemented as a question analysis procedure within a Bulgarian question answering system — BulQA. The paper presents the available language resources and corresponding technology which is used for the analysis of the questions in Bulgarian and their translation into English format, which is necessary for answer extraction. CLaRK System is used as an implementation platform.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Balabanova, E., Ivanova, K.: Creating a Machine-Readable Version of Bulgarian Valence Dictionary (A case study of CLaRK system application). In: Proceedings of The TLT Workshop, Sozopol, Bulgaria, pp. 1–12 (2002)
Ivanova, K., Doikoff, D.: Cascaded Regular Grammars and Constraints over Morphologically Annotated Data for Ambiguity Resolution. In: Proceedings of The TLT Workshop, Sozopol, Bulgaria, pp. 96–113 (2002)
Magnini, B., Negri, M., Prevete, R., Tanev, H.: Comparing Statistical and Content-Based Techniques for Answer Validation on the Web. In: Proceedings of the VIII Convegno AI*IA, Siena, Italy (2002)
Negri, M., Tanev, H., Magnini, B.: Bridging Languages for Question Answering: DIOGENE at CLEF 2003. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 321–330. Springer, Heidelberg (2004)
Osenova, P.: Bulgarian Nominal Chunks and Mapping Strategies for Deeper Syntactic Analyses. In: Proceedings of The TLT Workshop, Sozopol, Bulgaria, pp. 150–166 (2002)
Osenova, P., Kolkovska, S.: Combining the named-entity recognition task and NP chunking strategy for robust pre-processing. In: Proceedings of The TLT Workshop, Sozopol, Bulgaria, pp. 167–182 (2002)
Osenova, P., Simov, K.: Learning a token classification from a large corpus (A case study in abbreviations). In: Proceedings of the ESSLLI Workshop on Machine Learning Approaches in Computational Linguistics, Trento, Italy, pp. 16–28 (2002)
Pianta, E., Bentivogli, L., Girardi, C.: MULTIWORDNET: Developing an Aligned Multilingual Database. In: Proceedings of the 1st International Global WordNet Conference, Mysore, India (2002)
Popov, D., Simov, K., Vidinska, S.: A Dictionary of Writing, Pronunciation and Punctuation of Bulgarian Language, Atlantis LK, Sofia, Bulgaria (1998)
Simov, K., Peev, Z., Kouylekov, M., Simov, A., Dimitrov, M., Kiryakov, A.: CLaRK — an XML-based System for Corpora Development. In: Proceedings of the Corpus Linguistics 2001 Conference, pp. 558–560 (2001)
Simov, K., Osenova, P.: A Hybrid System for MorphoSyntactic Disambiguation in Bulgarian. In: Proceedings of the RANLP 2001, Tzigov chark, Bulgaria, pp. 288–290 (2001)
Simov, K., Popova, G., Osenova, P.: HPSG-based syntactic treebank of Bulgarian (BulTreeBank). A Rainbow of Corpora: Corpus Linguistics and the Languages of the World. In: Wilson, A., Rayson, P., McEnery, T. (eds.) A Rainbow of Corpora: Corpus Linguistics and the Languages of the World, Lincom-Europa, Munich, pp. 135–142 (2002)
Simov, K., Kouylekov, M., Simov, A.: Cascaded Regular Grammars over XML Documents. In: Proceedings of the 2nd Workshop on NLP and XML (NLPXML 2002), Taipei, Taiwan (2002)
Simov, K., Simov, A., Kouylekov, M.: Constraints for Corpora Development and Validation. In: Proceedings of the Corpus Linguistics 2003 Conference, pp. 698–705 (2003)
Simov, K., Simov, A., Osenova, P.: An XML Architecture for Shallow and Deep Processing. In: Proceedings of the ESSLLI 2004 Workshop on Combining Shallow and Deep Processing for NLP, Nancy, France, pp. 51–60 (2004)
Slavcheva, M.: Segmentation Layers in the Group of the Predicate: a Case Study of Bulgarian within the BulTreeBank Framework. In: Proceedings of The TLT Workshop, Sozopol, Bulgaria, pp. 199–210 (2002)
XML: Extensible Markup Language (XML) 1.0, 2nd edn. W3C Recommendation (2000), http://www.w3.org/TR/REC-xml
XPath, X.M.L.: Path Language (XPath) version 1.0. W3C Recommendation (1999), http://www.w3.org/TR/xpath
XSLT: XSL Transformations (XSLT). version 1.0. W3C Recommendation (1999), http://www.w3.org/TR/xslt
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Osenova, P., Simov, A., Simov, K., Tanev, H., Kouylekov, M. (2005). Bulgarian-English Question Answering: Adaptation of Language Resources. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_45
Download citation
DOI: https://doi.org/10.1007/11519645_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer ScienceComputer Science (R0)