
Bulgarian-English Question Answering: Adaptation of Language Resources

  • Conference paper
Multilingual Information Access for Text, Speech and Images (CLEF 2004)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3491))

This paper describes the Bulgarian part of a Bulgarian–English question answering system. The Bulgarian modules are implemented as a question analysis procedure within a Bulgarian question answering system, BulQA. The paper presents the available language resources and the corresponding technology used to analyse questions in Bulgarian and to translate them into an English format, which is necessary for answer extraction. The CLaRK System is used as the implementation platform.





Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Osenova, P., Simov, A., Simov, K., Tanev, H., Kouylekov, M. (2005). Bulgarian-English Question Answering: Adaptation of Language Resources. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg.


  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27420-9

  • Online ISBN: 978-3-540-32051-7

  • eBook Packages: Computer Science, Computer Science (R0)
