Skip to main content

Rule-Based Morphological Tagger for an Inflectional Language

  • Conference paper
Cognitive Behavioural Systems

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7403))

Abstract

This paper aims to present an alternative view on the task of morphological tagging - a rule based system with new and simple learning method that uses just basic arithmetic operations to create an efficient knowledge base. Matching process of this rule-based approach follows specific-to-general technique, where rules for more specific contexts are applied whenever they are available in the rule-base. As a consequence, the major accuracy and performance improvements can be achieved by pruning the rule-base.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Nouza, J., Zdansky, J., Cerva, P., Silovsky, J.: Challenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak). In: Esposito, A., Campbell, N., Vogel, C., Hussain, A., Nijholt, A. (eds.) COST 2102 Int. Training School 2009. LNCS, vol. 5967, pp. 225–241. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  2. Beňuš, S., Cerňak, M., Rusko, M., Trnka, M., Darjaa, S.: Adapting slovak asr for native germans speaking slovak. In: Proceedings of the First Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties, DIALECTS 2011, pp. 60–64. Association for Computational Linguistics, Stroudsburg (2011)

    Google Scholar 

  3. Brants, T.: Tnt: A statistical part-of-speech tagger. In: Proc. of the Sixth Conference on Applied Natural Language Processing, ANLC 2000, pp. 224–231. Association for Computational Linguistics, Stroudsburg (2000)

    Google Scholar 

  4. Collins, M.: Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In: Proceedings of the ACL 2002 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 1–8. Association for Computational Linguistics (2002)

    Google Scholar 

  5. Schmid, H., Laws, F.: Estimation of conditional probabilities with decision trees and an application to fine-grained POS tagging. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1, pp. 777–784. Association for Computational Linguistics (2008)

    Google Scholar 

  6. Toutanova, K., Klein, D., Manning, C.D., Singer, Y.: Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, NAACL 2003, vol. 1, pp. 173–180. Association for Computational Linguistics, Stroudsburg (2003)

    Google Scholar 

  7. Spoustová, D., Hajič, J., Raab, J., Spousta, M.: Semi-supervised training for the averaged perceptron pos tagger. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009, pp. 763–771. Association for Computational Linguistics, Stroudsburg (2009)

    Chapter  Google Scholar 

  8. Spoustová, D., Hajič, J., Votrubec, J., Krbec, P., Květoň, P.: The best of two worlds: Cooperation of statistical and rule-based taggers for Czech. In: Proc. of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies, pp. 67–74. Association for Computational Linguistics (2007)

    Google Scholar 

  9. Holland, J.H.: Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems. Machine Learning: An Artificial Intelligence Approach 2 (1986)

    Google Scholar 

  10. Sigaud, O., Wilson, S.: Learning classifier systems: a survey. Soft Computing-A Fusion of Foundations, Methodologies and Applications 11(11), 1065–1078 (2007)

    MATH  Google Scholar 

  11. Hládek, D.: Learning System Based on Generalization of Fuzzy Rules. PhD thesis, Technical University of Kosice (2009)

    Google Scholar 

  12. Jazykovedný ústav Ľ. Štúra SAV: Slovenský národný korpus prim-3.0-public-all (2007)

    Google Scholar 

  13. Horák, A., Gianitsová, L., Šimková, M., Šmotlák, M., Garabík, R.: Slovak National Corpus. In: Sojka, P., Kopeček, I., Pala, K., et al. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 89–93. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  14. Halácsy, P., Kornai, A., Oravecz, C.: HunPos - An open source trigram tagger. In: Proc. of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, ACL 2007, pp. 209–212. Association for Computational Linguistics, Stroudsburg (2007)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hládek, D., Staš, J., Juhár, J. (2012). Rule-Based Morphological Tagger for an Inflectional Language. In: Esposito, A., Esposito, A.M., Vinciarelli, A., Hoffmann, R., Müller, V.C. (eds) Cognitive Behavioural Systems. Lecture Notes in Computer Science, vol 7403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34584-5_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-34584-5_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34583-8

  • Online ISBN: 978-3-642-34584-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics