Skip to main content

A Computational Grammar of Sinhala

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7181))

Abstract

A Computational Grammar for a language is a very useful resource for carrying out various language processing tasks for that language such as Grammar checking, Machine Translation and Question Answering. As is the case in most South Indian Languages, Sinhala is a highly inflected language with three gender forms and two number forms among other grammatical features. While piecemeal descriptions of Sinhala grammar is reported in the literature, no comprehensive effort to develop a context-free grammar (CFG) has been made that has been able to account for any significant coverage of the language. This paper describes the development of a feature-based CFG for non-trivial sentences in Sinhala. The resulting grammar covers a significant subset of Sinhala as described in a well-known grammar book. A parser for producing the appropriate parse tree(s) of input sentences was also developed using the NLTK toolkit. The grammar also detects and so rejects ungrammatical sentences. Two hundred sample sentences taken from primary grade Sinhala grammar books were used to test the grammar. The grammar accounted for 60% of the coverage over these sentences.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abhayasinghe, A.A.: Sinhala bhashave sarala vakya vibagaya (1998)

    Google Scholar 

  2. Ayesha Binte Mosaddeque, A.B., Haque, N.: Context-Free Grammar for Bangla. BRAC University, Dhaka

    Google Scholar 

  3. Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit. O’Reilly Media (2009)

    Google Scholar 

  4. Disanayaka, J.B.: Bashavaka rata samudaya. Lake house investment Co. Ltd., Colombo 2 (1969)

    Google Scholar 

  5. Fairbanks, G.H., Gair, J.W., Silva, M.W.S.D.: Colloquial Sinhalese. Cornell University, New York (1968)

    Google Scholar 

  6. Gair, J.W., Karunatilaka, W.S.: Literary Sinhala inflected forms: A Synopsis with a Translation Guide to Sinhala script. Cornell University, New York

    Google Scholar 

  7. Gair, J.W., Karunatilaka, W.S.: Literary Sinhala. Cornell University, New York (1974)

    Google Scholar 

  8. Gunasekara, A.M.: A Comprehensive Grammar of the Sinhalese Language. Godage International Publishers (PVT) Ltd. (2008)

    Google Scholar 

  9. Hettige, B., Karunananda, A.S.: Computational Model of Grammar for English to Sinhala Machine Translation. In: Proceedings of the International Conference on Advances in ICT for Emerging Regions (2011)

    Google Scholar 

  10. Jayawardhane, T.: The surface case system in Sinhala. KALYANI, pp. 264–277. University of Kelaniya (1996)

    Google Scholar 

  11. Kariyakarawana, S.M.: The Syntax of Focus and Wh-Questions in Sinhala. Karunaratne & Sons Ltd. (1998)

    Google Scholar 

  12. Karunatilaka, W.S.: Sinhala bhasha vyakaranaya. M. D. Gunasena & Co. Ltd. (2009)

    Google Scholar 

  13. Kekulawala, S.L.: The future tense in Sinhalese – an ‘unorthodox’ point of view. Journal of the Vidyalankara University of Ceylon (1972)

    Google Scholar 

  14. Khan, N., Khan, M.: Developing a Computational Grammar for Bengali Using the HPSG Formalism. In: Proceedings of the 9th International Conference on Computer and Information Technology, ICCIT 2006 (2006)

    Google Scholar 

  15. Rajapaksha, D.: Sinhala bhashave pada bedima saha virama lakshana bhavithaya (2008)

    Google Scholar 

  16. Sagar, B.M., Shobha, G., Kumar, R.: Context Free Grammar (CFG) Analysis for simple Kannada sentences. In: Proceedings of the International Conference [ACCTA-2010] on Special Issue of IJCCT, vol. 1(2, 3, 4) (2010)

    Google Scholar 

  17. Sagar, B.M., Shobha, G., Kumar, R.: Solving the Noun Phrase and Verb Phrase Agreement in Kannada Sentences. International Journal of Computer Theory and Engineering 1(3) (August 2009)

    Google Scholar 

  18. Wikipedia (English), http://en.wikipedia.org/wiki/Sinhala_language

  19. Dasanayaka, A.E.S.: Kumara rachanaya; Grade 4. M. D. Gunasena & Co. Ltd. (1990)

    Google Scholar 

  20. Dasanayaka, A.E.S.: Kumara rachanaya; Grade 5, M. D. Gunasena & Co. Ltd. (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liyanage, C., Pushpananda, R., Herath, D.L., Weerasinghe, R. (2012). A Computational Grammar of Sinhala. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2012. Lecture Notes in Computer Science, vol 7181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28604-9_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-28604-9_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28603-2

  • Online ISBN: 978-3-642-28604-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics