Development of Prototype Morphological Analyzer for he South Indian Language of Kannada

Vikram, T. N.; Urs, Shalini R.

doi:10.1007/978-3-540-77094-7_18

T. N. Vikram¹ &
Shalini R. Urs¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4822))

Included in the following conference series:

International Conference on Asian Digital Libraries

1776 Accesses
4 Citations

Abstract

A prototype morphological analyzer for the south Indian language of Kannada is presented in this work. The analyzer is based on Finite state machines and can handle 500 distinct Noun and Verb stems of Kannada. The morphological analyzer can simultaneously serve as a stemmer, part of speech tagger and spell checker and hence it becomes a very efficient tool for content management.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

van Rijsbergen, C.J., Robertson, S.E., Porter, M.F.: New models in probabilistic information retrieval, British Library, London (1980)
Google Scholar
Zhou, Y., Qin, J., Chen, H., Nunamaker, J.F.: Multilingual Web Retrieval: An Experiment on a Multilingual Business Intelligence Portal. Digital Object Identifier (2005), doi:10.1109/HICSS.2005.450
Google Scholar
Idris, N., Syed, S.M.F.D.: Stemming for Term Conflation in Malay Texts. International Conference on Artificial Intelligence (IC-AI 2001) (2001)
Google Scholar
Ma, Q.: Natural language processing with neural networks. Language Engineering Conference, pp. 45–56 (2002)
Google Scholar
Sahoo, K., Vidyasagar, E.V.: Kannada WordNet - A Lexical Database. TENCON Asia Pacific, pp. 1352–1356 (2003)
Google Scholar
Setter, S., Goswami, S., Abhishek, H K.: Indexing software for Ancient Kannada Books. Language Engineering Conference (2002)
Google Scholar
Braschler, M., Ripplinger, B.: How Effective is Stemming and Decompounding for German Text Retrieval? Information Retrieval, 291–306 (2004)
Google Scholar
Tomlinson, S.: Lexical and Algorithmic Stemming Compared for 9 European Languages with Hummingbird SearchServer^TM at CLEF 2003. pp. 286–300 (2003)
Google Scholar
Lee, C.Y.: Local grammar based lexical analyzer for Korean language. In: Proceedings of VEXTEL (1999)
Google Scholar
Min, J., Sun, L., Zhang, J.: ISCAS in English-Chinese CLIR at NTCIR-5. In: Proceedings of NTCIR (2005)
Google Scholar
Sharma, U., Kalita, J., Das, R.: Unsupervised learning of morphology for building lexicon for a highly inflectional language. ACL SIGPHON, 1–10 (2002)
Google Scholar
Das, M., Borgohain, S., Gogoi, J., Nair, S.B.: Design and implementation of spell checker for Assamese (2002)
Google Scholar
Mohanty, S., Santi, P.K., Adhikary, K.P.D.: Analysis and Design of Oriya Morphological Analyser: Some Tests with OriNet. In: Proceeding of symposium on Indian Morphology, phonology and Language Engineering, IIT Kharagpur (2004)
Google Scholar
http://tdil.mit.gov.in/TDIL-OCT-2003/morph%20analyzer.pdf
Hiremath, R.C.: The Structure of Kannada. PhD Thesis. Karnatak University (1961)
Google Scholar
Amsalu, S., Gibbon, D.: Finite state morphology of Amharic. Workshop on RNLAP (2005)
Google Scholar
http://www.research.att.com/~fsmtools/fsm/
Sharada, B.A.: Transformation of Natural language into an indexing language: Kannada- A case study. PhD Thesis. University of Mysore (2002)
Google Scholar
Kay. Nonconcatenative Finite State Morphology. EACL. pp. 2–10 (1985)
Google Scholar
Aho, A.V., Sethi, R., Ulmann, J.D.: Compilers: Principles, Techniques and Tools. Addison wesley, Reading (1985)
Google Scholar
Pal, U., Chaudhuri, B.B.: Indian script character recognition. Pattern Recognition 37, 1887–1899 (2004)
Article Google Scholar
Cao, H.-L., Zhao, T.-J., Li, S., Sun, J., Zhang, C.-X.: Chinese POS tagging based on bilexical co-occurrences. Machine Learning and Cybernetics Conf. (2005)
Google Scholar
http://www.indictrans.in
http://ccat.sas.upenn.edu/plc/kannada/

Download references

Author information

Authors and Affiliations

International School of Information Management, University of Mysore, Manasagangotri, Mysore-570006, Karnataka, India
T. N. Vikram & Shalini R. Urs

Authors

T. N. Vikram
View author publications
You can also search for this author in PubMed Google Scholar
Shalini R. Urs
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Dion Hoe-Lian Goh Tru Hoang Cao Ingeborg Torvik Sølvberg Edie Rasmussen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vikram, T.N., Urs, S.R. (2007). Development of Prototype Morphological Analyzer for he South Indian Language of Kannada. In: Goh, D.HL., Cao, T.H., Sølvberg, I.T., Rasmussen, E. (eds) Asian Digital Libraries. Looking Back 10 Years and Forging New Frontiers. ICADL 2007. Lecture Notes in Computer Science, vol 4822. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77094-7_18

Download citation

DOI: https://doi.org/10.1007/978-3-540-77094-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77093-0
Online ISBN: 978-3-540-77094-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics