Abstract
Word prediction is the problem of guessing the words which are likely to follow in a given text segment by displaying a list of the most probable words that could appear in that position. In this research, we designed and implemented three word predictors for Persian. Our baseline is a statistical-based system which uses language models. The first system uses word statistics; in the second one we use the main syntactic categories of a Persian POS tagged corpus; and the last one uses the main syntactic categories along with their morphological, syntactic and semantic subcategories. Using KeyStroke Saving (KSS) as the most important metrics to evaluate systems’ performance, the primary word-based statistical system achieved 37% KSS, and the second system that used only the main syntactic categories with word-statistics achieved 38.95% KSS. Our last system which used all of the available information to the words get the best result by 42.45% KSS.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Booth, L., Beattie, W., Newell, A.: I know what you mean. Special Children, pp. 26-27 (1990)
Carlberger, J.: Word Prediction: Design and Implementation of a probabilistic Word Prediction Program. Master dissertation. Royal Institute of Technology. Stockholm (1997)
Carlberger, A., Magnuson, T., Carlberger, J., Wachtmeister, H., Hunnicutt, S.: Probability-based word prediction for writing support in dyslexia. In: Barner, R., Heldner, M., Sullivan, K., Wretling, P. (eds.) Proceedings of Fonetik 1997 Conference, vol. 4, pp. 17–20 (1997a)
Carlberger, A., Carlberger, J., Magnuson, T., Hunnicutt, M.S., Palazuelos-Cagigas, S.E., Navarro, S.A.: Profet, a new generation of word prediction: An evaluation study. In: Copestake, A., Langer, S., Palazuelos-Cagigas, S. (eds.) Natural Language Processing for Communication aids, In Proceedings of a workshop sponsored by ACL, Madrid, Spain, pp. 23–28 (1997b)
Fazly, A.: The Use of Syntax in Word Completion Utilities. Master dissertation. University of Toronto, Canada (2002)
Freund, Y., Shapire, R.E.: Experiments with new boosting algorithm. In: Proceedings of ICML (1996)
Ghayoomi, M.: Word Prediction in Computational Processing of the Persian Language. Master dissertation. Iran: Islamic Azad University, Tehran Central Branch (2004)
Ghayoomi, M.: Using word prediction systems for users with disabilities: A case study. In: Proceedings of the 2nd Workshop on the Persian Language and Computer, Tehran University, Iran, June 27-28, 2006, pp. 216–225 (2006)
Ghayoomi, M., Assi, S.M.: Word prediction in a running text: A statistical language modeling for the Persian language. In: Proceedings of the Australasian Language Technology Workshop, University of Sydney, Australia, Dec. 10-11, 2005, pp. 57–63 (2005)
Gustavii, E., Pettersson, E.: A Swedish Grammar for Word Prediction. Uppsala University, Stockholm (2003)
Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice-Hall, New Jersey (2000)
Klund, J., Novak, M.: If word prediction can help, which program do you choose? (2001), http://trace.wisc.edu/docs/wordprediction2001/index.htm
Manning, C.D., SchĂĽtze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)
McCoy, K., Demasco, P.: Some application of natural language processing to the field of augmentative and alternative communication. In: Proceeding of the IJCAI – 1995 Workshop on Developing AI Applications for People with Disabilities (1995)
Morris, C., Newell, A., Booth, L., Arnott, J.: Syntax pal: A system to improve the written syntax of language-impaired users. Assistive Technology 4(2), 51–59 (1992)
Nantais, T., Shein, F., Johansson, M.: Efficacy of the word prediction algorithm in WordQ. In: Proceedings of the 24th Annual Conference on Technology and Disability, RESNA (2001)
Rosenfeld, R.: Adaptive Statistical Language Modeling: A Maximum Entropy Approach. PhD. dissertation. Pittsburgh, Canegie Mellon University (1994)
Shein, F., Nantais, T., Nishiyama, R., Tam, C., Marshall, P.: Word cuing for persons with writing difficulties: WordQ. In: The16th Annual International Conference on Technology and Persons with Disabilities, California State University at Northridge, Los Angeles, CA (March 2001)
Soede, M., Foulds, R.A.: Dilemma of prediction in communication aids and mental load. In: Proceedings of the 9th Annual Conference on Rehabilitation Technology, pp. 357–359 (1986)
Swiffin, A.L., Pickering, J.A., Arnott, J.L., Newell, A.F.: PAL: An effort efficient portable communication aid and keyboard emulator. In: Proceedings of the 8th Annual Conference on Rehabilitation Technology, pp. 197–199 (1985)
Wood, M.E.J.: Syntactic Pre-Processing in Single-Word Prediction for Disabled People. Ph.D. dissertation. University of Bristol, Bristol (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ghayoomi, M., Daroodi, E. (2008). A POS-Based Word Prediction System for the Persian Language. In: Nordström, B., Ranta, A. (eds) Advances in Natural Language Processing. GoTAL 2008. Lecture Notes in Computer Science(), vol 5221. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85287-2_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-85287-2_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85286-5
Online ISBN: 978-3-540-85287-2
eBook Packages: Computer ScienceComputer Science (R0)