Abstract
The design and development of a spellchecker for highly inflected languages is commonly regarded as a challenging task. In this paper, we present the architecture of Hascheck, a spellchecking system developed for the Croatian language. We describe the functional elements that make it an intelligent system and discuss specific issues related to Hascheck’s dictionary size, as well as its guessing and learning capabilities.
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dembitz, Š., Gledec, G., Blašković, B. (2010). Architecture of Hascheck – An Intelligent Spellchecker for Croatian Language. In: Setchi, R., Jordanov, I., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2010. Lecture Notes in Computer Science, vol 6277. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15390-7_30
DOI: https://doi.org/10.1007/978-3-642-15390-7_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15389-1
Online ISBN: 978-3-642-15390-7
eBook Packages: Computer Science, Computer Science (R0)