Abstract
Homonyms are words that have the same form but a different meaning. These types of words have always been considered to be of interest for linguistic developments in the course of general linguistics. In this regard, homonyms found in texts are separately studied in Russian and European linguistics in connection with the linguistic corpus. This article analyzes methods and models in linguistic programs and systems based on linguistic software in world linguistics, such as Brill’s method, hidden Markov model, modification of models and their connection with the linguistic corpus, and provides insights into the importance of the national corpus in linguistic processing. In this article, several methods for identifying homonyms in Uzbek language texts were deliberated, and the N-gram method was identified as one of the most reliable methods for the Uzbek language. For this, in work are made 75 models for bigram and trigram ways of connecting words in the Uzbek language.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Abjalova, M.: Linguistic modules of the program of editing and analyzing texts in the Uzbek language (for the program of editing texts in official and scientific style). Dissertation, Doctor of Philosophy in Philology (2020)
Abjalova, M.A., Yuldashev, A.: Methods for determining homonyms in homonymy and linguistic systems. ACADEMICIA: Int. Multidisc. Res. J. 11(2), 1370–1375 (2021). Impact Factor: SJIF 2021 = 7.492 (https://saarj.com). ISSN 2249-7137. https://doi.org/10.5958/2249-7137.2021.00522.X
Abjalova, M.A.: Linguistic modules of editing and analysis programs. Monograph. – Tashkent, Nodirabegim, p. 176 (2020)
Abjalova, M., Iskandarov O.: Methods of tagging part of speech of Uzbek language. In: IEEE – UBMK – 2021: 6th International Conference on Computer Science and Engineering. – Ankara – Turkey, 15–17 September 2021, pp. 82–85 (2021)
Abjalova, M., Gulomova, N.: Author’s corpus of Alisher Navoi and its semantic database. In: IEEE – UBMK – 2022: 7th International Conference on Computer Science and Engineering, 24–26 September 2022. – Diyarbakir, Turkey, pp. 182–187 (2022). Impakt Factor 5.5. https://doi.org/10.1109/UBMK55850.2022.9919546
Abjalova, M, Adalı, E., Iskandarov O.: Educational corpus of the Uzbek language and its opportunities. In: 2023 8th International Conference on Computer Science and Engineering (UBMK), Burdur, Turkiye, pp. 590–594 (2023). https://doi.org/10.1109/UBMK59864.2023.10286682
Baum, L.E., Sell, G.R.: Growth transformations for functions on manifolds. Pac. J. Math. 27(2), 211–227 (1968). https://en.wikipedia.org/wiki/Hidden_Markov_model
Brill, E.: A simple rule-based part of speech tagger. In: ANLC ‘92: Proceedings of the Third Conference on Applied Natural Language Processing, pp. 152–155 (1992). https://doi.org/10.3115/974499.974526
Brill, E.: Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging. Comput. Linguist. 21, 543–565 (1995). http://acl.ldc.upenn.edu/J/J95/J95-4004.pdf
Sharipov, M., Yuldashov, O.: UzbekStemmer: development of a rule-based stemming algorithm for Uzbek language. In: CEUR Workshop Proceedings, vol. 3315, pp. 137–144 (2022)
Rakhmatullayev, S.: Explanatory dictionary of Uzbek homonyms. Tashkent: Teacher, 214 p. (1984)
Rizayev, S.: Fundamentals of linguostatistics in Uzbek linguistics. Tashkent: Science, 18 (2006)
http://tech.yandex.ru/mystem. Accessed 28 Nov 2023
https://www.freecodecamp.org/news/an-introduction-to-part-of-speech-tagging-and-the-hidden-markov-model-953d45338f24. Accessed 28 Nov 2023
https://habr.com/ru/post/125988. Accessed 28 Nov 2023
http://samag.ru/archive/article/3059. Accessed 28 Nov 2023
Tukeyev, U.A.: Computational Morphology of Turkic Languages: Textbook, 161 p. Qazaq University, Almaty (2023)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Abjalova, M., Tukeyev, U., Abduraxmanova, M., Adilova, M. (2024). Development and Realization of Bigram Models for Recognizing Homonyms in the Uzbek Language. In: Nguyen, N.T., et al. Recent Challenges in Intelligent Information and Database Systems. ACIIDS 2024. Communications in Computer and Information Science, vol 2145. Springer, Singapore. https://doi.org/10.1007/978-981-97-5934-7_27
Download citation
DOI: https://doi.org/10.1007/978-981-97-5934-7_27
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5933-0
Online ISBN: 978-981-97-5934-7
eBook Packages: Computer ScienceComputer Science (R0)