Development and Realization of Bigram Models for Recognizing Homonyms in the Uzbek Language

Abjalova, Manzura; Tukeyev, Ualsher; Abduraxmanova, Mukaddas; Adilova, Munojot

doi:10.1007/978-981-97-5934-7_27

Part of the book series: Communications in Computer and Information Science (CCIS, volume 2145)

Included in the following conference series:

Asian Conference on Intelligent Information and Database Systems

Abstract

Homonyms are words that share the same form but differ in meaning. Such words have long been of interest in general linguistics, and in Russian and European linguistics the homonyms found in texts are studied separately in connection with linguistic corpora. This article analyzes the methods and models used in linguistic programs and software systems worldwide, such as Brill’s method, the hidden Markov model, and modifications of these models, together with their connection to the linguistic corpus, and provides insights into the importance of a national corpus for linguistic processing. Several methods for identifying homonyms in Uzbek-language texts are considered, and the N-gram method is identified as one of the most reliable for the Uzbek language. To this end, 75 models were built for bigram and trigram word combinations in the Uzbek language.
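
To make the N-gram idea concrete, the following is a minimal sketch of how neighbouring-word counts could be used to choose between the readings of a homonym. It is not the authors' implementation and does not reproduce any of their 75 models. The training examples, the sense labels `NOUN` and `VERB`, and the function names `train` and `disambiguate` are all hypothetical and chosen only for illustration, using the Uzbek form "olma" ("apple" or "do not take").

```python
from collections import Counter

# Hypothetical hand-labelled examples (an assumption, not data from the paper):
# (left neighbour, homonym, right neighbour, sense label)
TRAINING = [
    ("qizil", "olma", "yedi", "NOUN"),    # "olma" as "apple"
    ("shirin", "olma", "pishdi", "NOUN"),
    ("kitobni", "olma", "dedi", "VERB"),  # "olma" as "do not take"
    ("pulni", "olma", "dedi", "VERB"),
]


def train(examples):
    """Count left-hand and right-hand bigrams separately for each sense."""
    left, right, senses = Counter(), Counter(), Counter()
    for prev_word, word, next_word, sense in examples:
        left[(prev_word, word, sense)] += 1
        right[(word, next_word, sense)] += 1
        senses[(word, sense)] += 1
    return left, right, senses


def disambiguate(model, prev_word, word, next_word):
    """Return the sense whose bigram counts best match the context.

    Add-one smoothing keeps unseen bigrams from zeroing out the score.
    Returns None if the word was never seen as a homonym in training.
    """
    left, right, senses = model
    candidates = [s for (w, s) in senses if w == word]
    if not candidates:
        return None

    def score(sense):
        return (left[(prev_word, word, sense)] + 1) * (
            right[(word, next_word, sense)] + 1
        )

    return max(candidates, key=score)


model = train(TRAINING)
noun_reading = disambiguate(model, "qizil", "olma", "yedi")
verb_reading = disambiguate(model, "kitobni", "olma", "dedi")
```

With these toy counts, the context "qizil olma yedi" selects the noun reading and "kitobni olma dedi" selects the verb reading. A trigram variant would count the left neighbour, the homonym, and the right neighbour jointly rather than as two separate bigrams.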

Author information

Authors and Affiliations

Tashkent State University of Uzbek Language and Literature, Tashkent, Uzbekistan
Manzura Abjalova, Mukaddas Abduraxmanova & Munojot Adilova
Al-Farabi Kazakh National University, Almaty, Kazakhstan
Ualsher Tukeyev

Corresponding author

Correspondence to Manzura Abjalova.

Editor information

Editors and Affiliations

Wrocław University of Science and Technology, Wrocław, Poland
Ngoc Thanh Nguyen
University of Pau and Adour Countries, Pau, France
Richard Chbeir
Open University of Cyprus, Latsia, Cyprus
Yannis Manolopoulos
Iwate Prefectural University, Takizawa, Japan
Hamido Fujita
National University of Kaohsiung, Kaohsiung, Taiwan
Tzung-Pei Hong
Japan Advanced Institute of Science and Technology, Nomi, Japan
Le Minh Nguyen
Wrocław University of Science and Technology, Wrocław, Poland
Krystian Wojtkiewicz

About this paper

Cite this paper

Abjalova, M., Tukeyev, U., Abduraxmanova, M., Adilova, M. (2024). Development and Realization of Bigram Models for Recognizing Homonyms in the Uzbek Language. In: Nguyen, N.T., et al. Recent Challenges in Intelligent Information and Database Systems. ACIIDS 2024. Communications in Computer and Information Science, vol 2145. Springer, Singapore. https://doi.org/10.1007/978-981-97-5934-7_27

DOI: https://doi.org/10.1007/978-981-97-5934-7_27
Published: 13 August 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5933-0
Online ISBN: 978-981-97-5934-7
eBook Packages: Computer Science, Computer Science (R0)
