Supervised Approach to Finding Most Frequent Senses in Russian

Loukachevitch, Natalia; Chetviorkin, Ilia

doi:10.1007/978-3-319-26123-2_33

Natalia Loukachevitch¹⁵ &
Ilia Chetviorkin¹⁶

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 542))

Included in the following conference series:

International Conference on Analysis of Images, Social Networks and Texts

988 Accesses

Abstract

The paper describes a supervised approach for the detection of the most frequent sense on the basis of RuThes thesaurus, which is a large linguistic ontology for Russian. Due to the large number of monosemous multiword expressions and the set of RuThes relations it is possible to calculate several context features for ambiguous words and to study their contribution in a supervised model for detecting frequent senses.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agirre, E., Màrquez, L., Wicentowski, R. (eds.): Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval). Association for Computational Linguistics, Prague (2007)
Google Scholar
Navigli, R.: Word sense disambiguation: a survey. ACM Comput. Surv. (CSUR), 41(2), 10, 1–69 (2009)
Google Scholar
Landes, S., Leacock, C., Tengi, R.: Building semantic concordances. In: Fellbaum, C. (ed.) WordNet: An Electronic Lexical Database. The MIT Press, Cambridge (Mass) (1998)
Google Scholar
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
MATH Google Scholar
Petrolito, T., Bond, F.: A survey of wordnet annotated corpora. In: Proceedings of Global WordNet Conference, GWC-2014, pp. 236–245 (2014)
Google Scholar
Mitra, S., Mitra, R., Riedl, M., Biemann, C., Mukherjee, A., Goyal, P.: That’s sick dude!: automatic identification of word sense change across different timescales. In: Proceedings of ACL-2014 (2014)
Google Scholar
Mohammad, S., Hirst, G.: Determining word sense dominance using a thesaurus. In: Proceedings of EACL-2006, pp. 121–128 (2006)
Google Scholar
McCarthy, D., Koeling, R., Weeds, J., Carroll, J.: Finding predominant word senses in untagged text. In: Proceedings of ACL-2004 (2004)
Google Scholar
McCarthy, D., Koeling, R., Weeds, J., Carroll, J.: Unsupervised acquisition of predominant word senses. Comput. Linguist. 33(4), 553–590 (2007)
Article Google Scholar
Koeling, R., McCarthy, D., Carroll, J.: Domain-specific sense distributions and predominant sense acquisition. In: Proceedings EMNLP-2005, Vancouver, pp. 419–426 (2005)
Google Scholar
Loukachevitch, N., Dobrov, B.: RuThes linguistic ontology vs. Russian wordnets. In: Proceedings of Global WordNet Conference GWC-2014 (2014)
Google Scholar
Snyder, B., Palmer, M.: The english all-words task. In: Mihalcea, R., Chklowski, T. (eds.) Proceedings of SENSEVAL-3: Third International Workshop on Evaluating Word Sense Disambiguating Systems, pp. 41–43 (2004)
Google Scholar
Lin, D.: Automatic retrieval and clustering of similar words. In: Proceedings of the 17th International Conference on Computational linguistics COLING-1998, pp. 768–774 (1998)
Google Scholar
Lau, J.H., Cook, P., McCarthy, D., Gella, S., Baldwin, T.: Learning word sense distributions, detecting unattested senses and identifying novel senses using topic models. In: Proceedings of ACL-2014, pp. 259–270 (2014)
Google Scholar
Agirre, E., Lacalle, O.L.: Publicly available topic signatures for all wordnet nominal senses. In: Proceedings of LREC-2004 (2004)
Google Scholar
Leacock, C., Miller, G., Chodorow, M.: Using corpus statistics and wordnet relations for sense identification. Comput. Linguist. 24(1), 147–165 (1998)
Google Scholar
Mihalcea, R.: Bootstrapping large sense tagged corpora. In: Proceedings of LREC-2002 (2002)
Google Scholar
Azarowa, I.: RussNet as a computer Lexicon for Russian. In: Proceedings of the Intelligent Information systems IIS-2008, pp. 341–350 (2008)
Google Scholar
Balkova, V., Suhonogov, A., Yablonsky, S.: Some issues in the construction of a Russian wordnet grid. In: Proceedings of the Forth International WordNet Conference, Szeged, pp. 44–55 (2008)
Google Scholar
Braslavski, P., Ustalov, D., Mukhin, M.: A spinning wheel for YARN: user interface for a crowdsourced thesaurus. In: Proceedings of EACL-2014, Sweden (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Research Computing Center of Lomonosov Moscow State University, Moscow, Russia
Natalia Loukachevitch
Lomonosov Moscow State University, Moscow, Russia
Ilia Chetviorkin

Authors

Natalia Loukachevitch
View author publications
You can also search for this author in PubMed Google Scholar
Ilia Chetviorkin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Natalia Loukachevitch .

Editor information

Editors and Affiliations

Krasovsky Institute of Mathematics and Mechanics, Yekaterinburg, Russia
Mikhail Yu. Khachay
Wolverhampton, United Kingdom
Natalia Konstantinova
Technische Universität Darmstadt, Darmstadt, Germany
Alexander Panchenko
National Research University Higher School of Economics, Moscow, Russia
Dmitry Ignatov
Ural Federal University, Yekaterinbug, Russia
Valeri G. Labunets

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Loukachevitch, N., Chetviorkin, I. (2015). Supervised Approach to Finding Most Frequent Senses in Russian. In: Khachay, M., Konstantinova, N., Panchenko, A., Ignatov, D., Labunets, V. (eds) Analysis of Images, Social Networks and Texts. AIST 2015. Communications in Computer and Information Science, vol 542. Springer, Cham. https://doi.org/10.1007/978-3-319-26123-2_33

Download citation

DOI: https://doi.org/10.1007/978-3-319-26123-2_33
Published: 05 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26122-5
Online ISBN: 978-3-319-26123-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics