Abstract
This paper describes a supervised approach to detecting the most frequent sense of ambiguous words, based on the RuThes thesaurus, a large linguistic ontology for Russian. Owing to the large number of monosemous multiword expressions in RuThes and its set of relations, it is possible to compute several context features for ambiguous words and to study their contribution to a supervised model for detecting frequent senses.
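As a rough illustration of the kind of context feature the abstract alludes to, the sketch below counts how often an ambiguous word co-occurs with monosemous expressions related to each of its senses, then normalizes the counts into per-sense shares that could serve as inputs to a supervised model. It is not the authors' feature set: the function names, the toy English sense inventory, and the plain substring matching are all assumptions made for the example, and no RuThes data or API is used.

```python
from collections import Counter

# Hypothetical toy inventory: each sense of an ambiguous word is linked
# (through thesaurus relations) to monosemous related expressions.
SENSE_RELATIVES = {
    "bank/finance": {"credit institution", "savings account", "central bank"},
    "bank/river": {"river bank", "shoreline"},
}


def sense_context_counts(documents, target, sense_relatives):
    """For each sense, count the documents in which the target word
    co-occurs with at least one monosemous relative of that sense."""
    counts = Counter({sense: 0 for sense in sense_relatives})
    for doc in documents:
        text = doc.lower()
        if target not in text:
            continue  # the ambiguous word does not occur here
        for sense, relatives in sense_relatives.items():
            # Naive substring matching; a real system would lemmatize.
            if any(rel in text for rel in relatives):
                counts[sense] += 1
    return counts


def sense_shares(counts):
    """Normalize raw counts into per-sense shares (a simple feature vector)."""
    total = sum(counts.values())
    if total == 0:
        return {sense: 0.0 for sense in counts}
    return {sense: n / total for sense, n in counts.items()}


def most_frequent_sense(counts):
    """Pick the sense with the highest co-occurrence count."""
    return max(counts, key=counts.get)


DOCS = [
    "The bank raised rates on every savings account.",
    "The central bank met the bank regulators.",
    "We walked along the river bank.",
    "Nothing relevant here, only a shoreline.",
]

counts = sense_context_counts(DOCS, "bank", SENSE_RELATIVES)
shares = sense_shares(counts)
predicted = most_frequent_sense(counts)
```

In a supervised setting, shares like these would be only one of several features per sense, with a trained model rather than a simple maximum deciding which sense is most frequent.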
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Loukachevitch, N., Chetviorkin, I. (2015). Supervised Approach to Finding Most Frequent Senses in Russian. In: Khachay, M., Konstantinova, N., Panchenko, A., Ignatov, D., Labunets, V. (eds) Analysis of Images, Social Networks and Texts. AIST 2015. Communications in Computer and Information Science, vol 542. Springer, Cham. https://doi.org/10.1007/978-3-319-26123-2_33
DOI: https://doi.org/10.1007/978-3-319-26123-2_33
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26122-5
Online ISBN: 978-3-319-26123-2
eBook Packages: Computer Science, Computer Science (R0)