Probabilistic Associations as a Proxy for Semantic Relatedness

Jabeen, Shahida; Gao, Xiaoying; Andreae, Peter

doi:10.1007/978-3-319-11749-2_38

Shahida Jabeen¹⁹,
Xiaoying Gao¹⁹ &
Peter Andreae¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8786))

Included in the following conference series:

International Conference on Web Information Systems Engineering

1584 Accesses

Abstract

Semantic relatedness computation is a well known problem with multidisciplinary applications. Existing approaches to computing semantic relatedness ignore the asymmetric associations of words. In the absence of an explicit topical context, these asymmetric associations can be effectively used to represent the relation of words in directional contexts. Motivated by the idea of word associations, this paper presents a new approach to computing semantic relatedness using asymmetric association based probabilities of words extracted from the directional contexts of words based on the Wikipedia corpus. The performance evaluation of the proposed approach on a variety of publicly available benchmark datasets shows that the asymmetric association based measures outperformed not only the baseline symmetric measures but also most of the state-of-art approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A survey of semantic relatedness evaluation datasets and procedures

Article 23 December 2019

Measuring Semantic Relatedness with Knowledge Association Network

Measuring similarity and relatedness using multiple semantic relations in WordNet

Article 01 August 2019

References

Sahlgren, M.: Vector-based semantic analysis: Representing word meanings based on random labels. In: Proceedings of ESSLI Workshop on Semantic Knowledge Acquistion and Categorization. Kluwer Academic Publishers (2001)
Google Scholar
Landauer, T.K., Dumais, S.T.: A solution to plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review, 211–240 (1997)
Google Scholar
Islam, A., Inkpen, D.: Second order co-occurrence pmi for determining the semantic similarity of words. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC 2006), pp. 1033–1038 (2006)
Google Scholar
Liu, H., Bao, H., Xu, D.: Concept vector for semantic similarity and relatedness based on wordnet structure. Journal of Systems and Softwares 85, 370–381 (2012)
Article Google Scholar
Jabeen, S., Gao, X., Andreae, P.: Directional Context Helps: Guiding Semantic Relatedness Computation by Asymmetric Word Associations. In: Lin, X., Manolopoulos, Y., Srivastava, D., Huang, G. (eds.) WISE 2013, Part I. LNCS, vol. 8180, pp. 92–101. Springer, Heidelberg (2013)
Chapter Google Scholar
Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Comput. Linguist. 16, 22–29 (1990)
Google Scholar
Turney, P.D.: Mining the web for synonyms: Pmi-ir versus lsa on toefl. In: Proceedings of the 12th European Conference on Machine Learning, EMCL 2001, pp. 491–502 (2001)
Google Scholar
Higgins, D.: Which statistics reflect semantics? rethinking synonymy and word similarity. In: Proceedings of International Conference on Linguistic Evidence, pp. 265–284 (2004)
Google Scholar
Lin, D.: An information-theoretic definition of similarity. In: Proceedings of 15th International Conference on Machine Learning (ICML1998), pp. 296–304 (1998)
Google Scholar
Bollegala, D., Matsuo, Y., Ishizuka, M.: A web search engine-based approach to measure semantic similarity between words. IEEE Trans. on Knowl. and Data Eng. 23(7), 977–990 (2011)
Article Google Scholar
Cilibrasi, R.L., Vitanyi, P.M.B.: The google similarity distance. IEEE Trans. on Knowl. and Data Eng. 19(3), 370–383 (2007)
Article Google Scholar
Gracia, J.L., Mena, E.: Web-based measure of semantic relatedness. In: Bailey, J., Maier, D., Schewe, K.-D., Thalheim, B., Wang, X.S. (eds.) WISE 2008. LNCS, vol. 5175, pp. 136–150. Springer, Heidelberg (2008)
Chapter Google Scholar
Halawi, G., Dror, G., Gabrilovich, E., Koren, Y.: Large-scale learning of word relatedness with constraints. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2012, pp. 1406–1414 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Engineering and Computer Science, Victoria University of Wellington, P.O. Box 600, Wellington, New Zealand
Shahida Jabeen, Xiaoying Gao & Peter Andreae

Authors

Shahida Jabeen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoying Gao
View author publications
You can also search for this author in PubMed Google Scholar
Peter Andreae
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of New South Wales, Sydney, Australia
Boualem Benatallah
Boston University, Boston, MA, USA
Azer Bestavros
Aristotle University of Thessaloniki, Thessaloniki, Greece
Yannis Manolopoulos & Athena Vakali &
Victoria University, Footscray, VIC, Australia
Yanchun Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jabeen, S., Gao, X., Andreae, P. (2014). Probabilistic Associations as a Proxy for Semantic Relatedness. In: Benatallah, B., Bestavros, A., Manolopoulos, Y., Vakali, A., Zhang, Y. (eds) Web Information Systems Engineering – WISE 2014. WISE 2014. Lecture Notes in Computer Science, vol 8786. Springer, Cham. https://doi.org/10.1007/978-3-319-11749-2_38

Download citation

DOI: https://doi.org/10.1007/978-3-319-11749-2_38
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11748-5
Online ISBN: 978-3-319-11749-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics