research-article

Efficient and density adaptive edge weight model for measuring semantic similarity

Authors:

Sixing HeAuthors Info & Claims

ICCIP '18: Proceedings of the 4th International Conference on Communication and Information Processing

Pages 127 - 134

https://doi.org/10.1145/3290420.3290459

Published: 02 November 2018 Publication History

Abstract

The measurement of semantic similarity between concepts is an important research topic in natural language processing. However, previous efforts suffered from the mismatch of the accuracy and efficiency. In this paper, we propose an edge weight model for improving the accuracy of edge-based measures that have an inherent high efficiency. It combines the edge counting model with the information theory and deduces a function of edge weight based on the number of direct hyponyms of the subsumer in the edge. This model doesn't require any additional parameter and can adapt the effect of different densities to edges. Extensive experiments on four test datasets for WordNet and SNOMED-CT demonstrate that the proposed edge weight model can significantly improve the accuracy of various edge-based similarity measures and has a wide coverage over different ontologies. Compared with IC-based measures, our model has a remarkable advantage in efficiency and is comparable to it in accuracy.

References

[1]

A. Otegi, X. Arregi, O. Ansa, and E. Agirre (2015). Using knowledge-based relatedness for information retrieval. Knowl. Inf. Syst., 44(3), 689--718.

Digital Library

[2]

Ganggao Zhu, and C. A. Iglesias (2018). Exploiting semantic similarity for named entity disambiguation in knowledge graphs. Expert Syst. Appl., 101, 8--24.

[3]

D. Sánchez (2010). A methodology to learn ontological attributes from the Web. Data Knowl. Eng., 69(60), 573--597.

Digital Library

[4]

J. Atkinson, A. Ferreira, and E. Aravena (2009). Discovering implicit intention-level knowledge from natural-language texts. Know-Based Syst., 22(70), 502--508.

Digital Library

[5]

D. Sánchez, D. Isern, and M. Millan (2011). Content annotation for the semantic web: an automatic web-based approach. Knowl. Inf. Syst., 27(3), 393--418.

Digital Library

[6]

M. A. H. Taieb, M. B. Aouicha, and Y. Bourouis (2015). FM3S: Features-based measure of sentences semantic similarity. In HAIS (2015), 515--529.

[7]

Y. Li, Z. Bandar, and D. McLean (2003). An approach for measuring semantic similarity between words using multiple information sources. IEEE Trans. Knowl. Data. Eng., 15(4), 871 -882.

Digital Library

[8]

M. A. H. Taieb, M. B. Aouicha, and A. B. Hamadou (2014). A new semantic relatedness measurement using WordNet features. Knowledge & Information Systems, 41(2), 467--497.

Digital Library

[9]

A. Tversky (1977). Features of similarity. Psychological Review, 84, 327--352.

[10]

J. B. Gao, B. W. Zhang, and X. H. Chen (2015). A WordNet-based semantic similarity measurement combining edge-counting and information content theory. Engineering Applications of Artificial Intelligence, 39, 80--88.

[11]

X. Zhu, F. Li, H. Chen, and Q. Peng (2018). An efficient path computing model for measuring semantic similarity using edge and density. Knowledge and Information Systems, 55(1), 79--111.

Digital Library

[12]

P. Resnik (1995). Using information content to evaluate semantic similarity in a taxonomy. In IJCAI (1995), 448--453.

Digital Library

[13]

J. J. Jiang, and D. W. Conrath (1997). Semantic similarity based on corpus statistics and lexical taxonomy. Iin ROCLING (1997), 19--33.

[14]

D. Lin (1998). An information-theoretic definition of similarity. In ICML (1998), 296--304.

Digital Library

[15]

M. G. Ahsaee, M. Naghibzadeh, and S. E. Y Naeini (2014). Semantic similarity assessment of words using weighted WordNet. International Journal of Machine Learning & Cybernetics, 5(3), 479--490.

[16]

G. Zhu, C. A. Iglesias (2017). Computing Semantic Similarity of Concepts in Knowledge Graphs. IEEE Transactions on Knowledge & Data Engineering, 99, 1--1.

Digital Library

[17]

R. Rada, H. Mili, and E. Bicknell (1989). Development and application of a metric on semantic nets. IEEE Trans. Syst. Man Cybern. Syst., 19(10), 17--30.

[18]

C. Leacock, and M. Chodorow (1998). Combining local context and WordNet similarity for word sense identification. In Fellbaum (1998), 265--283.

[19]

X. Liu, Y. Zhou, and R. Zheng (2007). Measuring semantic similarity in WordNet. In ICMLC (2007), 3431--3435.

[20]

D. Sánchez, and M. Batet (2011). Ontology-based information content computation. Know-Based Syst., 24(20), 297--303.

Digital Library

[21]

N. Seco, T. Veale, and J. Hayes (2004). An intrinsic information content metric for semantic similarity in WordNet. In ECAI (2004), 1089--1090.

Digital Library

[22]

G. A. Miller, and W. G. Charles (1991). Contextual correlates of semantic similarity. Lang. Cognit. Process, 6(1), 1--28.

[23]

H. Rubenstein, and J. B. Goodenough (1965). Contextual correlates of synonymy. Commun. Assoc. Comput. Mach., 8(10), 627--633.

Digital Library

[24]

E. Agirre, E. Alfonseca, K. Hall, and J. Kravalova (2009). A study on similarity and relatedness using distributional and WordNet-based approaches. In NAACL (2009), 19--27.

Digital Library

[25]

T. Pedersen, S. V. Pakhomov, S. Patwardhan, and C. G. Chute (2007). Measures of semantic similarity and relatedness in the biomedical. J. Biomed. Inform., 40(3), 288--299.

Digital Library

[26]

M. B. Aouicha, M. A. H. Taieb, and A. B. Hamadou (2016). Taxonomy-based information content and wordnet-wiktionary-wikipedia glosses for semantic relatedness. Applied Intelligence, 45(2), 475--511.

Index Terms

Efficient and density adaptive edge weight model for measuring semantic similarity
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing

Recommendations

An efficient path computing model for measuring semantic similarity using edge and density

The shortest path between two concepts in a taxonomic ontology is commonly used to represent the semantic distance between concepts in edge-based semantic similarity measures. In the past, edge counting, which is simple and intuitive and has low ...
Measures of semantic similarity and relatedness in the biomedical domain

Measures of semantic similarity between concepts are widely used in Natural Language Processing. In this article, we show how six existing domain-independent measures can be adapted to the biomedical domain. These measures were originally based on ...
Measuring Semantic Similarity Based on WordNet
WISA '09: Proceedings of the 2009 Sixth Web Information Systems and Applications Conference

Semantic similarity between concepts is a fundamental problem and plays an important role in many applications of artificial intelligence, knowledge sharing and Web mining. In this paper, a new measure based on semantic ontology database WordNet is ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICCIP '18: Proceedings of the 4th International Conference on Communication and Information Processing

November 2018

326 pages

ISBN:9781450365345

DOI:10.1145/3290420

Conference Chairs:
Jalel Ben-Othman
University of Paris 13, France
,
Hui Yu
University of Portsmouth, the United Kingdom, UK
,
Program Chairs:
Herwig Unger
University of Hagen, Germany
,
Masayuki Arai
Graduate School of Science and Engineering Teikyo University, Japan

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

the National Natural Science Foundation of China
Graduate Technological Innovation Project of Beijing Institute of Technology

Conference

ICCIP 2018

ICCIP 2018: 2018 the 4th International Conference on Communication and Information Processing

November 2 - 4, 2018

Qingdao, China

Acceptance Rates

Overall Acceptance Rate 61 of 301 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
51
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 25 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten