skip to main content
10.1145/3290420.3290459acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccipConference Proceedingsconference-collections
research-article

Efficient and density adaptive edge weight model for measuring semantic similarity

Published: 02 November 2018 Publication History

Abstract

The measurement of semantic similarity between concepts is an important research topic in natural language processing. However, previous efforts suffered from the mismatch of the accuracy and efficiency. In this paper, we propose an edge weight model for improving the accuracy of edge-based measures that have an inherent high efficiency. It combines the edge counting model with the information theory and deduces a function of edge weight based on the number of direct hyponyms of the subsumer in the edge. This model doesn't require any additional parameter and can adapt the effect of different densities to edges. Extensive experiments on four test datasets for WordNet and SNOMED-CT demonstrate that the proposed edge weight model can significantly improve the accuracy of various edge-based similarity measures and has a wide coverage over different ontologies. Compared with IC-based measures, our model has a remarkable advantage in efficiency and is comparable to it in accuracy.

References

[1]
A. Otegi, X. Arregi, O. Ansa, and E. Agirre (2015). Using knowledge-based relatedness for information retrieval. Knowl. Inf. Syst., 44(3), 689--718.
[2]
Ganggao Zhu, and C. A. Iglesias (2018). Exploiting semantic similarity for named entity disambiguation in knowledge graphs. Expert Syst. Appl., 101, 8--24.
[3]
D. Sánchez (2010). A methodology to learn ontological attributes from the Web. Data Knowl. Eng., 69(60), 573--597.
[4]
J. Atkinson, A. Ferreira, and E. Aravena (2009). Discovering implicit intention-level knowledge from natural-language texts. Know-Based Syst., 22(70), 502--508.
[5]
D. Sánchez, D. Isern, and M. Millan (2011). Content annotation for the semantic web: an automatic web-based approach. Knowl. Inf. Syst., 27(3), 393--418.
[6]
M. A. H. Taieb, M. B. Aouicha, and Y. Bourouis (2015). FM3S: Features-based measure of sentences semantic similarity. In HAIS (2015), 515--529.
[7]
Y. Li, Z. Bandar, and D. McLean (2003). An approach for measuring semantic similarity between words using multiple information sources. IEEE Trans. Knowl. Data. Eng., 15(4), 871 -882.
[8]
M. A. H. Taieb, M. B. Aouicha, and A. B. Hamadou (2014). A new semantic relatedness measurement using WordNet features. Knowledge & Information Systems, 41(2), 467--497.
[9]
A. Tversky (1977). Features of similarity. Psychological Review, 84, 327--352.
[10]
J. B. Gao, B. W. Zhang, and X. H. Chen (2015). A WordNet-based semantic similarity measurement combining edge-counting and information content theory. Engineering Applications of Artificial Intelligence, 39, 80--88.
[11]
X. Zhu, F. Li, H. Chen, and Q. Peng (2018). An efficient path computing model for measuring semantic similarity using edge and density. Knowledge and Information Systems, 55(1), 79--111.
[12]
P. Resnik (1995). Using information content to evaluate semantic similarity in a taxonomy. In IJCAI (1995), 448--453.
[13]
J. J. Jiang, and D. W. Conrath (1997). Semantic similarity based on corpus statistics and lexical taxonomy. Iin ROCLING (1997), 19--33.
[14]
D. Lin (1998). An information-theoretic definition of similarity. In ICML (1998), 296--304.
[15]
M. G. Ahsaee, M. Naghibzadeh, and S. E. Y Naeini (2014). Semantic similarity assessment of words using weighted WordNet. International Journal of Machine Learning & Cybernetics, 5(3), 479--490.
[16]
G. Zhu, C. A. Iglesias (2017). Computing Semantic Similarity of Concepts in Knowledge Graphs. IEEE Transactions on Knowledge & Data Engineering, 99, 1--1.
[17]
R. Rada, H. Mili, and E. Bicknell (1989). Development and application of a metric on semantic nets. IEEE Trans. Syst. Man Cybern. Syst., 19(10), 17--30.
[18]
C. Leacock, and M. Chodorow (1998). Combining local context and WordNet similarity for word sense identification. In Fellbaum (1998), 265--283.
[19]
X. Liu, Y. Zhou, and R. Zheng (2007). Measuring semantic similarity in WordNet. In ICMLC (2007), 3431--3435.
[20]
D. Sánchez, and M. Batet (2011). Ontology-based information content computation. Know-Based Syst., 24(20), 297--303.
[21]
N. Seco, T. Veale, and J. Hayes (2004). An intrinsic information content metric for semantic similarity in WordNet. In ECAI (2004), 1089--1090.
[22]
G. A. Miller, and W. G. Charles (1991). Contextual correlates of semantic similarity. Lang. Cognit. Process, 6(1), 1--28.
[23]
H. Rubenstein, and J. B. Goodenough (1965). Contextual correlates of synonymy. Commun. Assoc. Comput. Mach., 8(10), 627--633.
[24]
E. Agirre, E. Alfonseca, K. Hall, and J. Kravalova (2009). A study on similarity and relatedness using distributional and WordNet-based approaches. In NAACL (2009), 19--27.
[25]
T. Pedersen, S. V. Pakhomov, S. Patwardhan, and C. G. Chute (2007). Measures of semantic similarity and relatedness in the biomedical. J. Biomed. Inform., 40(3), 288--299.
[26]
M. B. Aouicha, M. A. H. Taieb, and A. B. Hamadou (2016). Taxonomy-based information content and wordnet-wiktionary-wikipedia glosses for semantic relatedness. Applied Intelligence, 45(2), 475--511.

Index Terms

  1. Efficient and density adaptive edge weight model for measuring semantic similarity

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCIP '18: Proceedings of the 4th International Conference on Communication and Information Processing
    November 2018
    326 pages
    ISBN:9781450365345
    DOI:10.1145/3290420
    • Conference Chairs:
    • Jalel Ben-Othman,
    • Hui Yu,
    • Program Chairs:
    • Herwig Unger,
    • Masayuki Arai
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 02 November 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. SNOMED-CT
    2. edge-weight
    3. information theory
    4. semantic similarity
    5. wordnet

    Qualifiers

    • Research-article

    Funding Sources

    • the National Natural Science Foundation of China
    • Graduate Technological Innovation Project of Beijing Institute of Technology

    Conference

    ICCIP 2018

    Acceptance Rates

    Overall Acceptance Rate 61 of 301 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 51
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media