Skip to main content
Log in

Measuring Entity Relatedness via Entity and Text Joint Embedding

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

As unique identifiers of objects and basic components of knowledge graphs, entities are crucial to many natural language processing related works, such as entity linking and question answering, in which the estimation of entity relatedness is required. Current entity relatedness measures either consider entities as words, which neglects the rich semantics entities contain, or are integrated into extrinsic applications, which fail to evaluate the intrinsic effectiveness. In this work, we propose E5, an effective entity relatedness measure taking into account of entity description text in a neural embedding manner. We first jointly map words and entities to the same high-dimensional vector space, the output of which is utilized as the input for the following joint entity and text embedding training. The well-trained entity and text embedding network is then leveraged to measure similarity between entities and entity descriptions, which in combination with a graph structure based method, constitute the eventual entity relatedness measure. The experimental results validate the usefulness of E5.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. The Wikipedia dump could be downloaded from https://dumps.wikimedia.org/enwiki/20180320/.

  2. https://radimrehurek.com/gensim/.

References

  1. Usbeck R, Röder M, Ngomo AN, Baron C, Both A, Brümmer M, Ceccarelli D, Cornolti M, Cherix D, Eickmann B, Ferragina P, Lemke C, Moro A, Navigli R, Piccinno F, Rizzo G, Sack H, Speck R, Troncy R, Waitelonis J, Wesemann L (2015) GERBIL: general entity annotator benchmarking framework. In: Proceedings of the 24th international conference on world wide web, WWW 2015, Florence, Italy, pp 1133–1143. https://doi.org/10.1145/2736277.2741626

  2. Zeng W, Zhao X, Tang J, Shang H (2018) Collective list-only entity linking: a graph-based approach. IEEE Access 6:16035–16045

    Article  Google Scholar 

  3. Xiong C, Power R, Callan J (2017) Explicit semantic ranking for academic search via knowledge graph embedding. In: Proceedings of the 26th international conference on world wide web, WWW 2017, Perth, Australia, April 3–7, 2017, pp 1271–1279. https://doi.org/10.1145/3038912.3052558

  4. Yamada I, Shindo H, Takeda H, Takefuji Y (2016) Joint learning of the embedding of words and entities for named entity disambiguation. In: Proceedings of the 20th SIGNLL conference on computational natural language learning, CoNLL 2016, Berlin, Germany, August 11–12, 2016, pp 250–259. http://aclweb.org/anthology/K/K16/K16-1025.pdf

  5. Huang H, Heck LP, Ji H (2015) Leveraging deep neural networks and knowledge graphs for entity disambiguation. CoRR abs/1504.07678. http://arxiv.org/abs/1504.07678

  6. Hoffart J, Seufert S, Nguyen DB, Theobald M, Weikum G (2012) KORE: keyphrase overlap relatedness for entity disambiguation. In: 21st ACM international conference on information and knowledge management, CIKM’12, Maui, HI, USA, October 29–November 02, 2012, pp 545–554. https://doi.org/10.1145/2396761.2396832

  7. Ponza M, Ferragina P, Chakrabarti S (2017) A two-stage framework for computing entity relatedness in wikipedia. In: Proceedings of the 2017 ACM on conference on information and knowledge management, CIKM 2017, Singapore, November 06–10, 2017, pp 1867–1876. https://doi.org/10.1145/3132847.3132890

  8. Witten IH, Milne D (2008) An effective, low-cost measure of semantic relatedness obtained from wikipedia links. In: Proceedings of AAAI

  9. Strube M, Ponzetto SP (2006) WikiRelate! computing semantic relatedness using wikipedia. In: Proceedings, the twenty-first national conference on artificial intelligence and the eighteenth innovative applications of artificial intelligence conference, July 16–20, 2006, Boston, Massachusetts, USA, pp 1419–1424. http://www.aaai.org/Library/AAAI/2006/aaai06-223.php

  10. Aggarwal N, Buitelaar P (2014) Wikipedia-based distributional semantics for entity relatedness. In: AAAI fall symposium

  11. Zhao Y, Liu Z, Sun M (2015) Representation learning for measuring entity relatedness with rich information. In: Proceedings of the twenty-fourth international joint conference on artificial intelligence, IJCAI 2015, Buenos Aires, Argentina, July 25–31, 2015, pp 1412–1418. http://ijcai.org/Abstract/15/203

  12. Ceccarelli D, Lucchese C, Orlando S, Perego R, Trani S (2013) Learning relatedness measures for entity linking. In: 22nd ACM international conference on information and knowledge management, CIKM’13, San Francisco, CA, USA, October 27–November 1, 2013, pp 139–148. https://doi.org/10.1145/2505515.2505711

  13. Akhlaghi MI, Sukhov SV (2017) Knowledge fusion in feedforward artificial neural networks. Neural Process Lett 48:257–272. https://doi.org/10.1007/s11063-017-9712-5

    Article  Google Scholar 

  14. Chandra R, Gupta A, Ong YS, Goh CK (2017) Evolutionary multi-task learning for modular knowledge representation in neural networks. Neural Process Lett 47:907–919. https://doi.org/10.1007/s11063-017-9718-z

    Article  Google Scholar 

  15. Wu C, Shi X, Su J, Chen Y, Huang Y (2017) Co-training for implicit discourse relation recognition based on manual and distributed features. Neural Process Lett 46(1):233. https://doi.org/10.1007/s11063-017-9582-x

    Article  Google Scholar 

  16. Zhao S, King I, Lyu MR (2017) Aggregated temporal tensor factorization model for point-of-interest recommendation. Neural Process Lett 47:975–992. https://doi.org/10.1007/s11063-017-9681-8

    Article  Google Scholar 

  17. Jung C, Shen Y, Jiao L (2015) Learning to rank with ensemble ranking SVM. Neural Process Lett 42(3):703. https://doi.org/10.1007/s11063-014-9382-5

    Article  Google Scholar 

  18. Zhao X, Xiao C, Lin X, Zhang W, Wang Y (2018) Efficient structure similarity searches: a partition-based approach. VLDB J 27(1):53. https://doi.org/10.1007/s00778-017-0487-0

    Article  Google Scholar 

  19. Liu X, Zhu X, Li M, Wang L, Tang C, Yin J, Shen D, Wang H, Gao W (2018) Late fusion incomplete multi-view clustering. In: IEEE transactions on pattern analysis and machine intelligence

  20. Gabrilovich E, Markovitch S (2007) Computing semantic relatedness using wikipedia-based explicit semantic analysis. In: IJCAI 2007, Proceedings of the 20th international joint conference on artificial intelligence, Hyderabad, India, January 6–12, 2007, pp 1606–1611. http://ijcai.org/Proceedings/07/Papers/259.pdf

  21. Hoffman MD, Blei DM, Bach FR (2010) Online learning for latent Dirichlet allocation. In: Advances in neural information processing systems 23: 24th annual conference on neural information processing systems 2010. Proceedings of a meeting held 6–9 December 2010, Vancouver, British Columbia, Canada, pp 856–864. http://papers.nips.cc/paper/3902-online-learning-for-latent-dirichlet-allocation

  22. Haveliwala TH (2002) Topic-sensitive PageRank. In: Proceedings of the eleventh international world wide web conference, WWW 2002, May 7–11, 2002, Honolulu, Hawaii, pp 517–526. https://doi.org/10.1145/511446.511513

  23. Rothe S, Schütze H (2014) CoSimRank: a flexible & efficient graph-theoretic similarity measure. In: Proceedings of the 52nd annual meeting of the association for computational linguistics, ACL 2014, June 22–27, 2014, Baltimore, MD, USA, volume 1: long papers, pp 1392–1402. http://aclweb.org/anthology/P/P14/P14-1131.pdf

  24. Perozzi B, Al-Rfou R, Skiena S (2014) DeepWalk: online learning of social representations. In: The 20th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’14, New York, NY–August 24–27, 2014, pp 701–710. https://doi.org/10.1145/2623330.2623732

  25. Zwicklbauer S, Seifert C, Granitzer M (2016) Robust and collective entity disambiguation through semantic embeddings. In: Proceedings of the 39th international ACM SIGIR conference on research and development in information retrieval, SIGIR 2016, Pisa, Italy, July 17–21, 2016, pp 425–434. https://doi.org/10.1145/2911451.2911535

  26. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems 26: 27th annual conference on neural information processing systems 2013. Proceedings of a meeting held December 5–8, 2013, Lake Tahoe, Nevada, United States, pp 3111–3119

  27. Le QV, Mikolov T (2014) Distributed representations of sentences and documents. In: Proceedings of the 31th international conference on machine learning, ICML 2014, Beijing, China, 21–26 June 2014, pp 1188–1196. http://jmlr.org/proceedings/papers/v32/le14.html

  28. Yamada I, Shindo H, Takeda H, Takefuji Y (2017) Learning distributed representations of texts and entities from knowledge base. TACL 5:397. https://transacl.org/ojs/index.php/tacl/article/view/1065

  29. Brümmer M, Dojchinovski M, Hellmann S (2016) DBpedia abstracts: a large-scale, open, multilingual NLP, training corpus. In: Proceedings of the tenth international conference on language resources and evaluation LREC 2016, Portorož, Slovenia, May 23–28, 2016. http://www.lrec-conf.org/proceedings/lrec2016/summaries/895.html

  30. Joachims T (2002) Optimizing search engines using clickthrough data. In: Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining, July 23–26, 2002, Edmonton, Alberta, Canada, pp 133–142

Download references

Acknowledgements

The authors would like to thank the anonymous reviewers for their insightful and constructive comments, which greatly contributed to improving the quality of the paper. This work was partially supported by NSFC under Grants Nos. 61872446 and 71690233.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiang Zhao.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zeng, W., Tang, J. & Zhao, X. Measuring Entity Relatedness via Entity and Text Joint Embedding. Neural Process Lett 50, 1861–1875 (2019). https://doi.org/10.1007/s11063-018-9966-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-018-9966-6

Keywords

Navigation