Skip to main content

Combining Heuristics and Learning for Entity Linking

  • Conference paper
  • First Online:
  • 682 Accesses

Abstract

Entity linking refers to the task of mapping name strings in a text to their corresponding entities in a given knowledge base. It is an essential component in natural language processing applications and a challenging task. This paper proposes a method that combines heuristics and learning for entity linking by (i) learning coherence among co-occurrence entities within the text based on Wikipedia’s link structure and (ii) exploiting some heuristics based on the contexts and coreference relations among name strings. The experiment results on TAC-KBP2011 dataset show that our method achieves performance comparable to the state-of-the-art methods. The results also show that the proposed model is simple because of using a classifier trained on just two popular features in combination with some heuristics, but effective.

The original version of this chapter was revised: The copyright line was incorrect. This has been corrected. The Erratum to this chapter is available at DOI: 10.1007/978-3-319-05939-6_37

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Sen, P.: Collective context-aware topic models for entity disambiguation. In: WWW 2012 (2012)

    Google Scholar 

  2. Ji, H., Grishman, R., Dang, H.T.: An overview of the TAC2011 knowledge base population track. In: Proceedings of Text Analysis Conference (TAC 2011) (2011)

    Google Scholar 

  3. Han, X., Sun, L., and Zhao, J.: Collective entity linking in web text: a graph-based method. In: Proceedings of SIGIR 2011, pp. 765–774 (2011)

    Google Scholar 

  4. Zhang, W., Sim, Y.C., Su, J., Tan, C.-L.: Entity linking with effective acronym expansion, instance selection and topic modeling. In: Proceedings of the 20th IJCAI (IJCAI 2011), pp. 1909–1904 (2011)

    Google Scholar 

  5. Ratinov, L., Roth, D., Downey, D., Anderson, M.: Local and global algorithms for disambiguation to Wikipedia. In: Proceedings of ACL-HLT 2011 (2011)

    Google Scholar 

  6. Kataria, S., Kumar, K., Rastogi, R., Sen, P., Sengamedu, S.: Entity disambiguation with hierarchical topic models. In: KDD 2011

    Google Scholar 

  7. Zhang, W., Su, J., Tan, C.-L., Wang, W.: Entity linking leveraging automatically generated annotation. In: Proceedings of COLING 2010 (2010)

    Google Scholar 

  8. Zheng, Z., Li, F., Huang, M., Zhu, X.: Learning to link entities with knowledge base. In: Proceedings of HLT: NAACL 2010 (2010)

    Google Scholar 

  9. Dredze, M., McNamee, P., Rao, D., Gerber, A., Finin, T.: Entity disambiguation for knowledge base population. In: Proceedings of COLING 2010 (2010)

    Google Scholar 

  10. Milne, D. and Witten, I.H.: Learning to link with Wikipedia. In: Proceedings of the 17th ACM CIKM (CIKM 2008), pp. 509–518 (2008)

    Google Scholar 

  11. Medelyan, O., Witten, I.H., Milne, D.: Topic indexing with Wikipedia. In: Proceedings of Wikipedia and AI Workshop at the AAAI-2008 Conference (2008)

    Google Scholar 

  12. Bunescu, R., Paşca, M.: Using encyclopedic knowledge for named entity disambiguation. In: Proceedings of the 11th Conference of the EACL (EACL 2006), pp. 9–16 (2006)

    Google Scholar 

  13. Cucerzan, S.: Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings of EMNLP-CoNLL Joint Conference (EMNLP-CoNLL 2007), pp. 708–716 (2007)

    Google Scholar 

  14. Kulkarni, S., Singh, A., Ramakrishnan, G., Chakrabarti, S.: collective annotation of Wikipedia entities in web text. In: KDD 2009 (2009)

    Google Scholar 

  15. Mihalcea, R., Csomai, A.: Wikify!: linking documents to encyclopedic knowledge. In: Proceedings of the 16th ACM CIKM, pp. 233–242 (2007)

    Google Scholar 

  16. Bontcheva, K., Dimitrov, M., Maynard, D., Tablan, V., Cunningham, H.: Shallow methods for named entity coreference resolution. In: Proceedings of TALN 2002 Workhop (2002)

    Google Scholar 

  17. Li, Y., Wang, C., Han, F., Han, J., Roth, D., Yan, X.: Mining evidences for named entity disambiguation. In: KDD’2013 (2013)

    Google Scholar 

  18. Zhang, W., Su, J., Chen, B., Wang, W., Toh, Z., Sim, Y., Tan, C. L.: I2r-nus-msra at tac 2011: entity linking. In: Proceedings of Text Analysis Conference (TAC 2011) (2011)

    Google Scholar 

  19. Monahan, S., Lehmann, J., Nyberg, T., Plymale, J., Jung, A.: Cross-lingual cross-document coreference with entity linking. In: Proceedings of Text Analysis Conference (TAC 2011) (2011)

    Google Scholar 

  20. Chang, A.X., Spitkovsky, V.I., Agirre, E., Manning, C.D.: Stanford-UBC entity linking at TAC-KBP, again. In: Proceedings of Text Analysis Conference (TAC 2011) (2011)

    Google Scholar 

  21. Radford, W., Hachey, B., Honnibal, M., Nothman, J., Curran, J.R.: Naıve but effective NIL clustering baselines–CMCRC at TAC 2011. In: Proceedings of Text Analysis Conference (TAC 2011) (2011)

    Google Scholar 

  22. Taylor Cassidy, Z.C., Artiles, J., Ji, H., Deng, H., Ratinov, L.A., Zheng, J., Roth, D.: CUNY-UIUC-SRI TAC-KBP2011 entity linking system description. In: Proceedings Text Analysis Conference (TAC2011) (2011)

    Google Scholar 

  23. Nguyen, H.T., Cao, T.H.: Named entity disambiguation: a hybrid approach. Int. J. Comput. Intell. Syst. 5(6), 1052–1067 (2012)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hien T. Nguyen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 ICST Institute for Computer Science, Social Informatics and Telecommunications Engineering

About this paper

Cite this paper

Nguyen, H.T. (2014). Combining Heuristics and Learning for Entity Linking. In: Vinh, P., Alagar, V., Vassev, E., Khare, A. (eds) Context-Aware Systems and Applications. ICCASA 2013. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 128. Springer, Cham. https://doi.org/10.1007/978-3-319-05939-6_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05939-6_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05938-9

  • Online ISBN: 978-3-319-05939-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics