Skip to main content

ICRC-DSEDL: A Film Named Entity Discovery and Linking System Based on Knowledge Bases

  • Conference paper
  • First Online:
Book cover Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data (CCKS 2016)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 650))

Included in the following conference series:

  • 1550 Accesses

Abstract

Named entity discovery and linking are hot topics in text mining, which is very important for text understanding as named entities that usually presented in various formats and some of them are ambiguous. To accelerate the development of related technology, the China Conference on Knowledge Graph and Semantic Computing (CCKS) in 2016 launches a competition, which includes a task on film named entity discovery and linking (i.e., task 1). We participate this competition and develop a system for task 1 of the CCKS competition. The system consists of two individual parts for named entity discovery (NED) and entity linking (EL) respectively. The first part is a hybrid subsystem based on conditional random field (CRF) and structural support vector machine (SSVM) with rich features, and the second part is a ranking subsystem where not only the given knowledge base but also open knowledge bases are used for candidate generation and SVMrank is used for candidate ranking. On the official test dataset of Task1 of CCKS 2016 competition, our system achieves an F1-score of 77.83% on NED, an accuracy of 86.53% on EL and an overall F1-score of 67.35%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Santos, C.N., Milidiú, R.L.: Named entity recognition. Entropy Guid. Transform. Learn. Algorithms Appl. 51–58 (2012)

    Google Scholar 

  2. Nanyun, P., Dredze, M.: Named entity recognition for chinese social media with jointly trained embeddings. In: Proceedings of EMNLP (2015)

    Google Scholar 

  3. Li, H.: Learning to rank for information retrieval and natural language processing. Synth. Lect. Hum. Lang. Technol. 7(3), 1–121 (2014)

    Article  Google Scholar 

  4. Grishman, R., Sundheim, B.: Message understanding conference-6: a brief history. In: COLING, vol. 96, pp. 466–471 (1996)

    Google Scholar 

  5. Chinchor, N., Marsh, E.: Muc-7 information extraction task definition. In: Proceeding of the Seventh Message Understanding Conference (MUC-7), Appendices, pp. 359–367 (1998)

    Google Scholar 

  6. Shen, W., Wang, J., Han, J.: Entity linking with a knowledge base: issues, techniques, and solutions. IEEE Trans. Knowl. Data Eng. 27(2), 443–460 (2015)

    Article  Google Scholar 

  7. Yuan, J., Yang, Y., Jia, Z., Yin, H., Huang, J., Zhu, J.: Entity recognition and linking in Chinese search queries. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds.) NLPCC 2015. LNCS (LNAI), vol. 9362, pp. 507–519. Springer, Heidelberg (2015). doi:10.1007/978-3-319-25207-0_47

    Chapter  Google Scholar 

  8. Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. Dep. Pap. CIS, June 2001

    Google Scholar 

  9. Altun, Y., Tsochantaridis, I., Hofmann, T.: Hidden Markov support vector machines. In: ICML 2003, vol. 3, pp. 3–10 (2003)

    Google Scholar 

Download references

Acknowledgments

This paper is supported in part by grants: National 863 Program of China (2015AA015405), NSFCs (National Natural Science Foundation of China) (61402128, 61473101, 61173075 and 61272383) and Strategic Emerging Industry Development Special Funds of Shenzhen (JCYJ20140508161040764, JCYJ20140417172417105 and JCYJ20140627163809422)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Buzhou Tang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Zhao, Y. et al. (2016). ICRC-DSEDL: A Film Named Entity Discovery and Linking System Based on Knowledge Bases. In: Chen, H., Ji, H., Sun, L., Wang, H., Qian, T., Ruan, T. (eds) Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data. CCKS 2016. Communications in Computer and Information Science, vol 650. Springer, Singapore. https://doi.org/10.1007/978-981-10-3168-7_20

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3168-7_20

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3167-0

  • Online ISBN: 978-981-10-3168-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics