Abstract
Named entity discovery and linking are hot topics in text mining, which is very important for text understanding as named entities that usually presented in various formats and some of them are ambiguous. To accelerate the development of related technology, the China Conference on Knowledge Graph and Semantic Computing (CCKS) in 2016 launches a competition, which includes a task on film named entity discovery and linking (i.e., task 1). We participate this competition and develop a system for task 1 of the CCKS competition. The system consists of two individual parts for named entity discovery (NED) and entity linking (EL) respectively. The first part is a hybrid subsystem based on conditional random field (CRF) and structural support vector machine (SSVM) with rich features, and the second part is a ranking subsystem where not only the given knowledge base but also open knowledge bases are used for candidate generation and SVMrank is used for candidate ranking. On the official test dataset of Task1 of CCKS 2016 competition, our system achieves an F1-score of 77.83% on NED, an accuracy of 86.53% on EL and an overall F1-score of 67.35%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Santos, C.N., Milidiú, R.L.: Named entity recognition. Entropy Guid. Transform. Learn. Algorithms Appl. 51–58 (2012)
Nanyun, P., Dredze, M.: Named entity recognition for chinese social media with jointly trained embeddings. In: Proceedings of EMNLP (2015)
Li, H.: Learning to rank for information retrieval and natural language processing. Synth. Lect. Hum. Lang. Technol. 7(3), 1–121 (2014)
Grishman, R., Sundheim, B.: Message understanding conference-6: a brief history. In: COLING, vol. 96, pp. 466–471 (1996)
Chinchor, N., Marsh, E.: Muc-7 information extraction task definition. In: Proceeding of the Seventh Message Understanding Conference (MUC-7), Appendices, pp. 359–367 (1998)
Shen, W., Wang, J., Han, J.: Entity linking with a knowledge base: issues, techniques, and solutions. IEEE Trans. Knowl. Data Eng. 27(2), 443–460 (2015)
Yuan, J., Yang, Y., Jia, Z., Yin, H., Huang, J., Zhu, J.: Entity recognition and linking in Chinese search queries. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds.) NLPCC 2015. LNCS (LNAI), vol. 9362, pp. 507–519. Springer, Heidelberg (2015). doi:10.1007/978-3-319-25207-0_47
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. Dep. Pap. CIS, June 2001
Altun, Y., Tsochantaridis, I., Hofmann, T.: Hidden Markov support vector machines. In: ICML 2003, vol. 3, pp. 3–10 (2003)
Acknowledgments
This paper is supported in part by grants: National 863 Program of China (2015AA015405), NSFCs (National Natural Science Foundation of China) (61402128, 61473101, 61173075 and 61272383) and Strategic Emerging Industry Development Special Funds of Shenzhen (JCYJ20140508161040764, JCYJ20140417172417105 and JCYJ20140627163809422)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhao, Y. et al. (2016). ICRC-DSEDL: A Film Named Entity Discovery and Linking System Based on Knowledge Bases. In: Chen, H., Ji, H., Sun, L., Wang, H., Qian, T., Ruan, T. (eds) Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data. CCKS 2016. Communications in Computer and Information Science, vol 650. Springer, Singapore. https://doi.org/10.1007/978-981-10-3168-7_20
Download citation
DOI: https://doi.org/10.1007/978-981-10-3168-7_20
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3167-0
Online ISBN: 978-981-10-3168-7
eBook Packages: Computer ScienceComputer Science (R0)