Abstract
This article proposes a new approach to dynamically determine the tree span for tree kernel-based semantic relation extraction between named entities. The basic idea is to employ constituent dependency information in keeping the necessary nodes and their head children along the path connecting the two entities in the syntactic parse tree, while removing the noisy information from the tree, eventually leading to a dynamic syntactic parse tree. This article also explores various entity features and their possible combinations via a unified syntactic and semantic tree framework, which integrates both structural syntactic parse information and entity-related semantic information. Evaluation on the ACE RDC 2004 English and 2005 Chinese benchmark corpora shows that our dynamic syntactic parse tree much outperforms all previous tree spans, indicating its effectiveness in well representing the structural nature of relation instances while removing redundant information. Moreover, the unified parse and semantic tree significantly outperforms the single syntactic parse tree, largely due to the remarkable contributions from entity-related semantic features such as its type, subtype, mention-level as well as their bi-gram combinations. Finally, the best performance so far in semantic relation extraction is achieved via a composite kernel, which combines this tree kernel with a linear, state-of-the-art, feature-based kernel.
- ACE. 2002--2008. Automatic content extraction. http://www.ldc.upenn.edu/Projects/ACE/.Google Scholar
- Bunescu, R. C. and Mooney, R. J. 2005. A shortest path dependency kernel for relation extraction. In Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (EMNLP’05). 724--731. Google ScholarDigital Library
- Chang, P. C., Tseng, H., Jurafsky, D., and Christopher, D. M. 2009. Discriminative reordering with Chinese grammatical relations features. In Proceedings of the 3rd Workshop on Syntax and Structure in Statistical Translation (SSST’09). 51--59. Google ScholarDigital Library
- Charniak, E. 2001. Intermediate-head parsing for language models. In Proceedings of the 39th Annual Meeting of the Association of Computational Linguistics (ACL’01). 116--123. Google ScholarDigital Library
- Che, W. X., Jiang, J. M., Su, Z., Pan, Y., and Liu, T. 2005a. Improved-edit-distance kernel for Chinese relation extraction. In Proceedings of the 2nd International Joint Conference on Natural Language Processing (IJCNLP’05).Google Scholar
- Che, W. X., Liu, T., and Li, S. 2005b. Automatic entity relation extraction. J. Chi. Inf. Proc. 19, 2, 1--6.Google Scholar
- Collins, M. 2003. Head-driven statistics models for natural language parsing. Comput. Linguist. 29, 4, 589--617. Google ScholarDigital Library
- Collins, M. and Duffy, N. 2001. Convolution kernels for natural language. In Proceedings of Neural Information Processing Systems (NIPS’01). 625--632.Google Scholar
- Collins, M. and Duffy, N. 2002. New ranking algorithms for parsing and tagging: Kernels over discrete structures, and the voted perceptron. In Proceedings of the 42nd Annual Meeting of the Association of Computational Linguistics (ACL’02). Google ScholarDigital Library
- Culotta, A. and Sorensen, J. 2004. Dependency tree kernels for relation extraction. In Proceedings of the 42nd Annual Meeting of the Association of Computational Linguistics (ACL’04). 423--439. Google ScholarDigital Library
- Dong, J., Sun, L., Feng, Y. H., and Huang, R. H. 2007. Chinese automatic entity relation extraction. J. Chi. Inf. Proc. 21, 4, 80--85, 91.Google Scholar
- Grishman, R. and Sundheim, B. 1996. Message understanding Conference-6: A brief history. In Proceedings of the 16th Conference on Computational Linguistics (COLING’96). 466--471. Google ScholarDigital Library
- Haussler, D. 1999. Convolution Kernels on Discrete Structures. Tech. rep. UCS-CRL-99-10, University of California, Santa Cruz.Google Scholar
- Huang, R. H., Sun, L., and Feng, Y. Y. 2008. Study of kernel-based methods for Chinese relation extraction. Lecture Notes in Computer Science 4993: 598--604. Springer: Berlin/Heidelberg. Google ScholarDigital Library
- Jiang, J. and Zhai, C. X. 2007. A systematic exploration of the feature space for relation extraction. In Proceedings of the Human Language Technology Conference/North American Chapter of the Association for Computational Linguistics (HLT-NAACL’07). 113--120.Google Scholar
- Joachims, T. 1998. Text categorization with support vector machine: Learning with many relevant features. In Proceedings of the 10th European Conference on Machine Learning (ECML’98). 137--142. Google ScholarDigital Library
- Kambhatla, N. 2004. Combining lexical, syntactic and semantic features with maximum entropy models for extracting relations. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL’04). 178--181. Google ScholarDigital Library
- Li, W. J., Zhang, P., Wei, F. R., Hou, Y. X., and Lu, Q. 2008. A novel feature-based approach to Chinese entity relation extraction. In Proceedings of the Human Language Technology Conference (HLT’08). 89--92. Google ScholarDigital Library
- Liu, K. B., Li, F., Liu, L., and Hang, Y. 2007. Implementation of a kernel-based Chinese relation extraction system. J. Comput. Res. Dev. 44, 8, 1406--1411.Google ScholarCross Ref
- Lodhi, H., Saunders, C., Shaw-Taylor, J., Cristianini, N., and Watkins, C. 2002. Text classification using string kernel. J. Mach. Learn. Res. 2, 419--444. Google ScholarDigital Library
- Moschitti, A. 2004. A study on convolution kernels for shallow semantic parsing. In Proceedings of the 42nd Annual Meeting of the Association of Computational Linguistics (ACL’04). Google ScholarDigital Library
- Moschitti, A. 2006. Making tree kernels practical for natural language learning. In Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL’06). 113--120.Google Scholar
- Moschitti, A., Pighin, D., and Basili, R. 2006. Tree kernel engineering in semantic role labeling systems. In Proceedings of the Workshop on Learning Structured Information for Natural Language Applications, the Conference of the European Chapter of the Association for Computational Linguistics (EACL’06). 49--56.Google Scholar
- Moschitti, A., Pighin, D., and Basili, R. 2008. Tree kernels for semantic role labeling, special issue on semantic role labeling. Comput. Linguist. 34, 2, 194--224. Google ScholarDigital Library
- MUC. 1987--1998. http://www.itl.nist.gov/iaui/894.02/related_projects/muc/.Google Scholar
- MUC-7. 1998. Proceedings of the 7th Message Understanding Conference (MUC’98). Morgan Kaufmann, San Mateo, CA.Google Scholar
- Nguyen, T. T., Moschitti, A., and Riccardi, G. 2009. Convolution kernels on constituent, dependency and sequential structures for relation extraction. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’09). 1378--1387. Google ScholarDigital Library
- Qian, L. H., Zhou, G. D. Zhu, Q. M., and Qian, P. D. 2007. Relation extraction using convolution tree kernel expanded with entity features. In Proceedings of the 21st Pacific Asian Conference on Language, Information and Computation (PACLIC’07). 415--421.Google Scholar
- Schölkopf, B. and Smola, A. J. 2001. Learning with Kernels: SVM, Regularization, Optimization and Beyond. MIT Press, Cambridge, MA, 407--423. Google ScholarDigital Library
- Suzuki, J., Hirao, T., Sasaki, Y., and Maeda, E. 2003. Hierarchical directed acyclic graph kernel: Methods for structured natural language data. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL’03). 32--39. Google ScholarDigital Library
- Swampillai, K. and Stevenson, M. 2010. Inter-sentential relations in information extraction corpora. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC’10). 2637--2641.Google Scholar
- Yu, H. H., Qian, L. H., Zhou, G. D., and Zhu, Q. M. 2010. Chinese semantic relation extraction based on unified syntactic and entity semantic tree. J. Chi. Inf. Proc. 24, 5, 17--23.Google Scholar
- Zelenko, D., Aone, C., and Richardella, A. 2003. Kernel methods for relation extraction. J. Mach. Learn. Res. 3, 1083--1106. Google ScholarDigital Library
- Zhang, M., Zhang, M., Su, J., and Zhou, G. D. 2006. A composite kernel to extract relations between entities with both flat and structured features. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association of Computational Linguistics (COLING/ACL’06). 825--832. Google ScholarDigital Library
- Zhang, M., Zhou, G. D., and Aw, A. T. 2008a. Exploring syntactic structured features over parse trees for relation extraction using kernel methods. Inf. Proc. Man. 44, 687--701. Google ScholarDigital Library
- Zhang, M., Che, W. X., Zhou, G. D., Aw, A. T., Tan, C. L., Liu, T., and Li, S. 2008b. Semantic role labeling using a grammar-driven convolution tree kernel. IEEE Trans. Audio, Speech Lang. Proc. 16, 7, 1315--1329. Google ScholarDigital Library
- Zhang, J., Ouyang, Y., Li, Y., and Hou, Y. X. 2009. A novel composite approach to Chinese relation extraction. In Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages (ICCPOL’09).Google Scholar
- Zhao, S. B. and Grishman, R. 2005. Extracting relations with integrated information using kernel methods. In Proceedings of the 43rd Annual Meeting of the Association of Computational Linguistics (ACL’05). 419--426. Google ScholarDigital Library
- Zhou, G. D. and Zhang, M. 2007. Extracting relation information from text documents by exploring various types of knowledge. Inf. Proc. Man. 43, 969--982. Google ScholarDigital Library
- Zhou, G. D., Su, J., Zhang, J., and Zhang, M. 2005. Exploring various knowledge in relation extraction. In Proceedings of the 43rd Annual Meeting of the Association of Computational Linguistics (ACL’05). 427--434. Google ScholarDigital Library
- Zhou, G. D., Zhang, M., Ji, D. H., and Zhu, Q. M. 2007. Tree kernel-based relation extraction with context-sensitive structured parse tree information. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP/CoNLL’07). 728--736.Google Scholar
- Zhou, G. D., Qian, L. H., and Zhu, Q. M. 2009. Label propagation via bootstrapped support vectors for semantic relation extraction between named entities. Comput. Speech Lang. 23, 4, 464--478. Google ScholarDigital Library
Index Terms
- Employing Constituent Dependency Information for Tree Kernel-Based Semantic Relation Extraction between Named Entities
Recommendations
Tree kernel-based semantic relation extraction with rich syntactic and semantic information
This paper proposes a novel tree kernel-based method with rich syntactic and semantic information for the extraction of semantic relations between named entities. With a parse tree and an entity pair, we first construct a rich semantic relation tree ...
Tree Kernel-Based Semantic Relation Extraction Using Unified Dynamic Relation Tree
ALPIT '08: Proceedings of the 2008 International Conference on Advanced Language Processing and Web Information TechnologyThis paper proposes a Unified Dynamic Relation Tree (DRT) span for tree kernel-based semantic relation extraction between entity names. The basic idea is to apply a variety of linguistics-driven rules to dynamically prune out noisy information from a ...
Exploiting constituent dependencies for tree kernel-based semantic relation extraction
COLING '08: Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1This paper proposes a new approach to dynamically determine the tree span for tree kernel-based semantic relation extraction. It exploits constituent dependencies to keep the nodes and their head children along the path connecting the two entities, ...
Comments