skip to main content
research-article

Employing Constituent Dependency Information for Tree Kernel-Based Semantic Relation Extraction between Named Entities

Published:01 September 2011Publication History
Skip Abstract Section

Abstract

This article proposes a new approach to dynamically determine the tree span for tree kernel-based semantic relation extraction between named entities. The basic idea is to employ constituent dependency information in keeping the necessary nodes and their head children along the path connecting the two entities in the syntactic parse tree, while removing the noisy information from the tree, eventually leading to a dynamic syntactic parse tree. This article also explores various entity features and their possible combinations via a unified syntactic and semantic tree framework, which integrates both structural syntactic parse information and entity-related semantic information. Evaluation on the ACE RDC 2004 English and 2005 Chinese benchmark corpora shows that our dynamic syntactic parse tree much outperforms all previous tree spans, indicating its effectiveness in well representing the structural nature of relation instances while removing redundant information. Moreover, the unified parse and semantic tree significantly outperforms the single syntactic parse tree, largely due to the remarkable contributions from entity-related semantic features such as its type, subtype, mention-level as well as their bi-gram combinations. Finally, the best performance so far in semantic relation extraction is achieved via a composite kernel, which combines this tree kernel with a linear, state-of-the-art, feature-based kernel.

References

  1. ACE. 2002--2008. Automatic content extraction. http://www.ldc.upenn.edu/Projects/ACE/.Google ScholarGoogle Scholar
  2. Bunescu, R. C. and Mooney, R. J. 2005. A shortest path dependency kernel for relation extraction. In Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (EMNLP’05). 724--731. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chang, P. C., Tseng, H., Jurafsky, D., and Christopher, D. M. 2009. Discriminative reordering with Chinese grammatical relations features. In Proceedings of the 3rd Workshop on Syntax and Structure in Statistical Translation (SSST’09). 51--59. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Charniak, E. 2001. Intermediate-head parsing for language models. In Proceedings of the 39th Annual Meeting of the Association of Computational Linguistics (ACL’01). 116--123. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Che, W. X., Jiang, J. M., Su, Z., Pan, Y., and Liu, T. 2005a. Improved-edit-distance kernel for Chinese relation extraction. In Proceedings of the 2nd International Joint Conference on Natural Language Processing (IJCNLP’05).Google ScholarGoogle Scholar
  6. Che, W. X., Liu, T., and Li, S. 2005b. Automatic entity relation extraction. J. Chi. Inf. Proc. 19, 2, 1--6.Google ScholarGoogle Scholar
  7. Collins, M. 2003. Head-driven statistics models for natural language parsing. Comput. Linguist. 29, 4, 589--617. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Collins, M. and Duffy, N. 2001. Convolution kernels for natural language. In Proceedings of Neural Information Processing Systems (NIPS’01). 625--632.Google ScholarGoogle Scholar
  9. Collins, M. and Duffy, N. 2002. New ranking algorithms for parsing and tagging: Kernels over discrete structures, and the voted perceptron. In Proceedings of the 42nd Annual Meeting of the Association of Computational Linguistics (ACL’02). Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Culotta, A. and Sorensen, J. 2004. Dependency tree kernels for relation extraction. In Proceedings of the 42nd Annual Meeting of the Association of Computational Linguistics (ACL’04). 423--439. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Dong, J., Sun, L., Feng, Y. H., and Huang, R. H. 2007. Chinese automatic entity relation extraction. J. Chi. Inf. Proc. 21, 4, 80--85, 91.Google ScholarGoogle Scholar
  12. Grishman, R. and Sundheim, B. 1996. Message understanding Conference-6: A brief history. In Proceedings of the 16th Conference on Computational Linguistics (COLING’96). 466--471. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Haussler, D. 1999. Convolution Kernels on Discrete Structures. Tech. rep. UCS-CRL-99-10, University of California, Santa Cruz.Google ScholarGoogle Scholar
  14. Huang, R. H., Sun, L., and Feng, Y. Y. 2008. Study of kernel-based methods for Chinese relation extraction. Lecture Notes in Computer Science 4993: 598--604. Springer: Berlin/Heidelberg. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Jiang, J. and Zhai, C. X. 2007. A systematic exploration of the feature space for relation extraction. In Proceedings of the Human Language Technology Conference/North American Chapter of the Association for Computational Linguistics (HLT-NAACL’07). 113--120.Google ScholarGoogle Scholar
  16. Joachims, T. 1998. Text categorization with support vector machine: Learning with many relevant features. In Proceedings of the 10th European Conference on Machine Learning (ECML’98). 137--142. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Kambhatla, N. 2004. Combining lexical, syntactic and semantic features with maximum entropy models for extracting relations. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL’04). 178--181. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Li, W. J., Zhang, P., Wei, F. R., Hou, Y. X., and Lu, Q. 2008. A novel feature-based approach to Chinese entity relation extraction. In Proceedings of the Human Language Technology Conference (HLT’08). 89--92. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Liu, K. B., Li, F., Liu, L., and Hang, Y. 2007. Implementation of a kernel-based Chinese relation extraction system. J. Comput. Res. Dev. 44, 8, 1406--1411.Google ScholarGoogle ScholarCross RefCross Ref
  20. Lodhi, H., Saunders, C., Shaw-Taylor, J., Cristianini, N., and Watkins, C. 2002. Text classification using string kernel. J. Mach. Learn. Res. 2, 419--444. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Moschitti, A. 2004. A study on convolution kernels for shallow semantic parsing. In Proceedings of the 42nd Annual Meeting of the Association of Computational Linguistics (ACL’04). Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Moschitti, A. 2006. Making tree kernels practical for natural language learning. In Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL’06). 113--120.Google ScholarGoogle Scholar
  23. Moschitti, A., Pighin, D., and Basili, R. 2006. Tree kernel engineering in semantic role labeling systems. In Proceedings of the Workshop on Learning Structured Information for Natural Language Applications, the Conference of the European Chapter of the Association for Computational Linguistics (EACL’06). 49--56.Google ScholarGoogle Scholar
  24. Moschitti, A., Pighin, D., and Basili, R. 2008. Tree kernels for semantic role labeling, special issue on semantic role labeling. Comput. Linguist. 34, 2, 194--224. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. MUC. 1987--1998. http://www.itl.nist.gov/iaui/894.02/related_projects/muc/.Google ScholarGoogle Scholar
  26. MUC-7. 1998. Proceedings of the 7th Message Understanding Conference (MUC’98). Morgan Kaufmann, San Mateo, CA.Google ScholarGoogle Scholar
  27. Nguyen, T. T., Moschitti, A., and Riccardi, G. 2009. Convolution kernels on constituent, dependency and sequential structures for relation extraction. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’09). 1378--1387. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Qian, L. H., Zhou, G. D. Zhu, Q. M., and Qian, P. D. 2007. Relation extraction using convolution tree kernel expanded with entity features. In Proceedings of the 21st Pacific Asian Conference on Language, Information and Computation (PACLIC’07). 415--421.Google ScholarGoogle Scholar
  29. Schölkopf, B. and Smola, A. J. 2001. Learning with Kernels: SVM, Regularization, Optimization and Beyond. MIT Press, Cambridge, MA, 407--423. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Suzuki, J., Hirao, T., Sasaki, Y., and Maeda, E. 2003. Hierarchical directed acyclic graph kernel: Methods for structured natural language data. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL’03). 32--39. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Swampillai, K. and Stevenson, M. 2010. Inter-sentential relations in information extraction corpora. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC’10). 2637--2641.Google ScholarGoogle Scholar
  32. Yu, H. H., Qian, L. H., Zhou, G. D., and Zhu, Q. M. 2010. Chinese semantic relation extraction based on unified syntactic and entity semantic tree. J. Chi. Inf. Proc. 24, 5, 17--23.Google ScholarGoogle Scholar
  33. Zelenko, D., Aone, C., and Richardella, A. 2003. Kernel methods for relation extraction. J. Mach. Learn. Res. 3, 1083--1106. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Zhang, M., Zhang, M., Su, J., and Zhou, G. D. 2006. A composite kernel to extract relations between entities with both flat and structured features. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association of Computational Linguistics (COLING/ACL’06). 825--832. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Zhang, M., Zhou, G. D., and Aw, A. T. 2008a. Exploring syntactic structured features over parse trees for relation extraction using kernel methods. Inf. Proc. Man. 44, 687--701. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Zhang, M., Che, W. X., Zhou, G. D., Aw, A. T., Tan, C. L., Liu, T., and Li, S. 2008b. Semantic role labeling using a grammar-driven convolution tree kernel. IEEE Trans. Audio, Speech Lang. Proc. 16, 7, 1315--1329. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Zhang, J., Ouyang, Y., Li, Y., and Hou, Y. X. 2009. A novel composite approach to Chinese relation extraction. In Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages (ICCPOL’09).Google ScholarGoogle Scholar
  38. Zhao, S. B. and Grishman, R. 2005. Extracting relations with integrated information using kernel methods. In Proceedings of the 43rd Annual Meeting of the Association of Computational Linguistics (ACL’05). 419--426. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Zhou, G. D. and Zhang, M. 2007. Extracting relation information from text documents by exploring various types of knowledge. Inf. Proc. Man. 43, 969--982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Zhou, G. D., Su, J., Zhang, J., and Zhang, M. 2005. Exploring various knowledge in relation extraction. In Proceedings of the 43rd Annual Meeting of the Association of Computational Linguistics (ACL’05). 427--434. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Zhou, G. D., Zhang, M., Ji, D. H., and Zhu, Q. M. 2007. Tree kernel-based relation extraction with context-sensitive structured parse tree information. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP/CoNLL’07). 728--736.Google ScholarGoogle Scholar
  42. Zhou, G. D., Qian, L. H., and Zhu, Q. M. 2009. Label propagation via bootstrapped support vectors for semantic relation extraction between named entities. Comput. Speech Lang. 23, 4, 464--478. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Employing Constituent Dependency Information for Tree Kernel-Based Semantic Relation Extraction between Named Entities

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian Language Information Processing
      ACM Transactions on Asian Language Information Processing  Volume 10, Issue 3
      September 2011
      114 pages
      ISSN:1530-0226
      EISSN:1558-3430
      DOI:10.1145/2002980
      Issue’s Table of Contents

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 September 2011
      • Accepted: 1 April 2011
      • Revised: 1 February 2011
      • Received: 1 November 2010
      Published in talip Volume 10, Issue 3

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader