Hierarchical Learning Strategy in Relation Extraction Using Support Vector Machines

Zhou, GuoDong; Zhang, Min; Fu, Guohong

doi:10.1007/11880592_6

GuoDong Zhou^20,21,
Min Zhang²¹ &
Guohong Fu²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4182))

Included in the following conference series:

Asia Information Retrieval Symposium

991 Accesses
1 Citations

Abstract

This paper proposes a novel hierarchical learning strategy to deal with the data sparseness problem in relation extraction by modeling the commonality among related classes. For each class in the hierarchy either manually predefined or automatically clustered, a discriminative function is determined in a top-down way. As the upper-level class normally has much more positive training examples than the lower-level class, the corresponding discriminative function can be determined more reliably and effectively, and thus guide the discriminative function learning in the lower-level, which otherwise might suffer from limited training data. In this paper, the state-of-the-art Support Vector Machines is applied as the basic classifier learning approach using the hierarchical learning strategy. Evaluation on the ACE RDC 2003 and 2004 corpora shows that the hierarchical learning strategy much improves the performance on least- and medium- frequent relations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Improving Relation Classification Using Relation Hierarchy

Classifier-Based Pattern Selection Approach for Relation Instance Extraction

A Benchmark for Relation Extraction Kernels

References

ACE (2000-2005). Automatic Content Extraction, http://www.ldc.upenn.edu/Projects/ACE/
Bunescu, R., Mooney, R.J.: A shortest path dependency kernel for relation extraction. In: HLT/EMNLP 2005, Vancover, B.C, October 6-8, pp. 724–731 (2005)
Google Scholar
Collins, M.: Head-driven statistical models for natural language parsing. Ph.D. Dissertation, University of Pennsylvania (1999)
Google Scholar
Culotta, A., Sorensen, J.: Dependency tree kernels for relation extraction. In: ACL 2004, Barcelona, Spain, July 21-26, pp. 423–429 (2004)
Google Scholar
Miller, G.A.: WordNet: An online lexical database. International Journal of Lexicography 3(4), 235–312 (1990)
Article Google Scholar
Miller, S., Fox, H., Ramshaw, L., Weischedel, R.: A novel use of statistical parsing to extract information from text. In: ANLP 2000, Seattle, USA, April 29 - May 4, pp. 226–233 (2000)
Google Scholar
MUC-7. In: Proceedings of the 7th Message Understanding Conference (MUC-7). Morgan Kaufmann, San Francisco (1998)
Google Scholar
Kambhatla, N.: Combining lexical, syntactic and semantic features with Maximum Entropy models for extracting relations. In: ACL 2004 (Poster), Barcelona, Spain, July 21-26, pp. 178–181 (2004)
Google Scholar
Platt, J.: Probabilistic Outputs for Support Vector Machines and Comparisions to regularized Likelihood Methods. In: Smola, J., Bartlett, P., Scholkopf, B., Schuurmans, D. (eds.) Advances in Large Margin Classifiers. MIT Press, Cambridge (1999)
Google Scholar
Zelenko, D., Aone, C., Richardella, A.: Kernel methods for relation extraction. Journal of Machine Learning Research 3, 1083–1106 (2003)
Article MATH MathSciNet Google Scholar
Zhang, M., Su, J., Wang, D.M., Zhou, G.D., Tan, C.L.: Discovering relations from a large raw corpus using tree similarity-based clustering. In: Dale, R., Wong, K.-F., Su, J., Kwong, O.Y. (eds.) IJCNLP 2005. LNCS (LNAI), vol. 3651, pp. 378–389. Springer, Heidelberg (2005)
Chapter Google Scholar
Zhao, S.B., Grisman, R.: Extracting relations with integrated information using kernel methods. In: ACL 2005, June 25-30, pp. 419–426. Univ. of Michgan-Ann Arbor, USA (2005)
Chapter Google Scholar
Zhou, G.D., Su, J., Zhang, J., Zhang, M.: Exploring various knowledge in relation extraction. In: ACL 2005, Ann Arbor, Michgan, USA, June 25-30, pp. 427–434 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Technology, Suzhou University, 215006, China
GuoDong Zhou
Institute for Infocomm Research, 119613, Singapore
GuoDong Zhou & Min Zhang
Department of Linguistics, The University of Hong Kong, Hong Kong
Guohong Fu

Authors

GuoDong Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Min Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Guohong Fu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, National University of Singapore, 3 Science Drive 2, 117543, Singapore
Hwee Tou Ng
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Mun-Kew Leong
Department of Computer Science, School of Computing, National University of Singapore, 117543, Singapore
Min-Yen Kan
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, P.O. Box, 119613, Singapore
Donghong Ji

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhou, G., Zhang, M., Fu, G. (2006). Hierarchical Learning Strategy in Relation Extraction Using Support Vector Machines. In: Ng, H.T., Leong, MK., Kan, MY., Ji, D. (eds) Information Retrieval Technology. AIRS 2006. Lecture Notes in Computer Science, vol 4182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11880592_6

Download citation

DOI: https://doi.org/10.1007/11880592_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45780-0
Online ISBN: 978-3-540-46237-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics