Abstract
This paper proposes a novel hierarchical learning strategy that addresses the data sparseness problem in relation extraction by modeling the commonality among related classes. For each class in the hierarchy, whether manually predefined or automatically clustered, a discriminative function is determined in a top-down way. Because an upper-level class normally has many more positive training examples than a lower-level class, its discriminative function can be determined more reliably and effectively, and can thus guide the learning of the discriminative functions at the lower level, which might otherwise suffer from limited training data. In this paper, state-of-the-art Support Vector Machines are applied as the basic classifier learning approach within the hierarchical learning strategy. Evaluation on the ACE RDC 2003 and 2004 corpora shows that the hierarchical learning strategy substantially improves performance on the least frequent and medium-frequent relations.
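The top-down idea can be illustrated with a minimal sketch, which is not the authors' implementation. The abstract does not state how an upper-level discriminative function guides a lower-level one; the sketch assumes that each child's linear SVM starts from, and is regularized toward, its parent's weight vector. The function names, the toy hierarchy, the three-dimensional features, and all hyperparameters are hypothetical.

```python
import random


def score(w, x):
    """Linear discriminative function: dot product of weights and features."""
    return sum(wi * xi for wi, xi in zip(w, x))


def train_linear_svm(examples, prior=None, lam=0.1, epochs=200, lr=0.05, seed=0):
    """Stochastic subgradient descent on hinge loss + (lam/2) * ||w - prior||^2.

    examples: list of (feature_vector, label) with label in {+1, -1}.
    prior: parent's weight vector (assumed guidance mechanism); zeros if None.
    """
    dim = len(examples[0][0])
    prior = list(prior) if prior is not None else [0.0] * dim
    w = list(prior)  # start from the parent's discriminative function
    rng = random.Random(seed)
    data = list(examples)
    for _ in range(epochs):
        rng.shuffle(data)
        for x, y in data:
            margin = y * score(w, x)
            for i in range(dim):
                grad = lam * (w[i] - prior[i])  # pull toward the parent
                if margin < 1:
                    grad -= y * x[i]  # hinge-loss subgradient
                w[i] -= lr * grad
    return w


def ancestors(label, parent):
    """Return the label followed by all of its ancestors in the hierarchy."""
    chain = [label]
    while parent.get(chain[-1]) is not None:
        chain.append(parent[chain[-1]])
    return chain


def train_top_down(parent, data):
    """Train one binary classifier per class, parents before children."""
    classes = sorted(parent, key=lambda c: len(ancestors(c, parent)))
    weights = {}
    for c in classes:
        # An example is positive for c if its label is c or a descendant of c.
        examples = [(x, 1 if c in ancestors(y, parent) else -1) for x, y in data]
        weights[c] = train_linear_svm(examples, prior=weights.get(parent[c]))
    return weights


# Toy two-level hierarchy (hypothetical): one relation type with two subtypes.
parent = {"AT": None, "AT.Located": "AT", "AT.Based-In": "AT"}
# Features: [bias, f1, f2]; "NONE" marks examples outside the hierarchy.
data = [
    ([1.0, 1.0, 1.0], "AT.Located"),
    ([1.0, 1.0, 1.0], "AT.Located"),
    ([1.0, 1.0, 0.0], "AT.Based-In"),
    ([1.0, 0.0, 1.0], "NONE"),
    ([1.0, 0.0, 0.0], "NONE"),
]
weights = train_top_down(parent, data)
```

In this sketch the "AT" classifier sees every subtype example as positive, so it is trained on more positives than either subtype classifier. Each subtype classifier then only has to learn a correction to the weights it inherits, which is one plausible reading of how the upper level could compensate for sparse lower-level data.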
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhou, G., Zhang, M., Fu, G. (2006). Hierarchical Learning Strategy in Relation Extraction Using Support Vector Machines. In: Ng, H.T., Leong, MK., Kan, MY., Ji, D. (eds) Information Retrieval Technology. AIRS 2006. Lecture Notes in Computer Science, vol 4182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11880592_6
DOI: https://doi.org/10.1007/11880592_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45780-0
Online ISBN: 978-3-540-46237-8
eBook Packages: Computer Science