An Empirical Research on Extracting Relations from Wikipedia Text

Huang, Jin-Xia; Ryu, Pum-Mo; Choi, Key-Sun

doi:10.1007/978-3-540-88906-9_31

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5326)

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

Abstract

A feature-based relation classification approach is presented, in which probabilistic and semantic relatedness features between patterns and relation types are employed together with other linguistic information. The importance of each feature set is evaluated with a Chi-square estimator, and the experiments show that the relatedness features have a large impact on relation classification performance. A series of experiments is also performed to evaluate different machine learning approaches to relation classification, among which the Bayesian approach outperformed the others, including Support Vector Machine (SVM).
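
As a rough illustration of how a Chi-square estimator can rank features by their association with relation types, the sketch below computes the Pearson Chi-square statistic between a discrete feature and class labels. It is not the authors' implementation: the function name, the relation labels, and the toy feature vectors are all hypothetical and chosen only to show the idea.

```python
from collections import Counter


def chi_square(feature_values, labels):
    """Pearson chi-square statistic between a discrete feature and class labels."""
    n = len(labels)
    observed = Counter(zip(feature_values, labels))  # joint counts
    feature_totals = Counter(feature_values)         # marginal counts per feature value
    label_totals = Counter(labels)                   # marginal counts per class
    stat = 0.0
    for f, f_count in feature_totals.items():
        for c, c_count in label_totals.items():
            # Count expected if feature and class were independent
            expected = f_count * c_count / n
            stat += (observed[(f, c)] - expected) ** 2 / expected
    return stat


# Hypothetical toy data: four instances with two made-up relation types.
labels = ["born_in", "born_in", "works_for", "works_for"]
informative = [1, 1, 0, 0]    # fires only for one relation type
uninformative = [1, 0, 1, 0]  # fires equally often for both types

scores = {
    "informative": chi_square(informative, labels),
    "uninformative": chi_square(uninformative, labels),
}
print(scores)
```

A higher statistic indicates a stronger dependence between the feature and the relation type, so features can be ranked by their scores. On real data, features are usually continuous or high-dimensional and would need discretization or a library implementation.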

Author information

Authors and Affiliations

SWRC, Computer Science Division, EECS Dept. KAIST, 335 Gwahangno, Yuseong-gu, Daejeon, 305-701, Republic of Korea
Jin-Xia Huang, Pum-Mo Ryu & Key-Sun Choi

Editor information

Editors and Affiliations

University of West Scotland, PA1 2BE, Paisley, Scotland
Colin Fyfe
KAIST, Daejeon, Korea
Dongsup Kim
Brain Science Research Center and Department of Bio & Brain Engineering, Korea Advanced Institute of Science and Technology, 373-1 Guseong-dong, Yuseong-gu, 305-701, Daejeon, Korea
Soo-Young Lee
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin

About this paper

Cite this paper

Huang, JX., Ryu, PM., Choi, KS. (2008). An Empirical Research on Extracting Relations from Wikipedia Text. In: Fyfe, C., Kim, D., Lee, SY., Yin, H. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2008. IDEAL 2008. Lecture Notes in Computer Science, vol 5326. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88906-9_31

DOI: https://doi.org/10.1007/978-3-540-88906-9_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88905-2
Online ISBN: 978-3-540-88906-9
eBook Packages: Computer Science, Computer Science (R0)
