Extraction of Class Attributes from Online Encyclopedias

Guo, Hongzhi; Chen, Qincai; Sun, Chunxiao

doi:10.1007/978-3-662-45652-1_30

Extraction of Class Attributes from Online Encyclopedias

Hongzhi Guo⁵,
Qincai Chen⁶ &
Chunxiao Sun⁵

Conference paper
First Online: 01 January 2014

1592 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 481))

Abstract

Class attributes are important resources in question answering, knowledge base building and semantic retrieval. In this paper, we propose an approach extracting class attributes from online encyclopedias. This approach combines the tolerance rough set model and semantic relatedness computing. Firstly, the implementation of the tolerance rough set model ensures a high precision of top-\(k\) extracted class attributes, and then the semantic relatedness computing improves the coverage of top-\(k\) extracted class attributes in order to achieve higher accuracy. Finally experiments on the extracted class attributes show the effectiveness of our approach.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Pasca, M., Durme, B.V.: What you seek is what you get: extraction of class attributes from query logs. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence Hyderabad, India, pp. 2832–2837(2007)
Google Scholar
Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia. In: The 16th International Conference on World Wide Web, Banff, Alberta, Canada, pp. 697–706 (2007)
Google Scholar
Pasca, M.: Open-Domain Fine-Grained Class Extraction from Web Search Queries. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 403–414 (2013)
Google Scholar
Yoshinaga, N., Torisawa, K.: Open-Domain Attribute-Value Acquisition from Semi-Structured Texts. In: Proceedings of the Workshop on Ontolex, pp. 55–66 (2007)
Google Scholar
Medelyan, O., Milne, D., Legg, C., Witten, I.H.: Mining meaning from Wikipedia. International Journal of Human-Computer Studies 67, 716–754 (2009)
Article Google Scholar
Ngo, C.L., Nguyen, H.S.: A Tolerance Rough Set Approach to Clustering Web Search Results. In: The 8th European Conference on Principles and Practice of Knowledge Discovery in Databases, Pisa, Italy, pp. 515–517 (2004)
Google Scholar
Guo, H.Z., Chen, Q.C., Cui, L., Wand, X.L.: Tolerance rough set based attribute extraction approach for multiple semantic knowledge base integration. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 19(4), 659–684 (2011)
Article MathSciNet Google Scholar
Gabrilovich, E., Markovitch, S.: Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis. In: Proceedings of the 20th International Joint Conference on Artifical Intelligence, Hyderabad, India, pp. 16060–1611 (2007)
Google Scholar
Cilibrasi, R.L., Vitanyi, P.M.B.: The Google Similarity Distance. IEEE Trans. on Knowl. and Data Eng. 19, 370–383 (2007)
Article Google Scholar
Pedersen, T., Kulkarni, A.: Discovering Identities in Web Contexts with Unsupervised Clustering. In: Proceedings of the IJCAI 2007 Workshop on Analytics for Noisy Unstructured Text Data, Hyderabad, India, pp. 23–30 (2007)
Google Scholar
Voorhees, E.M.: Evaluating Answers to Definition Questions. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 2, Edmonton, Canada, pp. 109–111 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Technology, Xidian University, Xi’an, 710071, P.R. China
Hongzhi Guo & Chunxiao Sun
Shenzhen Graduate School, Harbin Institute of Technology, Shenzhen, 518055, P.R. China
Qincai Chen

Authors

Hongzhi Guo
View author publications
You can also search for this author in PubMed Google Scholar
Qincai Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chunxiao Sun
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hongzhi Guo .

Editor information

Editors and Affiliations

Hebei University, Baoding, China
Xizhao Wang
Department of Electrical and Computer En, University of Alberta, Edmonton, Alberta, Canada
Witold Pedrycz
South China University of Technology, Guangzhou, China
Patrick Chan
Hebei University, Baoding, China
Qiang He

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Guo, H., Chen, Q., Sun, C. (2014). Extraction of Class Attributes from Online Encyclopedias. In: Wang, X., Pedrycz, W., Chan, P., He, Q. (eds) Machine Learning and Cybernetics. ICMLC 2014. Communications in Computer and Information Science, vol 481. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45652-1_30

Download citation

DOI: https://doi.org/10.1007/978-3-662-45652-1_30
Published: 05 December 2014
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45651-4
Online ISBN: 978-3-662-45652-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics