Skip to main content

Extraction of Class Attributes from Online Encyclopedias

  • Conference paper
  • First Online:
  • 1592 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 481))

Abstract

Class attributes are important resources in question answering, knowledge base building and semantic retrieval. In this paper, we propose an approach extracting class attributes from online encyclopedias. This approach combines the tolerance rough set model and semantic relatedness computing. Firstly, the implementation of the tolerance rough set model ensures a high precision of top-\(k\) extracted class attributes, and then the semantic relatedness computing improves the coverage of top-\(k\) extracted class attributes in order to achieve higher accuracy. Finally experiments on the extracted class attributes show the effectiveness of our approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Pasca, M., Durme, B.V.: What you seek is what you get: extraction of class attributes from query logs. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence Hyderabad, India, pp. 2832–2837(2007)

    Google Scholar 

  2. Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia. In: The 16th International Conference on World Wide Web, Banff, Alberta, Canada, pp. 697–706 (2007)

    Google Scholar 

  3. Pasca, M.: Open-Domain Fine-Grained Class Extraction from Web Search Queries. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 403–414 (2013)

    Google Scholar 

  4. Yoshinaga, N., Torisawa, K.: Open-Domain Attribute-Value Acquisition from Semi-Structured Texts. In: Proceedings of the Workshop on Ontolex, pp. 55–66 (2007)

    Google Scholar 

  5. Medelyan, O., Milne, D., Legg, C., Witten, I.H.: Mining meaning from Wikipedia. International Journal of Human-Computer Studies 67, 716–754 (2009)

    Article  Google Scholar 

  6. Ngo, C.L., Nguyen, H.S.: A Tolerance Rough Set Approach to Clustering Web Search Results. In: The 8th European Conference on Principles and Practice of Knowledge Discovery in Databases, Pisa, Italy, pp. 515–517 (2004)

    Google Scholar 

  7. Guo, H.Z., Chen, Q.C., Cui, L., Wand, X.L.: Tolerance rough set based attribute extraction approach for multiple semantic knowledge base integration. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 19(4), 659–684 (2011)

    Article  MathSciNet  Google Scholar 

  8. Gabrilovich, E., Markovitch, S.: Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis. In: Proceedings of the 20th International Joint Conference on Artifical Intelligence, Hyderabad, India, pp. 16060–1611 (2007)

    Google Scholar 

  9. Cilibrasi, R.L., Vitanyi, P.M.B.: The Google Similarity Distance. IEEE Trans. on Knowl. and Data Eng. 19, 370–383 (2007)

    Article  Google Scholar 

  10. Pedersen, T., Kulkarni, A.: Discovering Identities in Web Contexts with Unsupervised Clustering. In: Proceedings of the IJCAI 2007 Workshop on Analytics for Noisy Unstructured Text Data, Hyderabad, India, pp. 23–30 (2007)

    Google Scholar 

  11. Voorhees, E.M.: Evaluating Answers to Definition Questions. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 2, Edmonton, Canada, pp. 109–111 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hongzhi Guo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Guo, H., Chen, Q., Sun, C. (2014). Extraction of Class Attributes from Online Encyclopedias. In: Wang, X., Pedrycz, W., Chan, P., He, Q. (eds) Machine Learning and Cybernetics. ICMLC 2014. Communications in Computer and Information Science, vol 481. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45652-1_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-45652-1_30

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-45651-4

  • Online ISBN: 978-3-662-45652-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics