Automatic Acquisition of Attributes for Ontology Construction

Cui, Gaoying; Lu, Qin; Li, Wenjie; Chen, Yirong

doi:10.1007/978-3-642-00831-3_23

Automatic Acquisition of Attributes for Ontology Construction

Gaoying Cui²¹,
Qin Lu²¹,
Wenjie Li²¹ &
…
Yirong Chen²¹

Conference paper

860 Accesses
5 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5459))

Abstract

An ontology can be seen as an organized structure of concepts according to their relations. A concept is associated with a set of attributes that themselves are also concepts in the ontology. Consequently, ontology construction is the acquisition of concepts and their associated attributes through relations. Manual ontology construction is time-consuming and difficult to maintain. Corpus-based ontology construction methods must be able to distinguish concepts themselves from concept instances. In this paper, a novel and simple method is proposed for automatically identifying concept attributes through the use of Wikipedia as the corpus. The built-in {{Infobox}} in Wiki is used to acquire concept attributes and identify semantic types of the attributes. Two simple induction rules are applied to improve the performance. Experimental results show precisions of 92.5% for attribute acquisition and 80% for attribute type identification. This is a very promising result for automatic ontology construction.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Almuhareb, A., Poesio, M.: Attribute-Based and Value-Based Clustering: An Evaluation. In: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), Barcelona, Spain (2004)
Google Scholar
Grefenstette, G.: SEXTANT: Extracting semantics from raw text implementation details. Heuristics: The Journal of Knowledge Engineering (1993)
Google Scholar
Lin, D.: Automatic retrieval and clustering of similar words. In: Proceedings of the 17th International Conference on Computational Linguistics and 36th Annual Meeting of the Association for Computational Linguistics (COLING-ACL), Montreal, pp. 768–774 (1998)
Google Scholar
Curran, J.R., Moens, M.: Improvements in automatic thesaurus extraction. In: Proceedings of the ACL-SIGLEX Workshop on Unsupervised Lexical Acquisition, Philadelphia, PA, USA, pp. 59–66 (2002)
Google Scholar
Kilgarriff, A.: Thesauruses for Natural Language Processing. In: Proceedings of the IEEE 2003 International Conference on Natural Language Processing and Knowledge Engineering (NLPKE 2003), Beijing (2003)
Google Scholar
Natalya, F., Noy, Deborah, L.: McGuinness: Ontology Development 101: A Guide to Creating Your First Ontology (2001) (last visited September 20th, 2008), http://protege.stanford.edu/publications/ontology_development/ontology101-noy-mcguinness.html
Niles, I., Pease, A.: Towards a Standard Upper Ontology. In: Proceedings of the Second International Conference on Formal Ontology in Information Systems (FOIS 2001) (2001) (last visited September 20th, 2008), http://home.earthlink.net/~adampease/professional/FOIS.pdf
Chen, Y., Lu, Q., Li, W., Li, W., Ji, L., Cui, G.: Automatic Construction of a Chinese Core Ontology from an English-Chinese Term Bank. In: Proceeding of ISWC 2007 Workshop OntoLex 2007 - From Text to Knowledge: The Lexicon/Ontology Interface, Busan, Korea, pp. 78–87 (2007)
Google Scholar
Lee, C.S., Kao, Y.F., Kuo, Y.H., Wang, M.H.: Automated ontology construction for unstructured text documents. Data & Knowledge Engineering 60, 547–566 (2007)
Article Google Scholar
Yang, Y., Lu, Q., Zhao, T.: A Clustering Based Approach for Domain Relevant Relation Extraction. In: Proceedings of the 2008 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE 2008), Beijing, China, October 19-22 (2008)
Google Scholar
Yoshinaga, N., Torisawa, K.: Open-Domain Attribute-Value Acquisition from Semi-Structured Texts. In: Proceedings of the OntoLex 2007 - From Text to Knowledge: The Lexicon/Ontology Interface, Busan, South-Korea, November 11th (2007)
Google Scholar
Pasca, M., Durme, B.V.: Weakly-supervised Acquisition of Open-domain Classes and Class Attributes from Web Documents and Quey Logs. In: Proceedings of ACL 2008: HLT, Columbus, Ohio, USA, pp. 19–27 (2008)
Google Scholar
Cui, G., Lu, Q., Li, W., Chen, Y.: Corpus Exploitation from Wikipedia. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC 2008), Marrakech, Morocco, May 28-30 (2008)
Google Scholar
Wikipedia (English version), http://en.Wikipedia.org
Poesio, M., Almuhareb, A.: Identifying Concept Attributes Using a Classifier. In: Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition, Ann Arbor, pp. 18–27 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China
Gaoying Cui, Qin Lu, Wenjie Li & Yirong Chen

Authors

Gaoying Cui
View author publications
You can also search for this author in PubMed Google Scholar
Qin Lu
View author publications
You can also search for this author in PubMed Google Scholar
Wenjie Li
View author publications
You can also search for this author in PubMed Google Scholar
Yirong Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computing, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong
Wenjie Li
Division of Information and Communication Sciences, Macquarie University, NSW 2109, Sydney, Australia
Diego Mollá-Aliod

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cui, G., Lu, Q., Li, W., Chen, Y. (2009). Automatic Acquisition of Attributes for Ontology Construction. In: Li, W., Mollá-Aliod, D. (eds) Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy. ICCPOL 2009. Lecture Notes in Computer Science(), vol 5459. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00831-3_23

Download citation

DOI: https://doi.org/10.1007/978-3-642-00831-3_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00830-6
Online ISBN: 978-3-642-00831-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics