Abstract
The increasing need for ontologies and the difficulties of manual construction give place to initiatives proposing methods for automatic and semi-automatic ontology learning. In this work we present a semi-automatic method for domain ontologies extraction from Wikipedia’s categories. In order to validate the method, we have conducted a case study in which we implemented a prototype generating a Tourism ontology. The results are evaluated against a manually built Golden Standard reporting 79.51% Precision and 91.95% Recall, comparable to those found in the literature for other languages.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Zareen, S., et al.: Wikipedia as an Ontology for Describing Documents.In: Proceedings of the Second International Conference on Weblogs and Social Media (2008)
Wu, F., Weld, D.S.: Autonomously semantifying wikipedia. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management (CIKM 2007), pp. 41–50. ACM, New York (2007)
Gruber, T.R.: A translation approach to portable ontology specifications. Knowl. Acquis. 5(2), 199–220 (1993)
Guarino, N.: Formal Ontology. In: Proceedings of the 1st International Conference on Information Systems. IOS Press, Trento (1998)
Smith, B., Welty, C.: FOIS introduction: Ontology- towards a new synthesis. In: Proceedings of the International Conference on Formal Ontology in Information Systems, Ogunquit, Maine, USA, October 17-19. ACM, New York (2001)
Krötzsch, M., Cić, D., Völkel, M.: Wikipedia and semantic web - the missing links (2005)
Maedche, A.D.: Ontology Learning for the Semantic Web. Kluwer Academic Publishers, Dordrecht (2002)
Hepp, M., Bachlechner, D., Siorpaer, K.: Harvesting Wiki Consensus - Using Wikipedia Entries as Ontology Elements. IEEE Internet Computing 11(5), 54–65 (2007)
Lima, V., Nunes, M., Vieira, R.: Desafios do Processamento de Línguas Naturais. In: SEMISH - XXXIV Seminário Integrado de Software e Hardware, Anais do XXVII Congresso da SBC, Rio de Janeiro (2007)
Völkel, M., Krötzsch, M., Vrandecic, D., Haller, H., Studer, R.: Semantic Wikipédia. In: Proceedings of the 15th International Conference on World Wide Web, pp. 585–594 (2006)
Wu, F., Weld, D.S.: Automatically refining the wikipedia infobox ontology. In: Proc. of the 17th Int. Conf. on WWW, pp. 635–644. ACM, New York (2008)
Ponzetto, S., Strube, M.: Knowledge Derived from Wikipedia for Computing Semantic Relatedness. Journal of Artificial Intelligence Research 30, 181–212 (2007)
Ponzetto, S.P., Strube, M.: Deriving a large scale taxonomy from Wikipedia. In: Proceedings of the 22nd National Conference on Artificial Intelligence, vol. 2, pp. 1440–1445. AAAI Press, Menlo Park (2007)
Nastase, V., Strube, M.: Decoding Wikipedia Categories for Knowledge Acquisition. In: Twenty-Third AAAI Conference on Artificial Intelligence, pp.1219–1224 (2008)
Ponzetto, S.P., Strube, M.: WikiTaxonomy: A Large Scale Knowledge Resource. In: Ghallab, M., Spyropoulos, C.D., Fakotakis, N., Avouris, N. (eds.) Proceeding of the 2008 Conference on ECAI 2008. Frontiers in Artificial Intelligence and Applications, vol. 178, pp. 751–752. IOS Press, Amsterdam (2008)
Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: A Large Ontology from Wikipedia and WordNet. Web Semant. 6(3), 203–217 (2008)
Syed, Z., Finin, T., Joshi, A.: Wikipedia as an Ontology for Describing Documents. In: Proceedings of the International Conference on Weblogs and Social Media (2008)
Zirn, C., Nastase, V., Strube, M.: Distinguishing between Instances and Classes in the Wikipedia Taxonomy. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds.) ESWC 2008. LNCS, vol. 5021, pp. 376–387. Springer, Heidelberg (2008)
Nastase, V., Strube, M.: Decoding Wikipedia Categories for Knowledge Acquisition. In: AAAI 2008, pp. 1219–1224 (2008)
Gruber, T.R.: Ontolingua: A Mechanism to Support Portable Ontologies. Technical Report (1992)
Horridge, C., Knublauch, H., Rector, A., Stevens, R., Wroe, C.: A practical guide to building OWL Ontologies using the Protégé OWL Plug-in and CO-ODE Tools (2004), http://www.co-ode.org/resources/tutorials/ProtegeOWLTutorial.pdf
Miller, G.A., Hristea, F.: WordNet nouns: Classes and instances. Computational Linguistics 32(1), 1–3 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xavier, C.C., de Lima, V.L.S. (2010). A Semi-automatic Method for Domain Ontology Extraction from Portuguese Language Wikipedia’s Categories. In: da Rocha Costa, A.C., Vicari, R.M., Tonidandel, F. (eds) Advances in Artificial Intelligence – SBIA 2010. SBIA 2010. Lecture Notes in Computer Science(), vol 6404. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16138-4_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-16138-4_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16137-7
Online ISBN: 978-3-642-16138-4
eBook Packages: Computer ScienceComputer Science (R0)