Contextual Ontological Concepts Extraction

Karoui, Lobna; Bennacer, Nacéra; Aufaure, Marie-Aude

doi:10.1007/11893318_32


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4265)

Included in the following conference series:

International Conference on Discovery Science


Abstract

Ontologies provide a common layer that plays a major role in supporting information exchange and sharing. In this paper, we focus on extracting ontological concepts from HTML documents. We propose an unsupervised hierarchical clustering algorithm named “Contextual Ontological Concept Extraction” (COCE), which applies a partitioning algorithm incrementally and is guided by a structural context. This context exploits the HTML structure and the location of words to select the semantically closest co-occurrents of each word and to improve word weighting. Guided by this context definition, we perform an incremental clustering that refines the word context of each cluster in order to extract semantically meaningful concepts. The COCE algorithm can be executed either automatically or interactively. We evaluate the COCE algorithm on French documents related to tourism. Our results show that our context-based algorithm improves the conceptual quality of the clusters.
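
The abstract does not spell out the details of COCE, so the following is only a minimal sketch of the general idea it describes: co-occurrence vectors weighted by a structural context, followed by the incremental use of a partitioning step. Everything specific here is an assumption made for illustration and is not taken from the paper: the `TAG_WEIGHTS` values, the use of cosine similarity, the deterministic 2-means split, the `max_size` stopping rule, and all function names.

```python
from collections import defaultdict
from math import sqrt

# Assumed weights per HTML context (illustrative values, not from the paper).
TAG_WEIGHTS = {"title": 3.0, "h1": 2.0, "p": 1.0}

def context_vectors(blocks):
    """Map each word to co-occurrence counts within its block, weighted by tag."""
    vectors = defaultdict(lambda: defaultdict(float))
    for tag, text in blocks:
        words = text.lower().split()
        weight = TAG_WEIGHTS.get(tag, 1.0)
        for w in words:
            for other in words:
                if other != w:
                    vectors[w][other] += weight
    return {w: dict(v) for w, v in vectors.items()}

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(x * v.get(k, 0.0) for k, x in u.items())
    nu = sqrt(sum(x * x for x in u.values()))
    nv = sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def centroid(words, vectors):
    """Average the sparse vectors of the given words."""
    total = defaultdict(float)
    for w in words:
        for k, x in vectors[w].items():
            total[k] += x / len(words)
    return dict(total)

def two_means(words, vectors, iterations=10):
    """Split words into two groups with a deterministic 2-means (assumed step)."""
    first = words[0]
    # Seed the second center with the word least similar to the first.
    second = min(words, key=lambda w: cosine(vectors[first], vectors[w]))
    centers = [vectors[first], vectors[second]]
    groups = [[], []]
    for _ in range(iterations):
        groups = [[], []]
        for w in words:
            sims = [cosine(vectors[w], c) for c in centers]
            groups[0 if sims[0] >= sims[1] else 1].append(w)
        if not groups[0] or not groups[1]:
            break
        centers = [centroid(g, vectors) for g in groups]
    return groups

def incremental_clustering(vectors, max_size):
    """Re-partition every cluster larger than max_size until none remains."""
    pending, done = [sorted(vectors)], []
    while pending:
        cluster = pending.pop()
        if len(cluster) <= max_size:
            done.append(cluster)
            continue
        left, right = two_means(cluster, vectors)
        if not left or not right:  # the cluster could not be split further
            done.append(cluster)
        else:
            pending.extend([left, right])
    return done

# Toy input: (tag, text) pairs standing in for parsed HTML blocks.
blocks = [
    ("title", "hotel room"),
    ("p", "hotel room booking"),
    ("h1", "beach sea"),
    ("p", "beach sea sand"),
]
clusters = incremental_clustering(context_vectors(blocks), max_size=3)
print(clusters)
```

In this toy run, words that co-occur in a title or heading receive a larger weight than words that only share a paragraph, which is one simple way to let document structure influence word weighting. The interactive mode mentioned in the abstract is not modelled; a user-supplied decision could replace the `max_size` test in such a sketch.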




Author information

Authors and Affiliations

Ecole Supérieure d’Electricité, Plateau de Moulon 3 rue Joliot Curie, 91192 cedex, Gif-sur-Yvette, France
Lobna Karoui, Nacéra Bennacer & Marie-Aude Aufaure


Editor information

Editors and Affiliations

Jozef Stefan Institute, Jamova 39, 1000, Ljubljana, Slovenia
Ljupčo Todorovski
University of Nova Gorica, Nova Gorica, Slovenia
Nada Lavrač
Meme Media Laboratory, Hokkaido University Sapporo, Kita 13, Nishi 8, Kita-ku, P.O. Box, 060-8628, Sapporo, Japan
Klaus P. Jantke

About this paper

Cite this paper

Karoui, L., Bennacer, N., Aufaure, MA. (2006). Contextual Ontological Concepts Extraction. In: Todorovski, L., Lavrač, N., Jantke, K.P. (eds) Discovery Science. DS 2006. Lecture Notes in Computer Science, vol 4265. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11893318_32


DOI: https://doi.org/10.1007/11893318_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46491-4
Online ISBN: 978-3-540-46493-8
eBook Packages: Computer Science, Computer Science (R0)
