Query-driven approach of contextual ontology module learning using web snippets

Journal of Intelligent Information Systems

Abstract

The main objective of this work is to automatically build ontology modules that cover users' search terms in ontology-based question answering on the Web. Emerging approaches to ontology module extraction aim to identify candidate ontology fragments that are relevant to a given application. Their main limitation is that they take only predefined ontologies as input and ignore the underlying semantics expressed in texts. This work proposes an approach to contextual ontology module learning that covers particular search terms by analyzing past user queries and by searching the web snippets returned by traditional search engines. The resulting contextual modules are then used for query reformulation. The proposal has been evaluated against two criteria: the semantic cotopy measure of the discovered ontology modules, and the precision of the search results obtained when the resulting ontology modules are used for query reformulation. The experiments were carried out on two case studies: open-domain web search and the medical digital library “PubMed”.
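The two evaluation criteria named in the abstract can be illustrated with a minimal sketch. It assumes the commonly used definition of semantic cotopy (a concept together with all of its super- and sub-concepts) and the standard definition of precision (relevant retrieved results divided by all retrieved results). The exact formulation used in the article is not shown in this preview, so the taxonomy representation, the function names, and the overlap ratio below are illustrative assumptions rather than the authors' implementation.

```python
from typing import Dict, List, Set

# Hypothetical representation: each concept maps to its direct super-concepts.
Taxonomy = Dict[str, Set[str]]


def ancestors(concept: str, taxonomy: Taxonomy) -> Set[str]:
    """Return all direct and indirect super-concepts of `concept`."""
    seen: Set[str] = set()
    stack = list(taxonomy.get(concept, ()))
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(taxonomy.get(node, ()))
    return seen


def semantic_cotopy(concept: str, taxonomy: Taxonomy) -> Set[str]:
    """The concept itself plus all of its super- and sub-concepts."""
    all_concepts = set(taxonomy) | {p for ps in taxonomy.values() for p in ps}
    subs = {c for c in all_concepts if concept in ancestors(c, taxonomy)}
    return ancestors(concept, taxonomy) | subs | {concept}


def taxonomic_overlap(concept: str, learned: Taxonomy, reference: Taxonomy) -> float:
    """Assumed comparison: Jaccard ratio of the two cotopies of one concept."""
    a = semantic_cotopy(concept, learned)
    b = semantic_cotopy(concept, reference)
    return len(a & b) / len(a | b)


def precision(retrieved: List[str], relevant: Set[str]) -> float:
    """Fraction of retrieved results that are relevant."""
    if not retrieved:
        return 0.0
    return sum(1 for doc in retrieved if doc in relevant) / len(retrieved)


# Toy data, invented for illustration only.
learned = {"aspirin": {"drug"}, "drug": set()}
reference = {"aspirin": {"analgesic"}, "analgesic": {"drug"}, "drug": set()}

print(taxonomic_overlap("aspirin", learned, reference))
print(precision(["d1", "d2", "d3", "d4"], {"d1", "d3", "d9"}))
```

In the toy data, the learned module places "aspirin" directly under "drug", while the reference also has the intermediate concept "analgesic". The two cotopies therefore share two of three concepts, and two of the four retrieved results are relevant.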

Author information

Corresponding author

Correspondence to Nesrine Ben Mustapha.

About this article

Cite this article

Ben Mustapha, N., Aufaure, MA., Baazaoui Zghal, H. et al. Query-driven approach of contextual ontology module learning using web snippets. J Intell Inf Syst 45, 61–94 (2015). https://doi.org/10.1007/s10844-013-0263-6

  • DOI: https://doi.org/10.1007/s10844-013-0263-6
