Abstract
The main objective of this work is to automatically build ontology modules that cover search terms of users in ontology-based question answering on the Web. Indeed, some arising approaches of ontology module extraction aim at solving the problem of identifying ontology fragment candidates that are relevant for the application. The main problem is that these approaches consider only the input of predefined ontologies, instead of the underlying semantics represented in texts. This work proposes an approach of contextual ontology module learning covering particular search terms by analyzing past user queries and by searching for web snippets provided by the traditional search engines. The obtained contextual modules will be used for query reformulation. The proposal has been evaluated on the ground of two criteria: the semantic cotopy measure of discovered ontology modules and the precision measure of the search results obtained by using the resulted ontology modules for query reformulation. The experiments have been carried out according to two case studies: an open domain web search and the medical digital library “PubMed”.




















Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Alfonseca, E. & Manandhar, S. (2002). An unsupervised method for general named entity recognition and automated concept discovery. In Proc. first international conference on genaral WordNet. India.
Abdollahzadeh, A. & Barforush, A.R. (2012). Ontology learning: revised. Journal of Web Engineering, 11(4), 269–289.
Ben-Mustapha, N., Baazaoui-Zghal, H., Marie-Aude, A., Ben-Ghzala, H. (2009). Survey on ontology learning from web and open issues. In Third international symposium on innovation in information and communication technology, (ISIICT’ 2009). Amman Jordan.
Ben-Mustapha, N., Aufaure, M.-A., Baazaoui-Zghal, H., Ben-Ghzala, H. (2011). Contextual ontology module learning from web snippets and past user queries. In Proceedings of the 15th international conference on knowledge-based and intelligent information and engineering systems, KES’11 (Vol. part II, pp. 538–547).
Ben-Mustapha, N., Aufaure, M.-A., Baazaoui-Zghal, H., Ben-Ghzala, H. (2012). Modular ontological warehouse for adaptative information search. In Proceedings of 2nd international conference on model and data engineering (MEDI’2012).
Cimiano, P. (2006). Ontology learning and population from text—algorithms, evaluation and applications. Springer.
Cuzzocrea, A. & Mastroianni, C. (2003). A reference architecture for knowledge management-based. In Proceeding of the fourth international on web information systems engineering (WISE) (pp. 347–354).
Cuzzocrea, A. (2006). Combining multidimensional user models and knowledge representation and management techniques for making web services knowledge-aware. Web Intelligence and Agent Systems, 4(3), 289–312.
D’Aquin, M., Sabou, M., Motta, E. (2006). Modularization, a key for the dynamic selection of relevant knowledge components. In Proc. of the ISWC 2006 workshop on modular ontologies.
Downey, D., Broadhead, M., Etzioni, O. (2007). Locating complex named entities in Web text. In Proceedings of the 20th international joint conference on artificial intelligence (pp. 2733–2739).
Ehrig, H., Ehrig, K., Prange, U., Taentzer, G. (2006). Fundamental theory for typed attributed graphs and graph transformation based on adhesive HLR categories. Journal of Fundamentals and Information, 74(1), 31–61. IOS Press.
Etzioni, O., Cafarella, M., Downey, D., Kok, S., Popescu, A.-M., Shaked, T., Soderland, S., Weld, D.S., Yates, A. (2004). Web scale information extraction in KnowItAll (preliminary results). In Proceedings of the 13th International WWW Conference (pp. 100–111). New York, USA.
Ferreira, J. (1999). A local maxima method and a fair dispersion normalization for extracting multi-word units from corpora. World Trade, 369–381.
Elloumi, M., Ben-Mustapha, N., Baazaoui, H., Moreno, A., Sanchez, D. (2010). Evolutive content-based search system. In KDIR.
Hearst, M.A. (1998). Automated discovery of WordNet relations. Wordnet an Electronic Lexical Database (pp. 132–152). MIT Press, Cambridge, MA.
Geleijnse, G. & Korst, J.H.M. (2005). Automatic ontology population by googling. In Proceedings of the seventeenth Belgium-Netherlands conference on artificial intelligence (pp. 120–126).
Gerhard, W., Weichselbraun, A., Scharl, A., Sabou, M. (2012). Dynamic integration of multiple evidence sources for ontology learning. Journal of Information and Data Management, 3(3), 243–254.
Godard, D. (2006). Compositionalit: questions linguistiques. In Godard, D., Roussarie, L., Corblin, F. (Eds.), Smanticlopdie: dictionnaire de smantique. GDR Smantique et Modli-sation, CNRS.
Dong, H. & Hussain, F.-K. (2012). SOF: a semi-supervised ontology-learning-based focused crawler. Journal of concurrency and computation: practice and experience. Wiley.
Gerhard, W., Weichselbraun, A., Scharl, A., Sabou, M. (2012). Dynamic integration of multiple evidence sources for ontology learning. Journal of Information and Data Management, 3(3), 243–254.
Jarvelin, K. & Kekalainen, J. (2002). Cumulated gain-based evaluation of ir techniques. ACM Transactions on Information Systems, 20(4), 422–446.
Klein, D. & Manning, C.D. (2003). Accurate unlexicalized parsing. In Proceedings of the 41st annual meeting of the association for computational linguistics (pp. 423–430).
Landauer, T. & Dumais, S. (1997). A solution to platos problem: the latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104(2), 211–240.
Lemaire, B. & Denhire, G. (2006). Effects of high-order co-occurrences on word semantic similarities. Current Psychology Letters - Behaviour, Brain and Cognition, 18(1), 211–240.
Maedche, A. & Staab, S. (2002). Measuring similarity between ontologies. In Proc. CIKM 2002, LNAI (Vol. 2473).
Meadow, C., Boyce, B., Kraft, D. (2000). Text information retrieval systems (2nd ed.) Academic Press.
Miller, G.A., Beckwith, R., Fellbaum, C.D., Gross, D., Miller, K. (1990). WordNet: an online lexical database. International Journal Lexicography, 3, 235–244.
Noy, N. & Musen, M. (2004). Specifying Ontology Views by Traversal, In Proc. of international semantic web conference (ISWC).
Sanchez, D. (2009). Domain ontology learning from the web. Knowledge Engineering Review, 24(4), 413.
Sanchez, D. & Moreno, A. (2008). Learning non-taxonomic relationships from web documents for domain ontology construction. DKE, 64(3), 600–623,.
Sanchez, D., Moreno, A., Del-Vasto-Terrientes, L. (2012). Learning relation axioms from text: an automatic Web-based approach. Expert Systems and Application, 39(5), 5792–5805.
Seidenberg, J. & Rector, A. (2006). Web ontology segmentation: analysis, classification and use. In Proc. of the World Wide Web Conference (WWW).
Stevenson, M. & Greenwood, M.A. (2006). Comparing information extraction pattern model. In Proceedings of the information extraction beyond the document workshop COLING/ACL.
Stuckenschmidt, H., Parent, C., Spaccapietra, S. (2009). Modular ontologies: concepts, theories and techniques for knowledge modularization. Springer, Berlin, Heidelberg,.
Turney, P.D. (2001). Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. In Proceedings of the twelfth European conference on machine learning freiburg (pp. 491–499). Germany.
Wong, W., Liu, W., Bennamoun, M. (2012). Ontology learning from text: a look back and into the future. ACM Computer Survey, 44(4), 20–36.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ben Mustapha, N., Aufaure, MA., Baazaoui Zghal, H. et al. Query-driven approach of contextual ontology module learning using web snippets. J Intell Inf Syst 45, 61–94 (2015). https://doi.org/10.1007/s10844-013-0263-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10844-013-0263-6