Skip to main content

Enriching Ontology for Deep Web Search

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5181))

Abstract

This paper addresses the problems of extracting instances from the Deep Web, enriching a domain specific ontology with those instances, and using this ontology to improve Web search. Extending an existing ontology with a large number of instances extracted from the Deep Web is an important process for making the ontology more usable for indexing of Deep Web sites. We demonstrate how instances extracted from the Deep Web are used to enhance a domain ontology. We show the contribution of the enriched ontology to Web search effectiveness. This is done by comparing the number of relevant Web sites returned by a search engine with a user’s search terms only, with the Web sites found when using additional ontology-based search terms. Experiments suggest that the ontology plus instances approach results in more relevant Web sites among the first 100 hits.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Singh, M.P.: Deep Web structure. IEEE Internet Computing 6(5), 4-5 (September 2002)

    Google Scholar 

  2. He, B., Patel, M., Zhang, Z., Chang, K.C.-C.: Accessing the Deep Web: A Survey. Communications of the ACM (CACM) 50(5), 94–101 (2007)

    Google Scholar 

  3. Ntoulas, A., Zerfos, P., Cho, J.: Downloading Textual Hidden Web Content Through Keyword Queries. In: Proc. of the ACM/IEEE Joint Conf. on Digital Libraries (JCDL), pp. 100–109 (2005)

    Google Scholar 

  4. Gruber, T.R.: Toward principles for the design of ontologies used for knowledge sharing. International Journal of Human-Computer Studies 43(5), 907–928 (1995)

    Google Scholar 

  5. Davulcu, H., Vadrevu, S., Nagarajan, S., Ramakrishnan, I.V.: OntoMiner: bootstrapping and populating ontologies from domain-specific Web sites. IEEE Intelligent Systems 18(5), 24–33 (2003)

    Google Scholar 

  6. McDowell, L.K., Cafarella, M.: Ontology-driven Information Extraction with OntoSyphon. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 428–444. Springer, Heidelberg (2006)

    Google Scholar 

  7. An, Y.J., Geller, J., Wu, Y.-T., Chun, S.A.: Semantic Deep Web: Automatic Attribute Extraction from the Deep Web Data Sources. In: Proc. of the 22nd Annual ACM Symposium on Applied Computing (SAC 2007), Seoul, Korea (March 2007)

    Google Scholar 

  8. Dou, D., McDermott, D.V., Qi, P.: Ontology Translation on the Semantic Web. Journal on Data Semantics 2, 35–57 (2005)

    Google Scholar 

  9. Wu, W., Doan, A., Yu, C.T., Meng, W.: Bootstrapping Domain Ontology for Semantic Web Services from Source Web Sites. In: Bussler, C.J., Shan, M.-C. (eds.) TES 2005. LNCS, vol. 3811, pp. 11–22. Springer, Heidelberg (2006)

    Google Scholar 

  10. An, Y.J., Geller, J., Wu, Y.-T., Chun, S.A.: Automatic Generation of Ontology from the Deep Web. In: Proceedings of 6th International Workshop on Web Semantics (WebS 2007), September 3-7, 2007 (in Press, 2007)

    Google Scholar 

  11. Stanford Medical Informatics, Protégé 3.2.5 [Computer program API] (Retrieved, May 2007), http://protege.stanford.edu/doc/pdk/api/index.html

  12. Stanford Medical Informatics, Protégé - OWL 3.2.1 [Computer program API] (Retrieved, May 2007), http://protege.stanford.edu/download/release-javadoc-owl/

  13. Jansen, B.J., Spink, A., Saracevic, T.: Real Life, Real Users, and Real Needs: A Study and Analysis of User Queries on the Web. Information Processing & Management 36(2), 207–227 (2000)

    Google Scholar 

  14. Liddle, S., Embley, D., Scott, D., Yau, S.: Extracting Data Behind Web. In: Proceedings of the Joint Workshop on Conceptual Modeling Approaches for E-business: A Web Service Perspective (eCOMO 2002), pp. 38–49 (October 2002)

    Google Scholar 

  15. Omelayenko, B.: Learning of ontologies for the Web: the analysis of existent approaches. In: Proceedings of the International Workshop on Web Dynamics (Retrieved, 2001), http://www.dcs.bbk.ac.uk/webDyn/webDynPapers/omelayenko.pdf

  16. Weber, N., Buitelaar, P.: Web-based Ontology Learning with ISOLDE. In: Proceedings of ISWC 2006 Workshop on Web Content Mining with Human Language Technologies (2006), http://www2.dfki.de/~paulb/

  17. Faatz, A., Steinmetz, R.: Ontology Enrichment Evaluation. In: Motta, E., Shadbolt, N.R., Stutt, A., Gibbins, N. (eds.) EKAW 2004. LNCS (LNAI), vol. 3257, pp. 497–498. Springer, Heidelberg (2004)

    Google Scholar 

  18. Stanford Center for Biomedical Informatics Research, Protégé 3.3.1 [Computer Program] (Retrieved, 2007), http://protege.stanford.edu/download/registered.html

  19. Raghavan, S., Garcia-Molina, H.: Crawling the Hidden Web. In: Proceedings of the 27th International Conference on Very Large Data Bases, pp. 29–138 (2001)

    Google Scholar 

  20. Liddle, S., Embley, D., Scott, D., Yau, S.: Extracting Data Behind Web Forms. In: Proceedings of the Joint Workshop on Conceptual Modeling Approaches for E-business: A Web Service Perspective (eCOMO 2002), pp. 38–49 (2002)

    Google Scholar 

  21. Ramachandran, R., Movva, S., Graves, S., Tanner, S.: Ontology-based Semantic Search Tool for Atmospheric Science. In: Proceedings of 22nd International Conference on Interactive Information Processing Systems for Meteorology, Oceanography, and Hydrology, http://ams.confex.com/ams/Annual2006/

  22. An, Y.J.: Ontology Learning for the Semantic Deep Web, Ph.D. Dissertation in Computer Science, New Jersey Institute of Technology (January 2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Sourav S. Bhowmick Josef Küng Roland Wagner

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

An, Y.J., Chun, S.A., Huang, Kc., Geller, J. (2008). Enriching Ontology for Deep Web Search. In: Bhowmick, S.S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2008. Lecture Notes in Computer Science, vol 5181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85654-2_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85654-2_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85653-5

  • Online ISBN: 978-3-540-85654-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics