Skip to main content

Using Semantics to Personalize Access to Data-Intensive Web Sources

  • Conference paper
Business Information Systems Workshops (BIS 2009)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 37))

Included in the following conference series:

  • 624 Accesses

Abstract

Data-intensive Web sites are an important and growing source of well-structured information on the Web. Their potential value remains largely unused as they pose a number of challenges to both machine and human users. They are dispersed and provide heterogeneous, rigid, site-specific and non-personalizable query and navigation interfaces. In this paper we present outline of method for accessing data from data-intensive Web sites in an uniform way. Our method is independent of the source and allows for personalization of the access to data. We describe how domain ontology is used for definition of personalized GUI. Then initial evaluation of described method is provided.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abramowicz, W., Flejter, D., Kaczmarek, T., Starzecka, M., Walczak, A.: Semantically Enhanced Deep Web. In: INFORMATIK 2008 Beherrschbare Systeme dank Informatik, 38. Jahrestagung der Gesellschaft für Informatik, Gesellschaft für Informatik e.V (GI), München, September,  8. bis 13, pp. 673–679 (2008)

    Google Scholar 

  2. Alvarez, M., Raposo, J., Pan, A., Cacheda, F., Bellas, F., Carneiro, V.: Deepbot: A focused crawler for accessing hidden web content. In: 3rd international workshop on Data engineering issues in E-commerce and services, pp. 18–25 (2007)

    Google Scholar 

  3. An, Y.J., Geller, J., Wu, Y.-T., Chun, S.A.: Semantic deep web: automatic attribute extraction from the deep web data sources. In: SAC 2007: Proceedings of the 2007 ACM symposium on Applied computing, pp. 1667–1672. ACM, New York (2007)

    Google Scholar 

  4. Anupam, V., Freire, J., Kumar, B., Lieuwen, D.: Automating web navigation with the webvcr. In: 9th International Conference on World Wide Web, pp. 503–517 (2000)

    Google Scholar 

  5. Bergman, M.K.: The deep web: Surfacing hidden value. The Journal of Electronic Publishing 7(1) (2001)

    Google Scholar 

  6. Bigham, J.P., Cavender, A.C., Kaminsky, R.S., Prince, C.M., Robinson, T.S.: Transcendence: Enabling a personal view of the deep web. In: International Conference on Intelligent User Interfaces (2008)

    Google Scholar 

  7. Chang, K.C.-C., He, B., Zhang, Z.: Mining semantics for large scale integration on the web: evidences, insights, and challenges. SIGKDD Exploration Newsletter 6(2), 67–76 (2004)

    Article  Google Scholar 

  8. Cockburn, A., McKenzie, D.: An evaluation of cone trees. In: Proceedings of the 2000 British Computer Society Conference on Human Computer Interaction (2000)

    Google Scholar 

  9. Chang, K.C.-C., He, B., Zhang, Z.: Metaquerier: querying structured web sources on-the-fly. In: 2005 ACM SIGMOD International Conference on Management of Data, pp. 927–929 (2005)

    Google Scholar 

  10. Doan, A., Domingos, P., Halevy, A.Y.: Reconciling schemas of disparate data sources: A machine-learning approach. In: SIGMOD Conference (2001)

    Google Scholar 

  11. Flesca, S., Gottlob, G., Baumgartner, R.: Supervised wrapper generation with lixto. In: 27th International Conference on Very Large Data Bases, pp. 715–716 (2001)

    Google Scholar 

  12. Halevy, A.: Theory of answering queries using views. SIGMOD Record 29(4), 40–47 (2000)

    Article  Google Scholar 

  13. Handschuh, S., Staab, S., Volz, R.: On deep annotation. In: WWW 2003: Proceedings of the 12th international conference on World Wide Web, pp. 431–438. ACM, New York (2003)

    Google Scholar 

  14. Katifori, A., Torou, E., Halatsis, C., Lepouras, G., Vassilakis, C.: A comparative study of four ontology visualization techniques in protege: Experiment setup and preliminary results. In: IV 2006: Proceedings of the conference on Information Visualization, Washington, DC, USA, pp. 417–423. IEEE Computer Society, Los Alamitos (2006)

    Google Scholar 

  15. Kobsa, A.: User experiments with tree visualization systems. In: INFOVIS 2004: Proceedings of the IEEE Symposium on Information Visualization, Washington, DC, USA, pp. 9–16. IEEE Computer Society, Los Alamitos (2004)

    Google Scholar 

  16. Teixeira, J.S., Ribeiro-Neto, B.A., Laender, A.H.F., da Silva, A.S.: A brief survey of web data extraction tools. SIGMOD Record 31(2), 84–93 (2002)

    Article  Google Scholar 

  17. Nagao, K.: Digital Content Annotation and Transcoding. Artech House Publishers, Norwood (2003)

    Google Scholar 

  18. Ntoulas, A., Zerfos, P., Cho, J.: Downloading textual hidden web content through keyword queries. In: 5th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 100–109 (2005)

    Google Scholar 

  19. Papakonstantinou, Y., Gupta, A., Haas, L.: Capabilities-based query rewriting in mediator systems. In: 4th International Conference on Parallel and Distributed Information Systems (1996)

    Google Scholar 

  20. Raghavan, S., Garcia-Molina, H.: Crawling the hidden web. In: 27th International Conference on Very Large Data Bases, pp. 129–138 (2001)

    Google Scholar 

  21. Rivandeneira, W., Benderson, B.B.: A study of search result clustering interfaces: Comparing textual and zoomable interfaces. Technical report, University of Maryland HCIL (2003)

    Google Scholar 

  22. Starzecka, M.: Nawigacja w serwisach www na podstawie ontologicznego opisu zrode. Master’s thesis. Akademia Ekonomiczna w Poznaniu (2008)

    Google Scholar 

  23. Gal, A., Modica, G., Jamil, H.: OntoBuilder: Fully Automatic Extraction and Consolidation of Ontologies from Web Sources. In: International Conference on Data Engineering. IEEE Computer Society, Los Alamitos (1996)

    Google Scholar 

  24. Walny, J.: Semaform: Semantic wrapper generation for querying deep web data sources. CPSC 502 project under the supervision of Dr. Denilson Barbosa (2007), http://www.ucalgary.ca/~jkwalny/502/index.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Starzecka, M., Walczak, A. (2009). Using Semantics to Personalize Access to Data-Intensive Web Sources. In: Abramowicz, W., Flejter, D. (eds) Business Information Systems Workshops. BIS 2009. Lecture Notes in Business Information Processing, vol 37. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03424-4_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03424-4_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03423-7

  • Online ISBN: 978-3-642-03424-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics