Using Semantics to Personalize Access to Data-Intensive Web Sources

Starzecka, Monika; Walczak, Adam

doi:10.1007/978-3-642-03424-4_4

Part of the book series: Lecture Notes in Business Information Processing (LNBIP, volume 37)

Included in the following conference series:

International Conference on Business Information Systems

Abstract

Data-intensive Web sites are an important and growing source of well-structured information on the Web. Their potential value remains largely untapped because they pose a number of challenges to both machine and human users: they are dispersed, and their query and navigation interfaces are heterogeneous, rigid, site-specific and non-personalizable. In this paper we present an outline of a method for accessing data from data-intensive Web sites in a uniform way. Our method is independent of the source and allows access to the data to be personalized. We describe how a domain ontology is used to define a personalized GUI, and we then provide an initial evaluation of the described method.

Author information

Authors and Affiliations

Poznan University of Economics, Al. Niepodleglosci 10, 61-875, Poznan, Poland
Monika Starzecka & Adam Walczak

Editor information

Editors and Affiliations

Department of Information Systems, Poznań University of Economics, Al. Niepodległości 10, 61-875, Poznań, Poland
Witold Abramowicz & Dominik Flejter

Cite this paper

Starzecka, M., Walczak, A. (2009). Using Semantics to Personalize Access to Data-Intensive Web Sources. In: Abramowicz, W., Flejter, D. (eds) Business Information Systems Workshops. BIS 2009. Lecture Notes in Business Information Processing, vol 37. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03424-4_4

DOI: https://doi.org/10.1007/978-3-642-03424-4_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03423-7
Online ISBN: 978-3-642-03424-4
eBook Packages: Computer Science, Computer Science (R0)