Abstract
The World-Wide Web presents new challenges to database researchers, especially in the area of query processing. Currently, the World-Wide Web is queried through online indices. These sites employ search engines, known as “robots”, that periodically scan the network and build text-based indices. A severe limitation of these search services is that the structural information, namely the organization of documents into parts pointing to each other, is lost. Several tasks, ranging from data mining to Intranet management, require analysis of the structural organization of the hypertext.
In this paper, we propose a simple graph-based query language. In this language, both the query and its target are graphs. We present a general class of algorithms for answering graph queries and evaluate their efficiency. The definition of the algorithms takes into account two important facts about the WWW: (1) efficient algorithms must minimize the communication needed to answer a query and (2) query evaluation involves a process of data graph exploration.
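To make the idea of graph queries over an explored data graph concrete, here is a minimal Python sketch. It is not the algorithm class from the paper; it only illustrates the two constraints named above under simple assumptions. The names `WEB`, `Explorer`, and `match` are hypothetical, the Web is replaced by an in-memory dictionary, and "communication" is approximated by counting how many distinct pages are fetched.

```python
from typing import Callable, Dict, Iterable, List, Tuple

# Hypothetical stand-in for the Web: page -> pages it links to.
WEB: Dict[str, List[str]] = {
    "a": ["b", "c"],
    "b": ["c"],
    "c": ["a"],
}


class Explorer:
    """Fetches a page's out-links on demand and caches them.

    `fetches` counts remote requests, a rough proxy for communication cost.
    """

    def __init__(self, fetch: Callable[[str], Iterable[str]]) -> None:
        self._fetch = fetch
        self._cache: Dict[str, List[str]] = {}
        self.fetches = 0

    def links(self, node: str) -> List[str]:
        if node not in self._cache:
            self.fetches += 1
            self._cache[node] = list(self._fetch(node))
        return self._cache[node]


def match(
    query_edges: List[Tuple[str, str]],
    start_var: str,
    start_node: str,
    explorer: Explorer,
) -> List[Dict[str, str]]:
    """Return every assignment of query variables to data nodes such that
    each query edge (u, v) maps to a link from binding[u] to binding[v].

    Assumption: every query variable is reachable from `start_var` by
    following query edges, so exploration can start from a single page.
    """
    results: List[Dict[str, str]] = []

    def extend(binding: Dict[str, str], remaining: List[Tuple[str, str]]) -> None:
        if not remaining:
            results.append(dict(binding))
            return
        # Process an edge whose source variable is already bound.
        for i, (u, v) in enumerate(remaining):
            if u not in binding:
                continue
            rest = remaining[:i] + remaining[i + 1:]
            for target in explorer.links(binding[u]):
                if v in binding:
                    if binding[v] == target:
                        extend(binding, rest)
                else:
                    binding[v] = target
                    extend(binding, rest)
                    del binding[v]
            return
        # No remaining edge has a bound source: nothing more can be explored.

    extend({start_var: start_node}, list(query_edges))
    return results


# Usage: look for a directed triangle x -> y -> z -> x starting at page "a".
explorer = Explorer(lambda page: WEB.get(page, []))
results = match([("x", "y"), ("y", "z"), ("z", "x")], "x", "a", explorer)
print(results, explorer.fetches)
```

The cache in `Explorer` is the only communication-saving device in this sketch; each page is requested at most once, no matter how many partial matches pass through it.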
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Konopnicki, D., Shmueli, O. (1999). WWW Exploration Queries. In: Pinter, R.Y., Tsur, S. (eds) Next Generation Information Technologies and Systems. NGITS 1999. Lecture Notes in Computer Science, vol 1649. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48521-X_3
DOI: https://doi.org/10.1007/3-540-48521-X_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66225-9
Online ISBN: 978-3-540-48521-6
eBook Packages: Springer Book Archive