Abstract
QUEST is a system for Querying Semantically Tagged documents on the World-Wide Web. The advent of new markup languages, such as xml, facilitates authoring of Web documents that contain not just html tags for instructing a browser how to view a document, but also contain objects that represent the semantic structure of the document. When such documents become widely available, more powerful methods to access and query information on the Web will be possible. The QUEST system was designed and implemented for querying and manipulating documents written in the markup language ohtml. ohtml combines html and objects of the oem data model. QUEST has several new features. First, QUEST can be used to query a combination of hypertext and object structures. Second, The results of queries are ohtml pages and thus of the same type as the data being queried. Third, QUEST implements a new approach for querying semistructured data that produces meaningful answers even when the input data is incomplete, i.e., when some variables of the query cannot be bound to database values. Finally, the experience of developing and using QUEST for querying semantic documents on the Web can be useful for the design and implementation of query languages for xml. This paper provides an overview of the QUEST system and its components.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
S. Abiteboul Querying semi-structured data. In International Conference on Database Theory, volume 1186 of Lecture Notes in Computer Science, pages 1–18, Delphi (Greece), January 1997. Springer-Verlag.
G.O. Arocena and A.O. Mendelzon WebOQL: Restructuring documents, databases, and webs. In Proc. 14th International Conference on Data Engineering, pages 24–33, Orlando (Florida, USA), February 1998. IEEE Computer Society.
P. Atzeni, G. Mecca, and P. Merialdo To weave the web. In Proc. 23nd International Conference on Very Large Data Bases, pages 206–215, Athens (Greece), August 1997. Morgan Kaufmann Publishers.
S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J.L. Wiener The Lorel query language for semistructured data. International Journal on Digital Libraries, 1(1):68–88, 1997.
P. Buneman, S.B. Davidson, G.G. Hillebrand, and D. Suciu A query language and optimization techniques for unstructured data. In Proc. 1996 ACM SIGMOD International Conference on Management of Data, pages 505–516, Montreal (Canada), June 1996.
P. Buneman Semistructured data. In Proc. 16th Symposium on Principles of Database Systems, pages 117–121, Tucson (Arizona, USA), May 1997. ACM Press.
World Wide Web Consortium. Extensible markup language (XML) 1.0. http://www.w3.org/TR/REC-xml, 1998.
A. Deutsch, M. Fernandez, D. Florescu, A. Levy, and D. Suciu. Applications of XML-QL, a query language for XML. http://www.w3.org/TR/NOTE-xml-ql, 1998.
M.F. Fernandez, D. Florescu, J. Kang, A.Y. Levy, and D. Suciu. Catching the boat with Strudel: Experiences with a web-site management system. In Proc. 1998 ACM SIGMOD International Conference on Management of Data, pages 414–425, Seattle (Washington, USA), June 1998. ACM Press.
C.A. Galindo-Legaria. Outerjoins as disjunctions. In Proc. 1994 ACM SIGMOD International Conference on Management of Data, pages 348–358, Minneapolis (Minnesota, USA), May 1994. ACM Press.
GLOBES. http://www.globes.co.il.
Y. Kogan, D. Michaeli, Y. Sagiv, and O. Shmueli. Utilizing the multiple facets of WWW contents. Data and Knowledge Engineering, 28(3):255–275, 1998.
Y. Kanza, W. Nutt, and Y. Sagiv. Queries with incomplete answers over semistructured data. In “Proc. 18th Symposium on Principles of Database Systems”, “Philadelphia (Pennsylvania, USA) ”, may 1999. ACM Press.
D. Konopnicki and O. Shmueli. W3QS: A query system for the world-wide web. In Proc. 21st International Conference on Very Large Data Bases, pages 54–65. Morgan Kaufmann Publishers, August 1995.
D. Konopnicki and O. Shmueli. W3QS–A system for WWW querying. In Proc. 13th International Conference on Data Engineering, page 586, Binghamton (United Kingdom), April 1997. IEEE Computer Society.
L.V.S. Lakshmanan, F. Sadri, and I.N. Subramanian. A declarative language for querying and restructuring the web. In Proc. 6th International Workshop on Research Issues on Data Engineering-Interoperability of Nontraditional Database Systems, pages 12–21, New Orleans (Louisiana, USA), February 1996. IEEE Computer Society.
J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A database management system for semistructured data. SIGMOD Record, 3(26):54–66, 1997.
G. Mecca, P. Atzeni, A. Masci, P. Merialdo, and G. Sindoni. The Araneus web-base management system. In Proc. 1998 ACM SIGMOD International Conference on Management of Data, pages 544–546, Seattle (Washington, USA), June 1998. ACM Press.
A.O. Mendelzon and T. Milo. Formal models of web queries. In Proc. 16th Symposium on Principles of Database Systems, pages 134–143, Tucson (Arizona, USA), May 1997. ACM Press.
A.O. Mendelzon, G.A. Mihaila, and T. Milo. Querying the world wide web. International Journal on Digital Libraries, 1(1):54–67, 1997.
Y. Papakonstantinou, H. Garcia-Molina, and J. Widom. Object exchange across heterogeneous information sources. In P.S. Yu and A.L.P. Chen, editors, Proc. 11th International Conference on Data Engineering, pages 251–260, Taipei, March 1995. IEEE Computer Society.
D. Quass, A. Rajaraman, Y. Sagiv, J. Ullman, and J. Widom. Querying semistructured heterogeneous information. In Proc. 4th International Conference on Deductive and Object-Oriented Databases, volume 1013 of Lecture Notes in Computer Science, pages 319–344, Singapore, December 1995. Springer-Verlag.
D. Quass, J. Widom, R. Goldman, K. Haas, Q. Luo, J. McHugh, S. Nestorov, A. Rajaraman, H. Rivero, S. Abiteboul, J.D. Ullman, and J.L. Wiener. Lore: A lightweight object repository for semistructured data. In Proc. 1996 ACM SIGMOD International Conference on Management of Data, page 549, Montreal (Canada), June 1996.
A. Rajaraman and J.D. Ullman. Integrating information by outerjoins and full disjunctions. In Proc. 15th Symposium on Principles of Database Systems, pages 238–248, Montreal (Canada), June 1996. ACM Press.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bar-Yossef, Z., Kanza, Y., Kogan, Y., Sagiv, Y., Nutt, W. (1999). Querying Semantically Tagged Documents on the World-Wide Web. In: Pinter, R.Y., Tsur, S. (eds) Next Generation Information Technologies and Systems. NGITS 1999. Lecture Notes in Computer Science, vol 1649. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48521-X_2
Download citation
DOI: https://doi.org/10.1007/3-540-48521-X_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66225-9
Online ISBN: 978-3-540-48521-6
eBook Packages: Springer Book Archive