Skip to main content

Querying Semantically Tagged Documents on the World-Wide Web

  • Conference paper
  • First Online:
Next Generation Information Technologies and Systems (NGITS 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1649))

Abstract

QUEST is a system for Querying Semantically Tagged documents on the World-Wide Web. The advent of new markup languages, such as xml, facilitates authoring of Web documents that contain not just html tags for instructing a browser how to view a document, but also contain objects that represent the semantic structure of the document. When such documents become widely available, more powerful methods to access and query information on the Web will be possible. The QUEST system was designed and implemented for querying and manipulating documents written in the markup language ohtml. ohtml combines html and objects of the oem data model. QUEST has several new features. First, QUEST can be used to query a combination of hypertext and object structures. Second, The results of queries are ohtml pages and thus of the same type as the data being queried. Third, QUEST implements a new approach for querying semistructured data that produces meaningful answers even when the input data is incomplete, i.e., when some variables of the query cannot be bound to database values. Finally, the experience of developing and using QUEST for querying semantic documents on the Web can be useful for the design and implementation of query languages for xml. This paper provides an overview of the QUEST system and its components.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Abiteboul Querying semi-structured data. In International Conference on Database Theory, volume 1186 of Lecture Notes in Computer Science, pages 1–18, Delphi (Greece), January 1997. Springer-Verlag.

    Google Scholar 

  2. G.O. Arocena and A.O. Mendelzon WebOQL: Restructuring documents, databases, and webs. In Proc. 14th International Conference on Data Engineering, pages 24–33, Orlando (Florida, USA), February 1998. IEEE Computer Society.

    Google Scholar 

  3. P. Atzeni, G. Mecca, and P. Merialdo To weave the web. In Proc. 23nd International Conference on Very Large Data Bases, pages 206–215, Athens (Greece), August 1997. Morgan Kaufmann Publishers.

    Google Scholar 

  4. S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J.L. Wiener The Lorel query language for semistructured data. International Journal on Digital Libraries, 1(1):68–88, 1997.

    Article  Google Scholar 

  5. P. Buneman, S.B. Davidson, G.G. Hillebrand, and D. Suciu A query language and optimization techniques for unstructured data. In Proc. 1996 ACM SIGMOD International Conference on Management of Data, pages 505–516, Montreal (Canada), June 1996.

    Google Scholar 

  6. P. Buneman Semistructured data. In Proc. 16th Symposium on Principles of Database Systems, pages 117–121, Tucson (Arizona, USA), May 1997. ACM Press.

    Google Scholar 

  7. World Wide Web Consortium. Extensible markup language (XML) 1.0. http://www.w3.org/TR/REC-xml, 1998.

  8. A. Deutsch, M. Fernandez, D. Florescu, A. Levy, and D. Suciu. Applications of XML-QL, a query language for XML. http://www.w3.org/TR/NOTE-xml-ql, 1998.

  9. M.F. Fernandez, D. Florescu, J. Kang, A.Y. Levy, and D. Suciu. Catching the boat with Strudel: Experiences with a web-site management system. In Proc. 1998 ACM SIGMOD International Conference on Management of Data, pages 414–425, Seattle (Washington, USA), June 1998. ACM Press.

    Google Scholar 

  10. C.A. Galindo-Legaria. Outerjoins as disjunctions. In Proc. 1994 ACM SIGMOD International Conference on Management of Data, pages 348–358, Minneapolis (Minnesota, USA), May 1994. ACM Press.

    Google Scholar 

  11. GLOBES. http://www.globes.co.il.

  12. Y. Kogan, D. Michaeli, Y. Sagiv, and O. Shmueli. Utilizing the multiple facets of WWW contents. Data and Knowledge Engineering, 28(3):255–275, 1998.

    Article  MATH  Google Scholar 

  13. Y. Kanza, W. Nutt, and Y. Sagiv. Queries with incomplete answers over semistructured data. In “Proc. 18th Symposium on Principles of Database Systems”, “Philadelphia (Pennsylvania, USA) ”, may 1999. ACM Press.

    Google Scholar 

  14. D. Konopnicki and O. Shmueli. W3QS: A query system for the world-wide web. In Proc. 21st International Conference on Very Large Data Bases, pages 54–65. Morgan Kaufmann Publishers, August 1995.

    Google Scholar 

  15. D. Konopnicki and O. Shmueli. W3QS–A system for WWW querying. In Proc. 13th International Conference on Data Engineering, page 586, Binghamton (United Kingdom), April 1997. IEEE Computer Society.

    Google Scholar 

  16. L.V.S. Lakshmanan, F. Sadri, and I.N. Subramanian. A declarative language for querying and restructuring the web. In Proc. 6th International Workshop on Research Issues on Data Engineering-Interoperability of Nontraditional Database Systems, pages 12–21, New Orleans (Louisiana, USA), February 1996. IEEE Computer Society.

    Google Scholar 

  17. J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A database management system for semistructured data. SIGMOD Record, 3(26):54–66, 1997.

    Article  Google Scholar 

  18. G. Mecca, P. Atzeni, A. Masci, P. Merialdo, and G. Sindoni. The Araneus web-base management system. In Proc. 1998 ACM SIGMOD International Conference on Management of Data, pages 544–546, Seattle (Washington, USA), June 1998. ACM Press.

    Google Scholar 

  19. A.O. Mendelzon and T. Milo. Formal models of web queries. In Proc. 16th Symposium on Principles of Database Systems, pages 134–143, Tucson (Arizona, USA), May 1997. ACM Press.

    Google Scholar 

  20. A.O. Mendelzon, G.A. Mihaila, and T. Milo. Querying the world wide web. International Journal on Digital Libraries, 1(1):54–67, 1997.

    Google Scholar 

  21. Y. Papakonstantinou, H. Garcia-Molina, and J. Widom. Object exchange across heterogeneous information sources. In P.S. Yu and A.L.P. Chen, editors, Proc. 11th International Conference on Data Engineering, pages 251–260, Taipei, March 1995. IEEE Computer Society.

    Google Scholar 

  22. D. Quass, A. Rajaraman, Y. Sagiv, J. Ullman, and J. Widom. Querying semistructured heterogeneous information. In Proc. 4th International Conference on Deductive and Object-Oriented Databases, volume 1013 of Lecture Notes in Computer Science, pages 319–344, Singapore, December 1995. Springer-Verlag.

    Google Scholar 

  23. D. Quass, J. Widom, R. Goldman, K. Haas, Q. Luo, J. McHugh, S. Nestorov, A. Rajaraman, H. Rivero, S. Abiteboul, J.D. Ullman, and J.L. Wiener. Lore: A lightweight object repository for semistructured data. In Proc. 1996 ACM SIGMOD International Conference on Management of Data, page 549, Montreal (Canada), June 1996.

    Google Scholar 

  24. A. Rajaraman and J.D. Ullman. Integrating information by outerjoins and full disjunctions. In Proc. 15th Symposium on Principles of Database Systems, pages 238–248, Montreal (Canada), June 1996. ACM Press.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bar-Yossef, Z., Kanza, Y., Kogan, Y., Sagiv, Y., Nutt, W. (1999). Querying Semantically Tagged Documents on the World-Wide Web. In: Pinter, R.Y., Tsur, S. (eds) Next Generation Information Technologies and Systems. NGITS 1999. Lecture Notes in Computer Science, vol 1649. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48521-X_2

Download citation

  • DOI: https://doi.org/10.1007/3-540-48521-X_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66225-9

  • Online ISBN: 978-3-540-48521-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics