Skip to main content

Representing and Querying Summarized XML Data

  • Conference paper
Database and Expert Systems Applications (DEXA 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2736))

Included in the following conference series:

Abstract

In the last few years several repositories for storing XML documents and languages for querying XML data have been studied and implemented. All the query languages proposed so far allow to obtain exact answers, but when applied to large XML repositories or warehouses, such precise queries may require high response times. To overcome this problem, in traditional relational warehouses fast approximate queries are supported, built on concise data statistics based on histograms or sampling techniques. In this paper we propose a novel approach to summarize an XML document collection taking into account the hierarchical structure of XML documents, which makes the summarization process substantially more difficult than in case of flat, relational data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bamboat, Q.A., Dunemann, O.: Obtaining quick results for approximate answers. In: Proc. of SCI 2001 and ISAS 2001, Orlando, Florida (2001)

    Google Scholar 

  2. Comai, S., Marrara, S., Tanca, L.: Representing and querying summarized xml data. Technical report, Politecnico di Milano (2003)

    Google Scholar 

  3. W3C Consortium. Xml 1.0 (February 1998), http://www.w3.org/XML

  4. http://www.xyleme.com

  5. http://www.excelon.com

  6. Gibbons, P.B., Matias, Y.: Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A (1999)

    Google Scholar 

  7. Widom, J., Goldman, R., McHugh, J.: From semistructured data to xml: Migrating the lore data model and query language. In: Proc. WebDb, pp. 25–30 (1999)

    Google Scholar 

  8. Jagadish, H., Al-Khalifa, S., Lakshmanan, L., Nierman, A., Paparizos, S., Patel, J., Srivastava, D., Wu, Y.: Timber: A native xml database (2002)

    Google Scholar 

  9. Polyzotis, N., Garofalakis, M.: Statistical synopses for graph-structured xml databases. In: ACM (ed.) Proc. ACM SIGMOD Conference, Madison, Wisconsin, USA (2002)

    Google Scholar 

  10. Ql 1998 query languages (1998), http://www.w3.org/TandS/QL/QL98

  11. Chamberlin, D., Florescu, D., Robie, J., Simeon, J., Stefanescu, M.: Xquery: A query language for xml (2001), http://www.w3.org/TR/xquery/

  12. (1998), http://www.tamino.com

  13. Virtuoso, http://www.openlinksw.com/virtuoso

  14. W3C. Xml path language (xpath) version 1.0 (1999), http://www.w3.org/TR/xpath

  15. http://db.uwaterloo.ca/ddbms/projects/xbench/index.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Comai, S., Marrara, S., Tanca, L. (2003). Representing and Querying Summarized XML Data. In: Mařík, V., Retschitzegger, W., Štěpánková, O. (eds) Database and Expert Systems Applications. DEXA 2003. Lecture Notes in Computer Science, vol 2736. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45227-0_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45227-0_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40806-2

  • Online ISBN: 978-3-540-45227-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics