Skip to main content

A Query-Friendly Compression for GML Documents

  • Conference paper
Database Systems for Adanced Applications (DASFAA 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6637))

Included in the following conference series:

Abstract

Geography Markup Language (GML) has become a standard encoding format for exchanging geographic data among heterogeneous Geographic Information System (GIS) applications. Whereas, the iteration of document structure and the textual expression of geographic data often cause the huge size of GML documents. In this paper, a query-friendly GML compression method is proposed, where the GML documents in SAX document parsing are transformed to a compact representation encompassing an event dictionary, the events hierarchy in balanced parentheses, a binary event wavelet tree and the document content blocks before compressed using a general compression utility. The proposed compression method supports direct path queries and spatial queries over the compressed files without the requirement of a full decompression. The compression model, the query resolution process and the compression algorithm are detailed in this paper, though the presentation is a preliminary investigation and it remains to carry out experiments to validate the proposed compression method on real GML documents.

This work was supported by the National Natural Science Foundation of China (NSFC) under grant No. 60873040 and China 863 Program under grant No. 2009AA01Z135.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Geospatial information – Geography Markup Language (GML). ISO 19136:2007 (2007)

    Google Scholar 

  2. Huffman, D.A.: A Method for the Construction of Minimum-Redundancy Codes. Proceedings of the IRE 40(9), 1098–1101 (1952)

    Article  MATH  Google Scholar 

  3. Ziv, J., Lempel, A.: A universal algorithm for sequential data compression. IEEE Transactions on Information Theory IT-23, 337–343 (1977)

    Article  MathSciNet  MATH  Google Scholar 

  4. Witten, I.H., Neal, R.M., Cleary, J.G.: Arithmetic coding for data compression. Communications of the ACM 30(6), 520–540 (1987)

    Article  Google Scholar 

  5. Cleary, J.G., Witten, I.H.: Data compression using adaptive coding and partial string matching. IEEE Transactions on Communications 32(4), 396–402 (1984)

    Article  Google Scholar 

  6. Burrows, M., Wheeler, D.J.: A block-sorting lossless data compression algorithm. Technical report SRC-RR-124, Hewlett-Packard Company (1994)

    Google Scholar 

  7. Hartmut, L., Suciu, D.: XMill: an efficient compressor for XML data. In: ACM SIGMOD 2000, pp. 153–164. ACM Press, New York (2000)

    Google Scholar 

  8. Girardot, M., Sundaresan, N.: Millau: an encoding format for efficient representation and exchange of XML over the Web. Computer Networks 33(1-6), 747–765 (2000)

    Article  Google Scholar 

  9. Cheney, J.: Compressing XML with multiplexed hierarchical PPM models. In: DCC 2001, pp. 163–172. IEEE Press, New York (2001)

    Google Scholar 

  10. League, C., Eng, K.: Type-based compression of XML data. In: DCC 2007, pp. 272–282. IEEE Press, New York (2007)

    Google Scholar 

  11. Skibiński, P., Grabowski, S., Swacha, J.: Effective asymmetric XML compression. Software: Practice and Experience 38(10), 1024–1047 (2008)

    Google Scholar 

  12. Tolani, P.M., Haritsa, J.R.: XGrind: a query-friendly XML compressor. In: ICDE 2002, pp. 225–234. IEEE Press, New York (2002)

    Google Scholar 

  13. Min, J., Park, M., Chung, C.: XPress: a queriable compression for XML data. In: ACM SIGMOD 2003, pp. 122–133. IEEE Press, New York (2003)

    Google Scholar 

  14. Arion, A., Bonifati, A., Costa, G., D’Aguanno, S., Manolescu, I., Pugliese, A.: XQueC: Pushing queries to compressed XML data. In: VLDB 2003, pp. 1065–1068 (2003)

    Google Scholar 

  15. Lam, W.Y., Ng, W., Wood, P.T., Levene, M.: XCQ: A queriable XML compression system. Knowledge and Information Systems 10(4), 421–452 (2006)

    Article  Google Scholar 

  16. Guan, J., Zhou, S.: GPress: Towards effective GML documents compresssion. In: ICDE 2007, pp. 1473–1474. IEEE Press, New York (2007)

    Google Scholar 

  17. Guan, J., Zhou, S., Chen, Y.: An effective GML documents compressor. IEICE Transactions on Information and Systems E91-D(7), 1982–1990 (2008)

    Google Scholar 

  18. Wei, Q., Guan, J.: A GML Compression Approach Based on On-line Semantic Clustering. In: Geoinformatics 2010, pp. 1–7. IEEE Press, New York (2010)

    Google Scholar 

  19. Dai, Q., Zhang, S., Wang, Z.: GQComp: A Query-Supported Compression Technique for GML. In: 9th IEEE International Conference on Computer and Information Technology, pp. 311–317. IEEE Press, New York (2009)

    Google Scholar 

  20. Vatsavai, R.R.: GML-QL: A spatial query language specification for GML. Department of Computer Science and Engineering, University of Minnesota, http://www.cobblestoneconcepts.com/ucgis2summer2002/vatsavai/vatsavai.htm

  21. Boucelma, O., Colonna, F.M.: GQuery: a query language for GML. In: 24th Urban Data Management Symposium (2004)

    Google Scholar 

  22. Jihong, G.: GQL: Extending XQuery to query GML documents. Geo-spatial Information Science 9(2), 118–126 (2006)

    Article  Google Scholar 

  23. XQuery 1.0: An XML query language, http://www.w3.org/XML/Query/

  24. GZip 1.2.4, http://www.gzip.org

  25. Grossi, R., Gupta, A., Vitter, J.S.: High-order entropy-compressed text indexes. In: SODA 2003 (2003)

    Google Scholar 

  26. Ferragina, P., Giancarlo, R., Manzini, G.: The myriad virtues of wavelet trees. Information and Computation 207(8), 849–866 (2009)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wei, Q. (2011). A Query-Friendly Compression for GML Documents. In: Xu, J., Yu, G., Zhou, S., Unland, R. (eds) Database Systems for Adanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20244-5_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20244-5_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20243-8

  • Online ISBN: 978-3-642-20244-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics