Abstract
Geography Markup Language (GML) has become a standard encoding format for exchanging geographic data among heterogeneous Geographic Information System (GIS) applications. Whereas, the iteration of document structure and the textual expression of geographic data often cause the huge size of GML documents. In this paper, a query-friendly GML compression method is proposed, where the GML documents in SAX document parsing are transformed to a compact representation encompassing an event dictionary, the events hierarchy in balanced parentheses, a binary event wavelet tree and the document content blocks before compressed using a general compression utility. The proposed compression method supports direct path queries and spatial queries over the compressed files without the requirement of a full decompression. The compression model, the query resolution process and the compression algorithm are detailed in this paper, though the presentation is a preliminary investigation and it remains to carry out experiments to validate the proposed compression method on real GML documents.
This work was supported by the National Natural Science Foundation of China (NSFC) under grant No. 60873040 and China 863 Program under grant No. 2009AA01Z135.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Geospatial information – Geography Markup Language (GML). ISO 19136:2007 (2007)
Huffman, D.A.: A Method for the Construction of Minimum-Redundancy Codes. Proceedings of the IRE 40(9), 1098–1101 (1952)
Ziv, J., Lempel, A.: A universal algorithm for sequential data compression. IEEE Transactions on Information Theory IT-23, 337–343 (1977)
Witten, I.H., Neal, R.M., Cleary, J.G.: Arithmetic coding for data compression. Communications of the ACM 30(6), 520–540 (1987)
Cleary, J.G., Witten, I.H.: Data compression using adaptive coding and partial string matching. IEEE Transactions on Communications 32(4), 396–402 (1984)
Burrows, M., Wheeler, D.J.: A block-sorting lossless data compression algorithm. Technical report SRC-RR-124, Hewlett-Packard Company (1994)
Hartmut, L., Suciu, D.: XMill: an efficient compressor for XML data. In: ACM SIGMOD 2000, pp. 153–164. ACM Press, New York (2000)
Girardot, M., Sundaresan, N.: Millau: an encoding format for efficient representation and exchange of XML over the Web. Computer Networks 33(1-6), 747–765 (2000)
Cheney, J.: Compressing XML with multiplexed hierarchical PPM models. In: DCC 2001, pp. 163–172. IEEE Press, New York (2001)
League, C., Eng, K.: Type-based compression of XML data. In: DCC 2007, pp. 272–282. IEEE Press, New York (2007)
Skibiński, P., Grabowski, S., Swacha, J.: Effective asymmetric XML compression. Software: Practice and Experience 38(10), 1024–1047 (2008)
Tolani, P.M., Haritsa, J.R.: XGrind: a query-friendly XML compressor. In: ICDE 2002, pp. 225–234. IEEE Press, New York (2002)
Min, J., Park, M., Chung, C.: XPress: a queriable compression for XML data. In: ACM SIGMOD 2003, pp. 122–133. IEEE Press, New York (2003)
Arion, A., Bonifati, A., Costa, G., D’Aguanno, S., Manolescu, I., Pugliese, A.: XQueC: Pushing queries to compressed XML data. In: VLDB 2003, pp. 1065–1068 (2003)
Lam, W.Y., Ng, W., Wood, P.T., Levene, M.: XCQ: A queriable XML compression system. Knowledge and Information Systems 10(4), 421–452 (2006)
Guan, J., Zhou, S.: GPress: Towards effective GML documents compresssion. In: ICDE 2007, pp. 1473–1474. IEEE Press, New York (2007)
Guan, J., Zhou, S., Chen, Y.: An effective GML documents compressor. IEICE Transactions on Information and Systems E91-D(7), 1982–1990 (2008)
Wei, Q., Guan, J.: A GML Compression Approach Based on On-line Semantic Clustering. In: Geoinformatics 2010, pp. 1–7. IEEE Press, New York (2010)
Dai, Q., Zhang, S., Wang, Z.: GQComp: A Query-Supported Compression Technique for GML. In: 9th IEEE International Conference on Computer and Information Technology, pp. 311–317. IEEE Press, New York (2009)
Vatsavai, R.R.: GML-QL: A spatial query language specification for GML. Department of Computer Science and Engineering, University of Minnesota, http://www.cobblestoneconcepts.com/ucgis2summer2002/vatsavai/vatsavai.htm
Boucelma, O., Colonna, F.M.: GQuery: a query language for GML. In: 24th Urban Data Management Symposium (2004)
Jihong, G.: GQL: Extending XQuery to query GML documents. Geo-spatial Information Science 9(2), 118–126 (2006)
XQuery 1.0: An XML query language, http://www.w3.org/XML/Query/
GZip 1.2.4, http://www.gzip.org
Grossi, R., Gupta, A., Vitter, J.S.: High-order entropy-compressed text indexes. In: SODA 2003 (2003)
Ferragina, P., Giancarlo, R., Manzini, G.: The myriad virtues of wavelet trees. Information and Computation 207(8), 849–866 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wei, Q. (2011). A Query-Friendly Compression for GML Documents. In: Xu, J., Yu, G., Zhou, S., Unland, R. (eds) Database Systems for Adanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20244-5_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-20244-5_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20243-8
Online ISBN: 978-3-642-20244-5
eBook Packages: Computer ScienceComputer Science (R0)