Dictionary-based order-preserving string compression

Antoshenkov, Gennady

doi:10.1007/s007780050031

Dictionary-based order-preserving string compression

Published: February 1997

Volume 6, pages 26–39, (1997)
Cite this article

The VLDB Journal Aims and scope Submit manuscript

Gennady Antoshenkov¹

263 Accesses
22 Citations
3 Altmetric
Explore all metrics

Abstract.

As no database exists without indexes, no index implementation exists without order-preserving key compression, in particular, without prefix and tail compression. However, despite the great potentials of making indexes smaller and faster, application of general compression methods to ordered data sets has advanced very little. This paper demonstrates that the fast dictionary-based methods can be applied to order-preserving compression almost with the same freedom as in the general case. The proposed new technology has the same speed and a compression rate only marginally lower than the traditional order-indifferent dictionary encoding. Procedures for encoding and generating the encode tables are described covering such order-related features as ordered data set restrictions, sensitivity and insensitivity to a character position, and one-symbol encoding of each frequent trailing character sequence. The experimental results presented demonstrate five-folded compression on real-life data sets and twelve-folded compression on Wisconsin benchmark text fields.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Author information

Authors and Affiliations

Oracle Corporation, New England Development Center, 110 Spitbrook Road, Nashua, NH 03062, USA; e-mail: gantoshe@us.oracle.com, , , , , , US
Gennady Antoshenkov

Authors

Gennady Antoshenkov
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Edited by M.T. Ozsu. Received 1 February 1995 / Accepted 1 November 1995

Rights and permissions

Reprints and permissions

About this article

Cite this article

Antoshenkov, G. Dictionary-based order-preserving string compression . The VLDB Journal 6, 26–39 (1997). https://doi.org/10.1007/s007780050031

Download citation

Issue Date: February 1997
DOI: https://doi.org/10.1007/s007780050031

Key words:Indexing – Order-preserving key compression

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Dictionary-based order-preserving string compression

Abstract.

Access this article

Similar content being viewed by others

A survey on the evolution of stream processing systems

Feistel Networks

A Hierarchical Error Correction Strategy for Text DNA Storage

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Navigation

Dictionary-based order-preserving string compression

Abstract.

Access this article

Similar content being viewed by others

A survey on the evolution of stream processing systems

Feistel Networks

A Hierarchical Error Correction Strategy for Text DNA Storage

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation