Lempel–Ziv Factorization Using Less Time & Space

Chen, Gang; Puglisi, Simon J.; Smyth, W. F.

doi:10.1007/s11786-007-0024-4

Lempel–Ziv Factorization Using Less Time & Space

Published: 11 April 2008

Volume 1, pages 605–623, (2008)
Cite this article

Mathematics in Computer Science Aims and scope Submit manuscript

Gang Chen¹,
Simon J. Puglisi² &
W. F. Smyth^3,4

153 Accesses
35 Citations
3 Altmetric
Explore all metrics

Abstract.

For 30 years the Lempel–Ziv factorization LZ_x of a string x = x[1..n] has been a fundamental data structure of string processing, especially valuable for string compression and for computing all the repetitions (runs) in x. Traditionally the standard method for computing LZ_x was based on Θ(n)-time (or, depending on the measure used, O(n log n)-time) processing of the suffix tree ST_x of x. Recently Abouelhoda et al. proposed an efficient Lempel–Ziv factorization algorithm based on an “enhanced” suffix array – that is, a suffix array SA_x together with supporting data structures, principally an “interval tree”. In this paper we introduce a collection of fast space-efficient algorithms for LZ factorization, also based on suffix arrays, that in theory as well as in many practical circumstances are superior to those previously proposed; one family out of this collection achieves true Θ(n)-time alphabet-independent processing in the worst case by avoiding tree structures altogether.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Lempel–Ziv Factorization Powered by Space Efficient Suffix Trees

Article 25 July 2017

Linear Time Lempel-Ziv Factorization: Simple, Fast, Small

Applications of V-Order: Suffix Arrays, the Burrows-Wheeler Transform & the FM-index

Author information

Authors and Affiliations

Department of Computing & Software, McMaster University, Hamilton, Ontario, L8S 4K1, Canada
Gang Chen
School of Computer Science & Information Technology, RMIT University, GPO Box 2476V, Melbourne, Victoria, 3001, Australia
Simon J. Puglisi
Digital Ecosystems & Business Intelligence Institute, Curtin University of Technology, GPO Box U1987, Perth, Western Australia, 6845, Australia
W. F. Smyth
Algorithms Research Group, Department of Computing & Software, McMaster University, Hamilton, Ontario, L8S 4K1, Canada
W. F. Smyth

Authors

Gang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Simon J. Puglisi
View author publications
You can also search for this author in PubMed Google Scholar
W. F. Smyth
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to W. F. Smyth.

Additional information

The work of the first and third authors was supported in part by grants from the Natural Sciences & Engineering Research Council of Canada.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, G., Puglisi, S.J. & Smyth, W.F. Lempel–Ziv Factorization Using Less Time & Space. Math.comput.sci. 1, 605–623 (2008). https://doi.org/10.1007/s11786-007-0024-4

Download citation

Received: 31 March 2007
Revised: 06 August 2007
Accepted: 21 September 2007
Published: 11 April 2008
Issue Date: June 2008
DOI: https://doi.org/10.1007/s11786-007-0024-4

Mathematics Subject Classification (2000).

68W05

Keywords.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Lempel–Ziv Factorization Using Less Time & Space

Abstract.

Access this article

Similar content being viewed by others

Lempel–Ziv Factorization Powered by Space Efficient Suffix Trees

Linear Time Lempel-Ziv Factorization: Simple, Fast, Small

Applications of V-Order: Suffix Arrays, the Burrows-Wheeler Transform & the FM-index

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Mathematics Subject Classification (2000).

Keywords.

Navigation

Lempel–Ziv Factorization Using Less Time & Space

Abstract.

Access this article

Similar content being viewed by others

Lempel–Ziv Factorization Powered by Space Efficient Suffix Trees

Linear Time Lempel-Ziv Factorization: Simple, Fast, Small

Applications of V-Order: Suffix Arrays, the Burrows-Wheeler Transform & the FM-index

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Mathematics Subject Classification (2000).

Keywords.

Search

Navigation