Years and Authors of Summarized Original Work
-
2000, 2003; Sadakane
-
2000, 2005; Grossi, Vitter
-
2000, 2005; Ferragina, Manzini
Problem Definition
Given a text string T = t1t2… t n over an alphabet \(\varSigma\) of size \(\sigma\), the suffix array A[1, n] is a permutation of the interval [1, n] that sorts the suffixes of T. More precisely, it satisfies T[A[i], n] < T[A[i + 1], n] for all 1 ≤ i < n, where “ < ” between strings is the lexicographical order. The suffix array is the canonical full-text index that allows to efficiently compute basic string matching queries on T.
The compressed suffix array (CSA) problem asks to replace A with a space-efficient data structure that is capable of efficiently computing A[i].
If a CSA does not require T to operate, and is capable of efficiently answering substring queries on T, it is called a self-index, as it can be seen as a replacement of T itself. Typical queries required from such an index are the following:
-
count(P): count how many times a...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Belazzougui D, Navarro G (2011) Alphabet-independent compressed text indexing. In: ESA, Saarbrücken, pp 748–759
Burrows M, Wheeler D (1994) A block sorting lossless data compression algorithm. Technical report 124, Digital Equipment Corporation
Ferragina P, Manzini G (2005) Indexing compressed texts. J ACM 52(4):552–581
Ferragina P, Giancarlo R, Manzini G, Sciortino M (2005) Boosting textual compression in optimal linear time. J ACM 52(4):688–713
Ferragina P, Manzini G, Mäkinen V, Navarro G (2007) Compressed representations of sequences and full-text indexes. ACM Trans Algorithms 3(2):20
Foschini L, Grossi R, Gupta A, Vitter JS (2006) When indexing equals compression: experiments with compressing suffix arrays and applications. ACM Trans Algorithms 2(4):611–639
Golynski A, Raman R, Srinivasa Rao S (2008) On the redundancy of succinct data structures. In: SWAT, Gothenburg, pp 148–159
Grossi R, Vitter J (2006) Compressed suffix arrays and suffix trees with applications to text indexing and string matching. SIAM J Comput 35(2):378–407
Grossi R, Gupta A, Vitter J (2003) High-order entropy-compressed text indexes. In: Proceedings of the 14th annual ACM-SIAM symposium on discrete algorithms (SODA), Baltimore, pp 841–850
Kärkkäinen J, Puglisi SJ (2011) Fixed block compression boosting in fm-indexes. In: SPIRE, Pisa, pp 174–184
Mäkinen V, Navarro G (2005) Succinct suffix arrays based on run-length encoding. Nord J Comput 12(1):40–66
Mäkinen V, Navarro G (2008) Dynamic entropy-compressed sequences and full-text indexes. ACM Trans Algorithms 4(3):32
Manber U, Myers G (1993) Suffix arrays: a new method for on-line string searches. SIAM J Comput 22(5):935–948
Manzini G (2001) An analysis of the Burrows-Wheeler transform. J ACM 48(3):407–430
Navarro G, Mäkinen V (2007) Compressed full-text indexes. ACM Comput Surv 39(1): Article 2
Sadakane K (2003) New text indexing functionalities of the compressed suffix arrays. J Algorithms 48(2):294–313
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media New York
About this entry
Cite this entry
Belazzougui, D., Mäkinen, V., Valenzuela, D. (2016). Compressed Suffix Array. In: Kao, MY. (eds) Encyclopedia of Algorithms. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-2864-4_82
Download citation
DOI: https://doi.org/10.1007/978-1-4939-2864-4_82
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-2863-7
Online ISBN: 978-1-4939-2864-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering