Abstract
To search a pattern P in a text, index data structures such as suffix trees and suffix arrays are widely used. It is known that searching with suffix trees is faster than with suffix arrays in the aspect of time complexity. But recently, a few linear-time search algorithms for constant-size alphabet in suffix arrays have been suggested. One of such algorithms proposed by Sim et al. uses Burrows-Wheeler transform and takes O(|P|log|Σ|) time. But this algorithm needs too much space compared to Abouelhoda et al.’s algorithm to search a pattern.
In this paper we present an improved version for Sim et al.’s algorithm. It needs only 2n bytes at most if a given alphabet is sufficiently small.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abouelhoda, M.I., Ohlebusch, E., Kurtz, S.: Optimal exact string matching based on suffix arrays. In: Laender, A.H.F., Oliveira, A.L. (eds.) SPIRE 2002. LNCS, vol. 2476, pp. 31–43. Springer, Heidelberg (2002)
Burkhardt, S., Kärkkäinen, J.: Fast lightweight suffix array construction and checking. In: Baeza-Yates, R., Chávez, E., Crochemore, M. (eds.) CPM 2003. LNCS, vol. 2676, pp. 55–69. Springer, Heidelberg (2003)
Choi, Y.W., Sim, J.S., Park, K.: Time and space efficinet search with suffix arrays. Journal of Korea Information Science Society (accepted)
Farach, M.: Optimal suffix tree construction with large alphabets. In: FOCS, pp. 137–143 (1997)
Farach-Colton, M., Ferragina, P., Muthukrishnan, S.: On the sorting-complexity of suffix tree construction. JACM 47, 987–1011 (2000)
Ferragina, P., Manzini, G.: Opportunistic data structures with applications. In: FOCS, pp. 390–398 (2001)
Gonnet, G., Baeza-Yates, R., Snider, T.: New indices for text: Pat trees and pat arrays. In: Frakes, W.B., Baeza-Yates, R.A. (eds.) Information Retrieval: Data Structures & Algorithms, pp. 66–82. Prentice Hall, Englewood Cliffs (1992)
Gusfield, D.: Algorithms on Strings, Trees, and Sequences. Cambridge Univ. Press, Cambridge (1997)
D. Gusfield: An Increment-by-one approach to suffix arrays and trees (1990) (manuscript)
Hariharan, R.: Optimal parallel suffix tree construction. J. Comput. Syst. Sci. 55, 44–69 (1997)
Hon, W., Sadakane, K., Sung, W.: Breaking a time-and-space barrier in constructing full-text indices. In: FOCS (2003) (accepted)
Kärkkäinen, J., Sanders, P.: Simple linear work suffix array construction. In: James, A., Younas, M., Lings, B. (eds.) BNCOD 2003. LNCS, vol. 2712, pp. 943–955. Springer, Heidelberg (2003)
Kim, D., Sim, J.S., Park, H., Park, K.: Linear-time construction of suffix arrays. Journal of Discrete Algorithms 3/2-4, 126–142 (2005)
Ko, P., Aluru, S.: Space efficient linear time construction of suffix arrays. In: Baeza-Yates, R., Chávez, E., Crochemore, M. (eds.) CPM 2003. LNCS, vol. 2676, pp. 200–210. Springer, Heidelberg (2003)
Manber, U., Myers, G.: Suffix arrays: A new method for on-line string searches. SIAM J. Comput. 22, 935–938 (1993)
McCreight, E.M.: A space-economical suffix tree construction algorithm. JACM 23, 262–272 (1976)
Sadakane, K.: Succinct representation of lcp information and improvement in the compressed suffix arrays. In: SODA, pp. 225–232 (2002)
Sim, J.S., Kim, D.K., Park, H., Park, K.: Linear-time search in suffix arrays. Journal of Korea Information Science Society (accepted)
Weiner, P.: Linear pattern matching algorithms. In: Proc. 14th IEEE Symp. Switching and Automata Theory, pp. 1–11 (1973)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sim, J.S. (2005). Time and Space Efficient Search for Small Alphabets with Suffix Arrays. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3613. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539506_136
Download citation
DOI: https://doi.org/10.1007/11539506_136
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28312-6
Online ISBN: 978-3-540-31830-9
eBook Packages: Computer ScienceComputer Science (R0)