Suffix Trees on Words

Andersson, A.; Larsson, N. J.; Swanson, K.

doi:10.1007/PL00009260

Published: March 1999

Algorithmica, Volume 23, pages 246–260 (1999)

Abstract.

We discuss an intrinsic generalization of the suffix tree, designed to index a string of length n that has a natural partitioning into m multicharacter substrings, or words. This word suffix tree represents only the m suffixes that start at word boundaries. These boundaries are determined by delimiters, whose definition depends on the application.
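
To make the definition concrete, the following minimal sketch lists the m word-boundary suffixes of a string and answers prefix queries over them. It is not the construction algorithm of the paper: it keeps a plain sorted list of suffix start positions instead of a tree, it spends far more than O(n) time on sorting, and it assumes that whitespace acts as the delimiter. The function names are hypothetical and chosen only for illustration.

```python
def word_suffix_starts(text, is_delimiter=str.isspace):
    """Return the positions where a word begins.

    These are the start positions of the m suffixes that a word suffix
    tree would represent. The delimiter test is an assumption; the
    abstract leaves its definition to the application.
    """
    starts = []
    prev_delim = True  # the beginning of the text counts as a boundary
    for i, ch in enumerate(text):
        delim = is_delimiter(ch)
        if prev_delim and not delim:
            starts.append(i)
        prev_delim = delim
    return starts


def naive_word_suffix_index(text, is_delimiter=str.isspace):
    """Sort the word-boundary suffixes lexicographically (naive stand-in for the tree)."""
    return sorted(word_suffix_starts(text, is_delimiter), key=lambda i: text[i:])


def find_at_word_boundary(text, index, pattern):
    """Return all word-boundary positions at which `pattern` occurs."""
    lo, hi = 0, len(index)
    # Binary search for the first suffix whose prefix is >= pattern.
    while lo < hi:
        mid = (lo + hi) // 2
        start = index[mid]
        if text[start:start + len(pattern)] < pattern:
            lo = mid + 1
        else:
            hi = mid
    hits = []
    # Matching suffixes are contiguous in the sorted order.
    while lo < len(index) and text.startswith(pattern, index[lo]):
        hits.append(index[lo])
        lo += 1
    return sorted(hits)


text = "to be or not to be"
index = naive_word_suffix_index(text)
print(len(text), len(index))                      # n and m
print(find_at_word_boundary(text, index, "to be"))
print(find_at_word_boundary(text, index, "o"))    # "o" inside "to" is not reported
```

In this example only 6 of the 18 suffixes are indexed, so a query for "o" finds the word "or" but ignores the occurrences of "o" inside "to" and "not".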

Since traditional suffix tree construction algorithms rely heavily on the fact that all suffixes are inserted, constructing a word suffix tree is nontrivial, particularly when only O(m) construction space is allowed. We solve this problem by presenting an algorithm with O(n) expected running time. In general, construction cost is Ω(n) because the entire input must be scanned. In applications that require strict node ordering, an additional cost of sorting O(m') characters arises, where m' is the number of distinct words. In either case, this is a significant improvement over previously known solutions.

Furthermore, when the alphabet is small, we may assume that the n characters in the input string occupy o(n) machine words. We illustrate that this can allow a word suffix tree to be built in sublinear time.
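
The packing assumption can be illustrated with a small sketch. It assumes a fixed alphabet, a fixed-width code of ceil(log2(alphabet size)) bits per character, and 64-bit machine words simulated with Python integers; none of these details come from the abstract, and the function names are hypothetical. With a constant-size alphabet and words of roughly log n bits, such a layout stores n characters in about n / log n words, which is one way the o(n) bound can arise.

```python
def pack(text, alphabet, word_bits=64):
    """Pack `text` into a list of integers, each standing in for one machine word."""
    bits = max(1, (len(alphabet) - 1).bit_length())  # bits per character
    per_word = word_bits // bits                     # characters per word
    code = {ch: i for i, ch in enumerate(alphabet)}
    words = []
    for start in range(0, len(text), per_word):
        w = 0
        for offset, ch in enumerate(text[start:start + per_word]):
            w |= code[ch] << (offset * bits)
        words.append(w)
    return words, bits


def unpack(words, bits, length, alphabet, word_bits=64):
    """Inverse of `pack`: recover the first `length` characters."""
    per_word = word_bits // bits
    mask = (1 << bits) - 1
    chars = []
    for i in range(length):
        w = words[i // per_word]
        chars.append(alphabet[(w >> ((i % per_word) * bits)) & mask])
    return "".join(chars)


dna = "ACGT" * 20                      # n = 80 characters
words, bits = pack(dna, "ACGT")
print(len(dna), bits, len(words))      # characters, bits per character, words used
assert unpack(words, bits, len(dna), "ACGT") == dna
```

An algorithm that processes the input one packed word at a time, rather than one character at a time, then touches fewer than n units of input, which is the sense in which sublinear construction time becomes possible.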

Author information

Authors and Affiliations

Department of Computer Science, Lund University, Box 118, S-221 00 Lund, Sweden. arne@dna.lth.se, jesper@dna.lth.se, kurt@dna.lth.se
A. Andersson, N. J. Larsson & K. Swanson

Additional information

Received September 2, 1997; revised December 10, 1997.

About this article

Cite this article

Andersson, A., Larsson, N. & Swanson, K. Suffix Trees on Words. Algorithmica 23, 246–260 (1999). https://doi.org/10.1007/PL00009260

Issue Date: March 1999
DOI: https://doi.org/10.1007/PL00009260

Key words. Suffix trees, Substring searching.
