Conferences >2011 11th International Confe...

New data structures for analyzing frequent factors in strings

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Discovering frequent factors from long strings is an important problem in many applications, such as biosequence mining. In classical approaches, the algorithms process a...Show More

Metadata

Abstract:

Discovering frequent factors from long strings is an important problem in many applications, such as biosequence mining. In classical approaches, the algorithms process a vast database of small strings. However, in this paper we analyze a small database of long strings. The main difference resides in the high number of patterns to analyze. To tackle the problem, we have developed a new algorithm for discovering frequent factors in long strings. This algorithm uses a new data structure to arrange nodes in a trie. A positioning matrix is defined as a new positioning strategy. By using positioning matrices, we can apply advanced prune heuristics in a trie with a minimal computational cost. The positioning matrices let us process strings including Short Tandem Repeats and calculate different interestingness measures efficiently. The algorithm has been successfully used in natural language and biological sequence contexts.

Published in: 2011 11th International Conference on Intelligent Systems Design and Applications

Date of Conference: 22-24 November 2011

Date Added to IEEE Xplore: 02 January 2012

ISBN Information:

ISSN Information:

DOI: 10.1109/ISDA.2011.6121772

Conference Location: Cordoba, Spain

Contents

References is not available for this document.

New data structures for analyzing frequent factors in strings

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

New data structures for analyzing frequent factors in strings

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?