Compression-based normal similarity measures for DNA sequences

Abstract:

Similarity measures based on compression assess the distance between two objects based on the number of bits needed to describe one, given a description of the other. Theoretically, compression-based similarity depends on the concept of Kolmogorov complexity, which is non-computable. The implementations require compression algorithms that are approximately normal. The approach has important advantages (no signal features to identify and extract, for example) but the compression method must be normal. This paper proposes normal algorithms based on mixtures of finite context models. Normality is attained by combining two new ideas: the use of least-recently-used caching in the context models, to allow deeper contexts, and data interleaving, to better explore that cache. Examples for DNA sequences are given (at the human genome scale).
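
As a rough illustration of the general idea, the sketch below computes the normalized compression distance (NCD), a widely used compression-based measure; the abstract does not say which measure the paper uses, so this choice is an assumption. The compressor is zlib, used here only as a convenient stand-in and not the mixture of finite context models that the paper proposes. The names `compressed_size` and `ncd` and the synthetic sequences are hypothetical.

```python
import random
import zlib


def compressed_size(data: bytes) -> int:
    """Approximate the description length of `data` by its zlib-compressed size in bytes."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance: (C(xy) - min(C(x), C(y))) / max(C(x), C(y))."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


# Synthetic DNA-like data (assumption: illustration only, not real genomic sequences).
rng = random.Random(0)
a = bytes(rng.choices(b"ACGT", k=2000))

# b is a copy of a with one substitution every 100 bases.
mutated = bytearray(a)
for i in range(0, len(a), 100):
    mutated[i] = ord("C") if a[i] == ord("A") else ord("A")
b = bytes(mutated)

# c is an unrelated random sequence of the same length.
c = bytes(rng.choices(b"ACGT", k=2000))

print("NCD(a, a) =", ncd(a, a))  # ideally close to 0 for a normal compressor
print("NCD(a, b) =", ncd(a, b))  # related sequences
print("NCD(a, c) =", ncd(a, c))  # unrelated sequences
```

For a normal compressor, compressing a sequence concatenated with itself should cost about the same as compressing it once, so NCD(a, a) stays near zero. zlib only approximates this for short inputs because its match window is limited to 32 KiB, which hints at why a compressor designed for normality is needed at genome scale.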

Published in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 04-09 May 2014

Date Added to IEEE Xplore: 14 July 2014

Electronic ISBN: 978-1-4799-2893-4

DOI: 10.1109/ICASSP.2014.6853630

Conference Location: Florence, Italy
