Abstract
In this article we have proposed an abstract representation for a sequence using a constant sized 3D matrix. Subsequently the representation may be utilized for many analytical purposes. We have attempted to use it for comparing sequences, and analyzed the method’s asymptotic complexity. Providing a metric for sequence comparison is an underlying operation to many bioinformatics applications. In order to show the effectiveness of the proposed sequence comparison technique we have generated some phylogeny over two sets of bio-sequences and compared them with the ones available in literature. The results prove that our technique is comparable to the standard ones. The technique, called the correlogram-based method, is borrowed from the image analysis area. We have also done some experiments with synthetically generated sequences in order to compare correlogram-based method with the well-known dynamic programming method. Finally, we have discussed some other possibilities on how our method can be used or extended.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. Journal of Molecular. Biology 215, 403–410 (1990)
Bertorelle, G., Barbujanit, G.: Analysis of DNA diversity by spatial auto correlation. Genetics 140(2), 811–819 (1995)
Chapman, M.S., Rossmann, M.G.: Structure, Sequence and Function Correlations among Parvoviruses. Viriology 194(2), 491–508 (1993)
Huang, J., Mitra, M., Zhu, W.J., Zabih, R.: Image Indexing using color correlograms. International Journal of Computer Vision 35(3), 245–268 (1999)
Lai, A.C.K., Rogers, K.M., Glaser, A., Tudor, L., Chambers, T.: Alternate circulation of recent equine-2 influenza viruses (H3N8) from two distinct lineages in the United States. Virus Res. 100(2), 159–164 (2004)
Macchiato, M.F., Cuomo, V., Tramontano, A.: Determination of the autocorrelation orders of proteins. Genetics 140, 811–819 (1995)
Rosenberg, M.S., Subramanian, S., Kumar, S.: Patterns of Transitional Mutation Biases Within and Among Mammalian Genomes. Mol. Biol. Evol. 20(6), 988–993 (2003)
Samant, G., Mitra, D.: Correlogram method for Comparing Bio-Sequences, Florida Institute of Technology Technical Report No. CS-2006-01 (2005), http://www.cs.fit.edu/~tr/cs-2006-01.pdf
Smith, T.F., Waterman, M.S.: Identification of common molecular sequences. Journal of Molecular Biology 147, 195–197 (1981)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mitra, D., Samant, G., Sengupta, K. (2006). Correlogram-Based Method for Comparing Biological Sequences. In: Ali, M., Dapoigny, R. (eds) Advances in Applied Artificial Intelligence. IEA/AIE 2006. Lecture Notes in Computer Science(), vol 4031. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11779568_102
Download citation
DOI: https://doi.org/10.1007/11779568_102
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-35453-6
Online ISBN: 978-3-540-35454-3
eBook Packages: Computer ScienceComputer Science (R0)