Abstract
Chaos Game Representation (CGR) of DNA sequences has been used for visual representation as well as alignment-free comparisons. CGR is considered to be of great value as the images obtained from parts of a genome present the same structure as those obtained for the whole genome. However, the robustness of the CGR method to compare DNA sequences obtained in a variety of scenarios is not yet fully demonstrated. This paper addresses this issue by presenting a method to evaluate the potential of CGR to distinguish various classes in a DNA dataset. Two indices are proposed for this purpose - a rejection rate (\(\alpha \)) and an overlapping rate (\(\beta \)). The method was applied to 4 datasets, with between 31 to 400 classes each. Nearly 430 million pairs of DNA sequences were compared using the CGR.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Deschavanne, P.J., Giron, A., Vilain, J., Fagot, G., Fertil, B.: Genomic signature: characterization and classification of species assessed by chaos game representation of sequences. Mol. Biol. Evol. 16(10), 1391–1399 (1999)
Hoang, T., Yin, C., Yau, S.S.T.: Numerical encoding of DNA sequences by chaos game representation with application in similarity comparison. Genomics 108, 134–142 (2016)
Jeffrey, H.J.: Chaos game representation of gene structure. Nucleic Acids Res. 18, 2163–2170 (1990)
Joseph, J., Sasikumar, R.: Chaos game representation for comparison of whole genomes. BMC Bioinform. 7, 243 (2006)
Kari, L., et al.: Mapping the space of genomic signatures. PLoS ONE 10(5), e0119815 (2015)
Mitra, S.K.: Digital Signal Processing: A Computer-Based Approach, 4th edn. McGraw-Hill, New York (2011)
Ni, H.M., Qi, D.W., Mu, H.B.: Applying MSSIM combined chaos game representation to genome sequences analysis. Genomics 110(3), 180–190 (2018)
Palmenberg, A.C., et al.: Sequencing and analyses of all known human rhinovirus genomes reveal structure and evolution. Science 324, 55–59 (2009)
Stan, C., Cristescu, C.P., Scarlat, E.I.: Similarity analysis for DNA sequences based on chaos game representation. Case study: the albumin. J. Theoret. Biol. 267, 513–518 (2010)
Stepanyan, I.V., Petoukhov, S.V.: The matrix method of representation, analysis and classification of long genetic sequences. Information 8(1), 12 (2017)
Tanchotsrinon, W., Lursinsap, C., Poovorawan, Y.: A high performance prediction of HPV genotypes by chaos game representation and singular value decomposition. BMC Bioinform. 16, 71 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Marcal, A.R.S. (2018). Evaluation of Chaos Game Representation for Comparison of DNA Sequences. In: Barneva, R., Brimkov, V., Tavares, J. (eds) Combinatorial Image Analysis. IWCIA 2018. Lecture Notes in Computer Science(), vol 11255. Springer, Cham. https://doi.org/10.1007/978-3-030-05288-1_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-05288-1_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05287-4
Online ISBN: 978-3-030-05288-1
eBook Packages: Computer ScienceComputer Science (R0)