Abstract
With the development of next generation sequencing (NGS) technology, genomic research now requires analysis at the entire genome level. Because of easy access to very large amounts of data, it is desirable to look at all the data rather than examine individual bases. At this time, data visualization of the entire genome level can be very useful. However, most visualization tools simply visualize the resulting files derived from external analysis systems. In this study, it was possible to intuitively present the entire sequence to a researcher by converting the data for the entire genome into a 3-dimensional plot. In addition, by compressing the information in 3D space with run length encoding and storing it in a skip list, it is possible to perform fast comparison and search sequences with low complexity by layering base information. As a result, compared to alignment-based sequence comparisons, we obtained improved search results, and we could examine sequences from various angles using layered information.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Coordinators, N.R.: Database resources of the national center for biotechnology information. Nucleic Acids Res. 44(Database issue), D7 (2016)
Da-Young, L., Kyung-Rim, K., Taeyong, K., Hwan-Gue, C.: Comparison-specialized visualization model for whole genome sequences. J. WSCG 24(2), 43–52 (2016)
Ekdahl, S., Sonnhammer, E.L.: ChromoWheel: a new spin on eukaryotic chromosome visualization. Bioinformatics 20(4), 576–577 (2004)
Hubbard, T., Barker, D., Birney, E., Cameron, G., Chen, Y., Clark, L., Cox, T., Cuff, J., Curwen, V., Down, T., et al.: The ensembl genome database project. Nucleic Acids Res. 30(1), 38–41 (2002)
Johnson, M., Zaretskaya, I., Raytselis, Y., Merezhuk, Y., McGinnis, S., Madden, T.L.: NCBI BLAST: a better web interface. Nucleic Acids Res. 36(suppl-2), W5–W9 (2008)
Krzywinski, M.I., Schein, J.E., Birol, I., Connors, J., Gascoyne, R., Horsman, D., Jones, S.J., Marra, M.A.: Circos: an information aesthetic for comparative genomics. Genome Res. 19(9), 1639–1645 (2009)
Lee, D.Y., Tak, H.S., Kim, H.H., Cho, H.G.: Alignment-free sequence searching over whole genomes using 3D random plot of query DNA sequences. Informatica 42(3) (2018)
Pugh, W.: Skip lists: a probabilistic alternative to balanced trees. Commun. ACM 33(6), 668–676 (1990)
Randic, M., Zupan, J., Plavsic, D., et al.: A novel unexpected use of a graphical representation of DNA: graphical alignment of DNA sequences. Chem. Phys. Lett. 431, 375–379 (2006)
Rutherford, K., Parkhill, J., Crook, J., Horsnell, T., Rice, P., Rajandream, M.A., Barrell, B.: Artemis: sequence visualization and annotation. Bioinformatics 16(10), 944–945 (2000)
Sperber, G., Lövgren, A., Eriksson, N.E., Benachenhou, F., Blomberg, J.: Retrotector online, a rational tool for analysis of retroviral elements in small and medium size vertebrate genomic sequences. BMC Bioinformatics 10(6), S4 (2009)
Sullivan, M.J., Petty, N.K., Beatson, S.A.: Easyfig: a genome comparison visualizer. Bioinformatics 27(7), 1009–1010 (2011)
Acknowledgment
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2017R1D1A1A02018504).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Lee, DY., Tak, HS., Cho, HG. (2019). Sequence Searching and Visualizing over 3D Random Plot of Whole Genome Using Skip List. In: Lee, S., Ismail, R., Choo, H. (eds) Proceedings of the 13th International Conference on Ubiquitous Information Management and Communication (IMCOM) 2019. IMCOM 2019. Advances in Intelligent Systems and Computing, vol 935. Springer, Cham. https://doi.org/10.1007/978-3-030-19063-7_64
Download citation
DOI: https://doi.org/10.1007/978-3-030-19063-7_64
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19062-0
Online ISBN: 978-3-030-19063-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)