Abstract
The article presents a new algorithm for random DNA fragment assembly. The algorithm uses an extended de Bruijn graph that stores information of reads coverage. It is able to reconstruct consecutive repetitive sequences longer than reads and to process large amount of data provided by new generation sequencers. Preliminary simulation results show the advantages of the method.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Shendure, J., Ji, H.: Next-generation dna sequencing. Nature Biotechnology 26(10), 1135–1145 (2008)
Sanger, F., Nicklen, S., Coulson, A.: Dna sequencing with chain-terminating inhibitors. Proceedings of the National Academy of Sciences 74(12), 5463 (1977)
Lander, E., Waterman, M.: Genomic mapping by fingerprinting random clones: a mathematical analysis. Genomics 2(3), 231–239 (1988)
Zhang, W., Chen, J., Yang, Y., Tang, Y., Shang, J., Shen, B.: A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies. PloS One 6(3), e17915 (2011)
Pevzner, P., Tang, H., Waterman, M.: An eulerian path approach to dna fragment assembly. Proceedings of the National Academy of Sciences 98(17), 9748 (2001)
Myers, E.: Toward simplifying and accurately formulating fragment assembly. Journal of Computational Biology 2(2), 275–290 (1995)
Myers, E.: The fragment assembly string graph. Bioinformatics 21(suppl. 2), ii79–ii85 (2005)
Pevzner, P., Tang, H., Tesler, G.: De novo repeat classification and fragment assembly. Genome Research 14(9), 1786–1796 (2004)
Cormen, T., Leiserson, C., Rivest, R., Stein, C.: Introduction to algorithms. The MIT press (2001)
libraries, B.: http://www.boost.org
Hochhut, B., Wilde, C., Balling, G., Middendorf, B., Dobrindt, U., Brzuszkiewicz, E., Gottschalk, G., Carniel, E., Hacker, J.: Role of pathogenicity island-associated integrases in the genome plasticity of uropathogenic escherichia coli strain 536. Molecular Microbiology 61(3), 584–595 (2006)
Goffeau, A., Barrell, B., Bussey, H., Davis, R., Dujon, B., Feldmann, H., Galibert, F., Hoheisel, J., Jacq, C., Johnston, M., et al.: Life with 6000 genes. Science 274(5287), 546 (1996)
Paszkiewicz, K., Studholme, D.: De novo assembly of short sequence reads. Briefings in Bioinformatics 11(5), 457 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nowak, R.M. (2012). Genome Assembler for Repetitive Sequences. In: Piętka, E., Kawa, J. (eds) Information Technologies in Biomedicine. Lecture Notes in Computer Science(), vol 7339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31196-3_42
Download citation
DOI: https://doi.org/10.1007/978-3-642-31196-3_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31195-6
Online ISBN: 978-3-642-31196-3
eBook Packages: Computer ScienceComputer Science (R0)