Abstract
Sequencing errors can be difficult to detect due to the high rate of production of new data, which makes manual curation unfeasible. To address these shortcomings we have developed a phylogenetic inspired algorithm to assess the quality of new sequences given a related phylogeny. Its performance and efficiency have been evaluated with human mitochondrial DNA data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Achilli, A., Rengo, C., Magri, C., Battaglia, V., Olivieri, A., Scozzari, R., Cruciani, F., Zeviani, M., Briem, E., Carelli, V., Moral, P., Dugoujon, J.M., Roostalu, U., Loogvöli, E.L., Kivisild, T., Bandelt, H.J., Richards, M., Villems, R., Santachiara-Benerecetti, A.S., Semino, O., Torroni, A.: The molecular dissection of mtDNA haplogroup H confirms that the Franco-Cantabrian glacial refuge was a major source for the European gene pool. American Journal of Human Genetics 75, 910–918 (2004)
Andrews, R.M., Kubacka, I., Chinnery, P.F., Lightowlers, R.N., Turnbull, D.M., Howell, N.: Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nature Genetics 23, 147 (1999)
Bandelt, H.J., Macaulay, V., Richards, M. (eds.): Human mitochondrial DNA and the evolution of Homo sapiens. Springer, Berlin (2006)
Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Sayers, E.W.: GenBank. Nucleic Acids Research 38, D46–D51 (2010)
Blanco, R., Mayordomo, E.: ZARAMIT: A System for the Evolutionary Study of Human Mitochondrial DNA. In: Omatu, S., Rocha, M.P., Bravo, J., Fernández, F., Corchado, E., Bustillo, A., Corchado, J.M. (eds.) IWANN 2009, Part II. LNCS, vol. 5518, pp. 1139–1142. Springer, Heidelberg (2009)
Edgar, R.C.: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32, 1792–1797 (2004)
Kim, S., Tang, H., Mardis, E.R. (eds.): Genome sequencing technology and algorithms. Artech House, Norwood (2007)
Margulies, M., Egholm, M., Altman, W.E., Attiya, S., Bader, J.S., Bemben, L.A., Berka, J., Braverman, M.S., Chen, Y.J., Chen, Z., Dewell, S.B., Du, L., Fierro, J.M., Gomes, X.V., Goodwin, B.C., He, W., Helgesen, S., He Ho, C., Irzyk, G.P., Jando, S.C., Alenquer, M.L.I., Jarvie, T.P., Jirage, K.B., Kim, J.B., Knight, J.R., Lanza, J.R., Leamon, J.H., Lefkowitz, S.M., Lei, M., Li, J., Lohman, K.L., Lu, H., Makhijani, V.B., McDade, K.E., McKenna, M.P., Myers, E.W., Nickerson, E., Nobile, J.R., Plant, R., Puc, B.P., Ronan, M.T., Roth, G.T., Sarkis, G.J., Simons, J.F., Simpson, J.W., Srinivasan, M., Tartaro, K.R., Tomasz, A., Vogt, K.A., Volkmer, G.A., Wang, S.H., Wang, Y., Weiner, M.P., Yu, P., Begley, R.F., Rothberg, J.M.: Genome sequencing in open microfabricated high density picoliter reactors. Nature 437, 376–380 (2005)
Matsen, F.A., Kodner, R.B., Armbrust, E.V.: pplacer: linear time maximum-likelihood and bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics 11, 538 (2010)
Olsen, G.J., Overbeek, R., Larsen, N., Marsh, T.L., McCaughey, M.J., Maciukenas, M.A., Kuan, W.M., Macke, T.J., Xing, Y., Woese, C.R.: The ribosomal database project. Nucleic Acids Research 20(supplement), 2199–2200 (1992)
Rajkumar, R., Banerjee, J., Gunturi, H.B., Trivedi, R., Kashyap, V.K.: Phylogeny and antiquity of M macrohaplogroup inferred from complete mt DNA sequence of Indian specific lineages. BMC Evolutionary Biology 5, 26 (2005)
Ruiz-Pesini, E., Lott, M.T., Procaccio, V., Poole, J., Brandon, M.C., Mishmar, D., Yi, C., Kreuziger, J., Baldi, P., Wallace, D.C.: An enhanced mitomap with a global mtdna mutational phylogeny. Nucleic Acids Research 35, D823–D828 (2007)
van Oven, M., Kayser, M.: Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Human Mutation 29, E386–E394 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Álvarez-Jarreta, J., Mayordomo, E., Ruiz-Pesini, E. (2012). PHYSER: An Algorithm to Detect Sequencing Errors from Phylogenetic Information. In: Rocha, M., Luscombe, N., Fdez-Riverola, F., Rodríguez, J. (eds) 6th International Conference on Practical Applications of Computational Biology & Bioinformatics. Advances in Intelligent and Soft Computing, vol 154. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28839-5_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-28839-5_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28838-8
Online ISBN: 978-3-642-28839-5
eBook Packages: EngineeringEngineering (R0)