Abstract
SMART2 is an enhanced version of the SMART pipeline for mitogenome assembly from low-coverage whole-genome sequencing (WGS) data. Novel features include automatic selection of the optimal number of read pairs used for assembly and the ability to assemble multiple sequencing libraries when available. SMART2 succeeded in generating mitochondrial sequences for 26 metazoan species with WGS data but no previously published mitogenomes in NCBI databases. The SMART2 pipeline is publicly available via a user-friendly Galaxy interface at https://neo.engr.uconn.edu/?tool_id=SMART2.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Afgan, E., et al.: The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. 46(W1), W537–W544 (2018)
Al-Nakeeb, K., Petersen, T.N., Sicheritz-Pontén, T.: Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data. BMC Bioinform. 18(1), 510 (2017)
Alqahtani, F., Mandoiu, I.: Statistical mitogenome assembly with repeats. J. Comput. Biol. online ahead of print (2020). https://doi.org/10.1089/cmb.2019.0505
Alqahtani, F., Duckett, D., Pirro, S., Măndoiu, I.I.: Complete mitochondrial genome of water vole, Microtus richardsoni (2020). In preparation
Alqahtani, F., Măndoiu, I.I.: Statistical mitogenome assembly with repeats. In: 8th IEEE International Conference on Computational Advances in Bio and Medical Sciences (2018)
Alves-Silva, J., et al.: The ancestry of Brazilian mtDNA lineages. Am. J. Hum. Genet. 67(2), 444–461 (2000)
Antipov, D., Hartwick, N., Shen, M., Raiko, M., Lapidus, A., Pevzner, P.A.: plasmidSPAdes: assembling plasmids from whole genome sequencing data. Bioinformatics 32(22), 3380–3387 (2016)
Calabrese, C., et al.: MToolBox: a highly automated pipeline for heteroplasmy annotation and prioritization analysis of human mitochondrial variants in high-throughput sequencing. Bioinformatics 30(21), 3115–3117 (2014)
Cochrane, G., et al.: Evidence standards in experimental and inferential INSDC third party annotation data. OMICS J. Integr. Biol. 10(2), 105–113 (2006)
Darriba, D., Taboada, G.L., Doallo, R., Posada, D.: jModelTest 2: more models, new heuristics and parallel computing. Nat. Methods 9(8), 772 (2012)
Dierckxsens, N., Mardulyn, P., Smits, G.: NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45(4), e18–e18 (2016)
Gupta, A., Bhardwaj, A., Sharma, P., Pal, Y., et al.: Mitochondrial DNA-a tool for phylogenetic and biodiversity search in equines. J. Biodivers. Endangered Species 2015 (2015)
Hahn, C., Bachmann, L., Chevreux, B.: Reconstructing mitochondrial genomes directly from genomic next-generation sequencing reads–a baiting and iterative mapping approach. Nucleic Acids Res. 41(13), e129–e129 (2013)
Hebert, P.D., Ratnasingham, S., de Waard, J.R.: Barcoding animal life: cytochrome c oxidase subunit 1 divergences among closely related species. Proc. Roy. Soc. London Ser. B Biolog. Sci. 270(suppl\(\_1\)), S96–S99 (2003)
Katoh, K., Misawa, K., Kuma, K.I., Miyata, T.: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30(14), 3059–3066 (2002)
Kurabayashi, A., Sumida, M.: Afrobatrachian mitochondrial genomes: genome reorganization, gene rearrangement mechanisms, and evolutionary trends of duplicated and rearranged genes. BMC Genom. 14(1), 633 (2013)
Letunic, I., Bork, P.: Interactive tree of life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 47(W1), W256–W259 (2019)
Li, W.X., et al.: The complete mitochondrial dna of three monozoic tapeworms in the caryophyllidea: a mitogenomic perspective on the phylogeny of eucestodes. Parasites Vectors 10(1), 314 (2017)
Melton, T., Holland, C., Holland, M.: Forensic mitochondria DNA analysis: current practice and future potential. Forensic Sci. Rev. 24(2), 101 (2012)
Price, M.N., Dehal, P.S., Arkin, A.P.: Fasttree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol. Biol. Evol. 26(7), 1641–1650 (2009)
Ratnasingham, S., Hebert, P.D.: BOLD: The barcode of life data system (http://www.barcodinglife.org). Mol. Ecol. Notes 7(3), 355–364 (2007)
Scrucca, L., Fop, M., Murphy, T.B., Raftery, A.E.: mclust 5: clustering, classification and density estimation using Gaussian finite mixture models. R J. 8(1), 205–233 (2016)
Trevisan, B., Alcantara, D.M., Machado, D.J., Marques, F.P., Lahr, D.J.: Genome skimming is a low-cost and robust strategy to assemble complete mitochondrial genomes from ethanol preserved specimens in biodiversity studies. PeerJ 7, e7543 (2019)
Veltri, K.L., Espiritu, M., Singh, G.: Distinct genomic copy number in mitochondria of different mammalian organs. J. Cell. Physiol. 143(1), 160–164 (1990)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Alqahtani, F., Măndoiu, I. (2020). SMART2: Multi-library Statistical Mitogenome Assembly with Repeats. In: Măndoiu, I., Murali, T., Narasimhan, G., Rajasekaran, S., Skums, P., Zelikovsky, A. (eds) Computational Advances in Bio and Medical Sciences. ICCABS 2019. Lecture Notes in Computer Science(), vol 12029. Springer, Cham. https://doi.org/10.1007/978-3-030-46165-2_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-46165-2_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-46164-5
Online ISBN: 978-3-030-46165-2
eBook Packages: Computer ScienceComputer Science (R0)