Abstract
Traditional comparisons between metagenomes are often performed using reference databases as intermediary templates from which to obtain distance metrics. However, in order to fully exploit the potential of the information contained within metagenomes, it becomes of interest to remove any intermediate agent that is prone to introduce errors or biased results. In this work, we perform an analysis over the state of the art methods and deduce that it is necessary to employ fine-grained methods in order to assess similarity between metagenomes. In addition, we propose our developed method for accurate and fast matching of reads.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Considering coverage as the length of the alignment divided by the length of the query read.
- 2.
References
Maillet, N., et al.: Compareads: comparing huge metagenomic experiments. BMC Bioinform. 13(19), 1 (2012)
Benoit, G., et al.: Multiple Comparative Metagenomics using Multiset k-mer Counting. arXiv preprint arXiv:1604.02412 (2016)
Vinga, S., Almeida, J.: Alignment-free sequence comparisona review. Bioinformatics 19(4), 513–523 (2003)
Bonham-Carter, O., Steele, J., Bastola, D.: Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis. Brief. Bioinform. 15(6), 890–905 (2013). doi:10.1093/bib/bbt052
Lippert, R.A., Huang, H., Waterman, M.S.: Distributional regimes for the number of k-word matches between two random sequences. Proc. Natl. Acad. Sci. 99(22), 13980–13989 (2002)
Anne, C., et al.: A new statistical approach for assessing similarity of species composition with incidence and abundance data. Ecology Lett. 8(2), 148–159 (2005)
Ondov, B.D., et al.: Mash: fast genome and metagenome distance estimation using MinHash. BioRxiv (2016). doi:10.1101/029827
Yuriy, F., et al.: How independent are the appearances of n-mers in different genomes? Bioinformatics 20(15), 2421–2428 (2004)
Maillet, N., et al.: COMMET: comparing and combining multiple metagenomic datasets. In: 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE (2014)
Langmead, B., Salzberg, S.L.: Fast gapped-read alignment with Bowtie 2. Nature Methods 9(4), 357–359 (2012)
Altschul, S.F., et al.: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25(17), 3389–3402 (1997)
Arjona-Medina, J.A., Torreno, O., Chelbat, N., Trelles, O.: Experimental Study of Local Alignment Distributions in the Comparison of Large Genomic Sequences Soibio (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Pérez-Wohlfeil, E., Torreno, O., Trelles, O. (2017). Pairwise and Incremental Multi-stage Alignment of Metagenomes: A New Proposal. In: Rojas, I., Ortuño, F. (eds) Bioinformatics and Biomedical Engineering. IWBBIO 2017. Lecture Notes in Computer Science(), vol 10209. Springer, Cham. https://doi.org/10.1007/978-3-319-56154-7_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-56154-7_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-56153-0
Online ISBN: 978-3-319-56154-7
eBook Packages: Computer ScienceComputer Science (R0)