Skip to main content

RB-Finder: An Improved Distance-Based Sliding Window Method to Detect Recombination Breakpoints

  • Conference paper
Research in Computational Molecular Biology (RECOMB 2007)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 4453))

Abstract

Recombination detection is important before inferring phylogenetic relationships. This will eventually lead to a better understanding of pathogen evolution, more accurate genotyping and advancements in vaccine development. In this paper, we introduce RB-Finder, a fast and accurate distance-based window method to detect recombination in a multiple sequence alignment. Our method introduces a more informative distance measure and a novel weighting strategy to reduce the window size sensitivity problem and hence improve the accuracy of breakpoint detection. Furthermore, our method is faster than existing phylogeny-based methods since we do not need to construct and compare complex phylogenetic trees. When compared with the current best method Pruned-PDM, we are about a few hundred times more efficient. Experimental evaluation of RB-Finder using synthetic and biological datasets showed that our method is more accurate than existing phylogeny-based methods. We also show how our method has potential use in other related applications such as genotyping.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Posada, D.: Evaluation of methods for detecting recombination from data sequences: Empirical data. Molecular Biology and Evolution 19, 1198–1212 (2002)

    Google Scholar 

  2. Weiller, G.F.: Phylogenetic profiles: a graphical method for detecting genetic recombination in homologous sequences. Molecular Biology and Evolution 15, 326–335 (1998)

    Google Scholar 

  3. Myers, S.R., Griffiths, R.C.: Bounds on the minimum number of recombination events in a sample history. Genetics 163, 375–394 (2003)

    Google Scholar 

  4. Schierup, M.H., Hein, J.: Consequences of recombination on traditional phylogenetic analysis. Genetics 156, 879–891 (2000)

    Google Scholar 

  5. McGuire, G., Wright, F.: TOPAL 2.0: improved detection of mosaic sequences within multiple alignments. Bioinformatics 16, 130–134 (2000)

    Article  Google Scholar 

  6. Husmeier, D., Wright, F.: Probabilistic divergence measures for detecting interspecies recombination. Bioinformatics 17, 123–131 (2001)

    Google Scholar 

  7. Husmeier, D., Wright, F., Milne, I.: Detecting interspecific recombination with a pruned probabilistic divergence measure. Bioinformatics 21, 1797–1806 (2005)

    Article  Google Scholar 

  8. Ruths, D., Nakhleh, L.: RECOMP: A parsimony-based method for detecting recombination. In: Proceedings of The Fourth Asia-Pacific Bioinformatics Conference, pp. 59–68 (2006)

    Google Scholar 

  9. Press, W.H., Flannery, B.P., Teukolsky, S.A., Vetterling, W.T.: Kolmogorov-Smirnov Test. In: Numerical Recipes in FORTRAN: The Art of Scientific Computing, 2nd edn., pp. 617–620 (1992)

    Google Scholar 

  10. Kimura, M.: A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. Journal of Molecular Evolution 16, 111–120 (1980)

    Article  Google Scholar 

  11. Thompson, J.D., Higgins, D.G., Gibson, T.J.: CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequencing weighting, position specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673–4680 (1994)

    Article  Google Scholar 

  12. Sturmer, M., Doerr, W., Preiser, W.: Variety of interpretation systems for human immunodeficiency virus type 1 genotyping: Confirmatory information or additional confusion? Current Drug Targets Infectious Disorder 3, 373–382 (2003)

    Article  Google Scholar 

  13. Leitner, T., Korber, B., Daniels, M., Calef, C., Foley, B.: HIV-1 Subtype and Circulating Recombinant Form (CRF) Reference Sequences (2005), http://www.hiv.lanl.gov/content/hiv-db/REVIEWS/RefSeqs2005/RefSeqs05.html

    Google Scholar 

  14. The HIV Sequence Database. http://www.hiv.lanl.gov/content/hiv-db/mainpage.html

  15. The Circulating Recombinant Forms. http://hiv-web.lanl.gov/content/hiv-db/CRFs/CRFs.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Terry Speed Haiyan Huang

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Lee, WH., Sung, WK. (2007). RB-Finder: An Improved Distance-Based Sliding Window Method to Detect Recombination Breakpoints. In: Speed, T., Huang, H. (eds) Research in Computational Molecular Biology. RECOMB 2007. Lecture Notes in Computer Science(), vol 4453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71681-5_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-71681-5_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71680-8

  • Online ISBN: 978-3-540-71681-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics