skip to main content
10.1145/2435264.2435329acmconferencesArticle/Chapter ViewAbstractPublication PagesfpgaConference Proceedingsconference-collections
poster

Acceleration of the long read mapping on a PC-FPGA architecture (abstract only)

Published: 11 February 2013 Publication History

Abstract

The genome sequence alignment, whereby ultra scale of sequence reads should be compared to an enormous long reference, has been one central challenge to the biologists for a long period. For recent years, new sequencing technology makes it possible to generate longer reads (sequences of genome fragments) which seem more valuable for the life science research. It has been foreseen that long genome reads (length longer than 200 base pairs) will dominate the field in the near future. Unfortunately, most of the state-of-art aligners nowadays are optimized and only applicable for the short read mapping while present long read aligners are still not satisfying at the aspect of speed. In this paper, we propose a novel PC-FPGA hybrid system to improve the performance of the long read mapping. The BWA-SW algorithm is chosen as the alignment approach and by accelerating the bottleneck of the algorithm, our solution could archive a significant improvement in term of speed. Experiments demonstrate that the described system is as accurate as the BWA-SW aligner and about 1.41-2.73 times faster than it for reads with lengths ranging from 500bp to 2000bp.

References

[1]
Wu, T.D. and C.K. Watanabe, GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics, 2005. 21(9): p. 1859--1875.
[2]
Lin, H., Z. Zhang, M.Q. Zhang, et al., ZOOM! Zillions of oligos mapped. Bioinformatics, 2008. 24(21): p. 2431--7.
[3]
Li, R., Y. Li, K. Kristiansen, et al., SOAP: short oligonucleotide alignment program. Bioinformatics, 2008. 24(5): p. 713--4.
[4]
Li, H. and R. Durbin, Fast and accurate short read alingment with Burrows-Wheeler transform. Bioinformatics, 2009. 25(14): p. 1754--1760.
[5]
Li, H. and N. Homer, A survey of sequence alignment algorithms for next-generation sequencing. Bioinformatics, 2010. 11(5): p. 473--483.
[6]
Misra, S., A. Agrawal, W.K. Liao, et al., Anatomy of a hash-based long read sequence mapping algorithm for next generation DNA sequencing. Bioinformatics, 2011. 27(2): p. 189--195.
[7]
Olson, C.B., M. Kim, C. Clauson, et al. Hardware Acceleration of Short Read Mapping. in 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines. 2012.
[8]
Altschul, S.F., W. Gish, W. Miller, et al., Basic Local Alignment Search Tool. Molecular Biology, 1990. 215(3): p. 403 -- 410.
[9]
CoX, A., ELAND: Efficient Local Alignment of Nucleotide Data. (Unpublished), 2007.
[10]
Li, H. and R. Durbin. MAQ. Available from: http://maq.sourceforge.net/.
[11]
Homer, N., B. Merriman, and S.F. Nelson, BFAST: An Alignment Tool for Large Scale Genome Resequencing. PLoS ONE, 2009. 4(11): p. e7767.
[12]
Kurtz, S., A. Phillippy, A.L. Delcher, et al., Versatile and open software for comparing large genomes. Genome Biology, 2004. 5(2): p. R12.
[13]
Langmead, B., C. Trapnell, M. Pop, et al., Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology, 2009. 10(3): p. 1--10.
[14]
Kent, W.J., BLAT--The BLAST-like Alignment Tool. Genome Research, 2002. 12(4): p. 656--664.
[15]
Smith, A.D., Z. Xuan, and M.Q. Zhang, Using quality scores and longer reads improves accuracy of Solexa read mapping. BMC bioinformatics, 2008. 9(128): p. 1471--2105.
[16]
Rumble, S.M., P. Lacroute, A.V. Dalca, et al., SHRiMP: Accurate Mapping of Short Color-space Reads. PLoS Comput Biol, 2009. 5(5): p. e1000386.
[17]
Chen, Y., T. Souaiaia, and T. Chen, PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds. Bioinformatics, 2009. 25(19): p. 2514--2521.
[18]
Mosaik. Available from: http://bioinformatics.bc.edu/marthlab/Mosaik.
[19]
Li, H. and R. Durbin, Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics, 2010. 26(5): p. 589--595.
[20]
Ning, Z., A.J. Cox, and J.C. Mullikin, SSAHA: a fast search method for large DNA databases. Genome Research, 2001. 11(10): p. 1725--9.
[21]
Kehr, B., D. Weese, and K. Reinert, STELLAR: fast and exact local alignments. BMC bioinformatics, 2011. 12(Suppl 9): p. S15.
[22]
Schatz, M.C., C. Trapnell, A.L. Delcher, et al., High-throughput sequence alignment using Graphics Processing Units. BMC Bioinformatics, 2007(8): p. 474.
[23]
Trapnell, C. and M.C. Schatz, Optimizing data intensive GPGPU computations for DNA sequence alignment. Parallel Computing 2009. 35(8--9): p. 429--440.
[24]
Liu, Y., B. Schmidt, and D. Maskell, CUDASW++2.0: enhanced Smith-Waterman protein database search on CUDA-enabled GPUs based on SIMT and virtualized SIMD abstractions. BMC Res Notes, 2010(3): p. 93.
[25]
Schatz, M.C., CloudBurst: highly sensitive read mapping with MapReduce. Bioinformatics, 2009. 25(11): p. 1363--1369
[26]
Pireddu, L., S. Leo, and G. Zanetti, SEAL: a distributed short read mapping and duplicate removal tool. Bioinformatics, 2011. 27(15): p. 2159--2160.
[27]
Luethy, R. and C. Hoover, Hardware and software systems for accelerating common bioinformatics sequence analysis algorithms. BIOSILICO, 2004. 2(1): p. 12--17.
[28]
Knodel, O., T.B. Preusser, and R.G. Spallek. Next-generation massively parallel short-read mapping on FPGAs. in 2011 IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP). 2011.
[29]
Preusser, T.B., O. Knodel, and R.G. Spallek. Short-Read Mapping by a Systolic Custom FPGA Computation. in 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines. 2012.
[30]
Tang, W., W. Wang, B. Duan, et al. Accelerating Millions of Short Reads Mapping on a Heterogeneous Architecture with FPGA Accelerator. in 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines. 2012.
[31]
Chen, Y., B. Schmidt, and D.L. Maskell. Accelerating short read mapping on an FPGA (abstract only). in Proceedings of the ACM/SIGDA international symposium on Field Programmable Gate Arrays. 2012. Monterey, California, USA: ACM.

Cited By

View all
  • (2014)Accelerating the next generation long read mapping with the FPGA-based systemIEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)10.1109/TCBB.2014.232687611:5(840-852)Online publication date: 1-Sep-2014
  • (2013)A FPGA-Based High Performance Acceleration Platform for the Next Generation Long Read Mapping2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing10.1109/HPCC.and.EUC.2013.52(308-315)Online publication date: Nov-2013
  • (2013)Hardware acceleration for the banded Smith-Waterman algorithm with the cycled systolic array2013 International Conference on Field-Programmable Technology (FPT)10.1109/FPT.2013.6718421(480-481)Online publication date: Dec-2013

Index Terms

  1. Acceleration of the long read mapping on a PC-FPGA architecture (abstract only)

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    FPGA '13: Proceedings of the ACM/SIGDA international symposium on Field programmable gate arrays
    February 2013
    294 pages
    ISBN:9781450318877
    DOI:10.1145/2435264

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 February 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. BWA-SW
    2. FPGA
    3. hardware acceleration
    4. long read mapping
    5. sequence alignment

    Qualifiers

    • Poster

    Conference

    FPGA '13
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 125 of 627 submissions, 20%

    Upcoming Conference

    FPGA '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2014)Accelerating the next generation long read mapping with the FPGA-based systemIEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)10.1109/TCBB.2014.232687611:5(840-852)Online publication date: 1-Sep-2014
    • (2013)A FPGA-Based High Performance Acceleration Platform for the Next Generation Long Read Mapping2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing10.1109/HPCC.and.EUC.2013.52(308-315)Online publication date: Nov-2013
    • (2013)Hardware acceleration for the banded Smith-Waterman algorithm with the cycled systolic array2013 International Conference on Field-Programmable Technology (FPT)10.1109/FPT.2013.6718421(480-481)Online publication date: Dec-2013

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media