DOI: 10.1145/2554688.2554694
poster

Big data genome sequencing on Zynq based clusters (abstract only)

Published: 26 February 2014

Abstract

Next-generation sequencing (NGS) problems have attracted considerable attention from researchers in the biological and medical computing domains. Current state-of-the-art NGS computing machines are dramatically lowering the cost and increasing the throughput of DNA sequencing. In this paper, we present a practical study that uses a Xilinx Zynq board to combine FPGA accelerators and ARM processors into acceleration engines for state-of-the-art short-read mapping approaches. The heterogeneous processors and accelerators are coupled with each other through a general Hadoop distributed processing framework. The reads are first collected by a central server and then distributed to multiple accelerators on the Zynq for hardware acceleration. The combination of hardware acceleration and the MapReduce execution flow can thereby greatly accelerate the task of aligning short reads to a known reference genome. Our approach is based on preprocessing the reference genome and running iterative jobs that align the continuously incoming reads. The hardware acceleration is based on the well-regarded RMAP read-mapping software. Furthermore, we analyze the speedup on a Hadoop cluster comprising 8 development boards. Experimental results demonstrate that the proposed architecture and methods achieve a speedup of more than 112X and scale with the number of accelerators. Finally, the Zynq-based cluster has the potential to efficiently accelerate general large-scale big data applications as well.
This work was supported by the NSFC grants No. 61379040, No. 61272131 and No. 61202053.
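
The flow described in the abstract (preprocess the reference, distribute reads to workers, align each read, collect the results) can be illustrated with a small software-only sketch. This is not the authors' implementation and not the RMAP code: it assumes a generic seed-and-verify scheme with a k-mer index over the reference and a Hamming-distance check, and it runs the map and reduce steps in a single process rather than on Hadoop or FPGA hardware. All function names and parameters (`build_seed_index`, `map_read`, `reduce_hits`, `align_reads`, `k`, `max_mismatches`) are hypothetical.

```python
from collections import defaultdict


def build_seed_index(reference, k):
    """Preprocessing step: record every position of each k-mer in the reference."""
    index = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        index[reference[pos:pos + k]].append(pos)
    return index


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def map_read(read_id, read, reference, index, k, max_mismatches):
    """Map step (assumed): emit (read_id, (mismatches, start)) for each valid hit."""
    candidates = set()
    # Non-overlapping seeds: an exact seed match proposes a candidate start.
    for offset in range(0, len(read) - k + 1, k):
        for pos in index.get(read[offset:offset + k], ()):
            start = pos - offset
            if 0 <= start <= len(reference) - len(read):
                candidates.add(start)
    # Verify each candidate against the full read.
    for start in sorted(candidates):
        d = hamming(read, reference[start:start + len(read)])
        if d <= max_mismatches:
            yield read_id, (d, start)


def reduce_hits(pairs):
    """Reduce step (assumed): keep the hit with the fewest mismatches per read."""
    best = {}
    for read_id, hit in pairs:
        if read_id not in best or hit < best[read_id]:
            best[read_id] = hit
    return best


def align_reads(reads, reference, k=4, max_mismatches=1):
    """Run the whole sketch in one process; a cluster would shard `reads`."""
    index = build_seed_index(reference, k)
    pairs = []
    for read_id, read in reads.items():
        pairs.extend(map_read(read_id, read, reference, index, k, max_mismatches))
    return reduce_hits(pairs)
```

In this sketch a read is guaranteed to be found only if it contains at least `max_mismatches + 1` non-overlapping seeds, so that one seed must match exactly. Because each read is mapped independently, the map step is the part that could in principle be spread across nodes or offloaded to accelerators, which is the property the abstract relies on for scaling.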

Cited By

  • (2018) Distributed gene clinical decision support system based on cloud computing. BMC Medical Genomics 11:S5. DOI: 10.1186/s12920-018-0415-1. Online publication date: 20-Nov-2018
  • (2017) Distributed gene clinical decision support system based on cloud computing. 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 485-490. DOI: 10.1109/BIBM.2017.8217695. Online publication date: Nov-2017
  • (2014) Towards energy awareness in Hadoop. Proceedings of the Fourth International Workshop on Network-Aware Data Management, pp. 16-22. DOI: 10.5555/2688394.2688397. Online publication date: 16-Nov-2014

    Published In

    FPGA '14: Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays
    February 2014
    272 pages
    ISBN:9781450326711
    DOI:10.1145/2554688
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. bioinformatics
    2. fpga
    3. genome sequencing
    4. hardware acceleration
    5. rmap

    Qualifiers

    • Poster

    Conference

    FPGA'14
    Acceptance Rates

    FPGA '14 Paper Acceptance Rate 30 of 110 submissions, 27%;
    Overall Acceptance Rate 125 of 627 submissions, 20%
