poster

De novo assembly of ultra-deep sequencing data

Authors:
Hamid Mirebrahim (University of California, Riverside, CA),
Timothy Close (University of California, Riverside, CA),
Stefano Lonardi (University of California, Riverside, CA)

BCB '14: Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, September 2014, Page 609. https://doi.org/10.1145/2649387.2660799

Published: 20 September 2014

ABSTRACT

Life scientists and bioinformaticians have struggled with an insufficient amount of sequencing data since the beginning of Sanger sequencing in the seventies. As a consequence, most of the de novo assembly methods proposed so far are designed to deal with low-coverage sequencing and unbalanced depth of coverage. The situation is now about to change. The cost of sequencing has decreased so much that it is worth considering the possibility of having "as much sequencing data as we want". Once sequencing is so cheap that scientists can choose their desired depth of coverage without worrying about cost, the following question arises: assuming today's sequencing error rate, does higher depth of coverage necessarily lead to a better-quality assembly? In this study, we demonstrate for the first time that current state-of-the-art assemblers are unable to handle ultra-deep (i.e., 1,000-10,000x) depth of coverage. We then propose a new method to build high-quality assemblies from ultra-deep sequencing data. Our approach is based on "data slicing": we split a large dataset into "slices", then assemble each slice individually using an off-the-shelf assembler. Our tool then optimally merges the individual assemblies. Experimental results show that our method can significantly improve the quality of the assemblies, compared to the assemblies of the individual slices.
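
To make the "data slicing" idea concrete, the following is a minimal sketch of the slicing step only. It is not the authors' tool. The abstract does not say how reads are assigned to slices or how the assemblies are merged, so the shuffle-and-deal strategy, the function names `depth_of_coverage` and `slice_reads`, and the `target_coverage` parameter are all assumptions made for illustration. Each resulting slice would then be passed to an off-the-shelf assembler, and the merging step is not shown.

```python
import random


def depth_of_coverage(num_reads, read_length, genome_size):
    """Average depth of coverage: total sequenced bases divided by genome size."""
    return num_reads * read_length / genome_size


def slice_reads(reads, genome_size, target_coverage, seed=0):
    """Partition reads into disjoint slices of roughly target_coverage each.

    Assumption (not from the abstract): reads are shuffled once and then
    dealt round-robin, so every slice samples the genome roughly uniformly.
    """
    total_bases = sum(len(read) for read in reads)
    total_coverage = total_bases / genome_size
    # Number of slices needed so each one sits near the target coverage.
    num_slices = max(1, round(total_coverage / target_coverage))
    shuffled = list(reads)
    random.Random(seed).shuffle(shuffled)
    return [shuffled[i::num_slices] for i in range(num_slices)]


if __name__ == "__main__":
    # Toy input: 10,000 reads of 100 bases over a 1,000-base "genome".
    toy_reads = ["ACGT" * 25 for _ in range(10_000)]
    print(depth_of_coverage(len(toy_reads), 100, 1_000))
    slices = slice_reads(toy_reads, genome_size=1_000, target_coverage=100)
    print(len(slices), [len(s) for s in slices])
```

Dealing shuffled reads round-robin keeps the slices disjoint and nearly equal in size. Other choices, such as sampling with replacement or overlapping slices, would be equally consistent with the abstract.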


Published in
BCB '14: Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
September 2014
851 pages
ISBN:9781450328944
DOI:10.1145/2649387
General Chairs: Pierre Baldi (University of California, Irvine), Wei Wang (University of California, Los Angeles)
Copyright © 2014 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Publisher
Association for Computing Machinery
New York, NY, United States
Acceptance Rates
Overall Acceptance Rate: 254 of 885 submissions, 29%