skip to main content
10.1145/3545839.3545855acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicomsConference Proceedingsconference-collections
research-article

A Comparative Study of HiCanu and Hifiasm

Published: 10 September 2022 Publication History

Abstract

The recent development of Hifi sequencing has greatly improved people's understanding of genomics. Hifi reads provide a more accurate and complete picture than traditional long reads and Illumina short reads. However, both long reads and short reads assemblers are not good fits for Hifi reads in reality. Therefore, in late 2020, HiCanu and Hifiasm have been developed to assemble Hifi reads. Even though they are both phased assemblers, which highly complexed regions will be separated into two different alleles, they show different output formats, algorithms and performance. The topic of this paper will be focused on comparison between HiCanu and Hifiasm on contiguity, completeness, and runtime. In order to compare the two tools, it is necessary to examine HiCanu and Hifiasm results of different genome assemblies from several published papers. Despite some shortcomings of Hifiasm assembler which is associated with increased coverage, Hifiasm is the best assembler for Hifi reads so far because of its high contiguity, completeness and fast runtime.

References

[1]
Wenger, A.M., Peluso, P., Rowell, W.J. Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome. Nat Biotechnol 37, 1155–1162 (2019). https://doi.org/10.1038/s41587-019-0217-9
[2]
Hon, T., Mars, K., Young, G. Highly accurate long-read HiFi sequencing data for five complex genomes. Sci Data 7, 399 (2020). https://doi.org/10.1038/s41597-020-00743-4
[3]
PacBio. “Hifi Reads - Highly Accurate Long-Read Sequencing.” PacBio, 11 July 2021, www.pacb.com/smrt-science/smrt-sequencing/hifi-reads-for-highly-accurate-long-read-sequencing/
[4]
Li, Z., “Comparison of the Two Major Classes of Assembly Algorithms: Overlap-Layout-Consensus and De-Bruijn-Graph.” Briefings in Functional Genomics, vol. 11, no. 1, 2011, pp. 25–37., https://doi.org/10.1093/bfgp/elr035
[5]
Anton Bankevich, Sergey Nurk, Dmitry Antipov, Alexey A. Gurevich, Mikhail Dvorkin, Alexander S. Kulikov, Valery M. Lesin, Sergey I. Nikolenko, Son Pham, Andrey D. Prjibelski, Alexey V. Pyshkin, Alexander V. Sirotkin, Nikolay Vyahhi, Glenn Tesler, Max A. Alekseyev, and Pavel A. Pevzner.Journal of Computational Biology.May 2012.455-477.https://doi.org/10.1089/cmb.2012.0021
[6]
Prjibelski, A., Antipov, D., Meleshko, D., Lapidus, A., & Korobeynikov, A. (2020). Using SPAdes de novo assembler. Current Protocols in Bioinformatics, 70, e102.
[7]
Jang-il Sohn, Jin-Wu Nam, The present and future of de novo whole-genome assembly, Briefings in Bioinformatics, Volume 19, Issue 1, January 2018, Pages 23–40, https://doi.org/10.1093/bib/bbw096
[8]
Heng Li, Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences, Bioinformatics, Volume 32, Issue 14, 15 July 2016, Pages 2103–2110, https://doi.org/10.1093/bioinformatics/btw152
[9]
 Marx, Vivien. “Long Road to Long-Read Assembly.” Nature Methods, vol. 18, no. 2, 1 Feb. 2021, pp. 125–129., https://doi.org/10.1038/s41592-021-01057-y
[10]
Koren, Sergey, “Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.” Genome research 27.5 (2017): 722-736.
[11]
Nurk, Sergey, "HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads." Genome research 30.9 (2020): 1291-1305.
[12]
Cheng, H., Concepcion, G.T., Feng, X. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods 18, 170–175 (2021). https://doi.org/10.1038/s41592-020-01056-5
[13]
Chin, Chen-Shan, “Nonhybrid, Finished Microbial Genome Assemblies from Long-Read SMRT Sequencing Data.” Nature Methods, vol. 10, no. 6, 2013, pp. 563–569., https://doi.org/10.1038/nmeth.2474.
[14]
Chin, Chen-Shan, “Phased Diploid Genome Assembly with Single-Molecule Real-Time Sequencing.” Nature Methods, vol. 13, no. 12, 2016, pp. 1050–1054., https://doi.org/10.1038/nmeth.4035.
[15]
Koren, S., Rhie, A., Walenz, B. De novo assembly of haplotype-resolved genomes with trio binning. Nat Biotechnol 36, 1174–1182 (2018). https://doi.org/10.1038/nbt.4277
[16]
Hiatt, Susan M., “Long-Read Genome Sequencing for the Diagnosis of Neurodevelopmental Disorders.” BioRxiv, 2020, https: //doi.org/10.1101/2020.07.02.185447.
[17]
Feng, Xiaowen, “Metagenome Assembly of High-Fidelity Long Reads with Hifiasm-Meta.” Genomics (q-Bio.GN), 16 Oct. 2021, https: //doi.org/arXiv:2110.08457.
[18]
Thrash, A., Hoffmann, F. & Perkins, A. Toward a more holistic method of genome assembly assessment. BMC Bioinformatics 21, 249 (2020). https: //doi.org/10.1186/s12859-020-3382-4
[19]
Clément Schneider, Christian Woehle, Carola Greve, Cyrille A D'Haese, Magnus Wolf, Michael Hiller, Axel Janke, Miklós Bálint, Bruno Huettel, Two high-quality de novo genomes from single ethanol-preserved specimens of tiny metazoans (Collembola), GigaScience, Volume 10, Issue 5, May 2021, giab035, https://doi.org/10.1093/gigascience/giab035
[20]
Gavrielatos, M., Kyriakidis, K., Spandidos, D. A., Michalopoulos, I."Benchmarking of next and third generation sequencing technologies and their associated algorithms for de novo genome assembly". Molecular Medicine Reports 23.4 (2021): 251.
[21]
Qi, Weihong, “The Haplotype-Resolved Chromosome Pairs and Transcriptome of a Heterozygous Diploid African Cassava Cultivar.” 19 Nov. 2021, https: //doi.org/10.1101/2021.11.16.468774.
[22]
Patrick Driguez, Salim Bougouffa, Karen Carty, Alexander Putra, Kamel Jabbari, Muppala Reddy, Richard Soppe, Nicole Cheung, Yoshinori Fukasawa, Luca Ermini BioRxiv 2021.01.25.428044;
[23]
Dengfeng Guan, Shane A McCarthy, Jonathan Wood, Kerstin Howe, Yadong Wang, Richard Durbin, Identifying and removing haplotypic duplication in primary genome assemblies, Bioinformatics, Volume 36, Issue 9, 1 May 2020, Pages 2896–2898, https://doi.org/10.1093/bioinformatics/btaa025
[24]
Kolmogorov, M., Yuan, J., Lin, Y. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37, 540–546 (2019). https: //doi.org/10.1038/s41587-019-0072-8
[25]
Faure, Roland, Nadège Guiglielmoni, and Jean-François Flot. "GraphUnzip: unzipping assembly graphs with long reads and Hi-C." bioRxiv (2021).
[26]
Wang, P., Yu, J., Jin, S. Genetic basis of high aroma and stress tolerance in the oolong tea cultivar genome. Hortic Res 8, 107 (2021). https: //doi.org/10.1038/s41438-021-00542-x
[27]
Xie, Min, “GcaPDA: A Haplotype-Resolved Diploid Assembler.” 31 May 2021, https: //doi.org/10.1101/2021.05.31.446328.
[28]
Vollger, Mitchell R., "Improved assembly and variant detection of a haploid human genome using single‐molecule, high‐fidelity long reads." Annals of human genetics 84.2 (2020): 125-140.

Index Terms

  1. A Comparative Study of HiCanu and Hifiasm

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICoMS '22: Proceedings of the 2022 5th International Conference on Mathematics and Statistics
    June 2022
    137 pages
    ISBN:9781450396233
    DOI:10.1145/3545839
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 September 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. HiCanu
    2. Hifi sequencing
    3. Hifiasm
    4. de novo assembly

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICoMS 2022

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 435
      Total Downloads
    • Downloads (Last 12 months)149
    • Downloads (Last 6 weeks)20
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media