Comparison of the two up-to-date sequencing technologies for genome assembly: HiFi reads of Pacific Biosciences Sequel II system and ultralong reads of Oxford Nanopore

This article has been Reviewed by the following groups

Read the full article

Abstract

Background

The availability of reference genomes has revolutionized the study of biology. Multiple competing technologies have been developed to improve the quality and robustness of genome assemblies during the past decade. The 2 widely used long-read sequencing providers—Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT)—have recently updated their platforms: PacBio enables high-throughput HiFi reads with base-level resolution of >99%, and ONT generated reads as long as 2 Mb. We applied the 2 up-to-date platforms to a single rice individual and then compared the 2 assemblies to investigate the advantages and limitations of each.

Results

The results showed that ONT ultralong reads delivered higher contiguity, producing a total of 18 contigs of which 10 were assembled into a single chromosome compared to 394 contigs and 3 chromosome-level contigs for the PacBio assembly. The ONT ultralong reads also prevented assembly errors caused by long repetitive regions, for which we observed a total of 44 genes of false redundancies and 10 genes of false losses in the PacBio assembly, leading to over- or underestimation of the gene families in those long repetitive regions. We also noted that the PacBio HiFi reads generated assemblies with considerably fewer errors at the level of single nucleotides and small insertions and deletions than those of the ONT assembly, which generated an average 1.06 errors per kb and finally engendered 1,475 incorrect gene annotations via altered or truncated protein predictions.

Conclusions

It shows that both PacBio HiFi reads and ONT ultralong reads had their own merits. Further genome reference constructions could leverage both techniques to lessen the impact of assembly errors and subsequent annotation mistakes rooted in each.

Article activity feed

  1. Now published in GigaScience doi: 10.1093/gigascience/giaa123

    Dandan Lang 1GrandOmics Biosciences, Beijing, 102200, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteORCID record for Dandan LangShilai Zhang 2State Key laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Research Center for Perennial Rice Engineering and Technology of Yunnan, School of Agriculture, Yunnan University, Kunming, Yunnan, 650091, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this sitePingping Ren 1GrandOmics Biosciences, Beijing, 102200, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteFan Liang 1GrandOmics Biosciences, Beijing, 102200, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteZongyi Sun 1GrandOmics Biosciences, Beijing, 102200, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteGuanliang Meng 1GrandOmics Biosciences, Beijing, 102200, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteORCID record for Guanliang MengYuntao Tan 1GrandOmics Biosciences, Beijing, 102200, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteJiang Hu 1GrandOmics Biosciences, Beijing, 102200, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteXiaokang Li Find this author on Google ScholarFind this author on PubMedSearch for this author on this siteQihua Lai Find this author on Google ScholarFind this author on PubMedSearch for this author on this siteLingling Han 1GrandOmics Biosciences, Beijing, 102200, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteDepeng Wang 1GrandOmics Biosciences, Beijing, 102200, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteFengyi Hu 2State Key laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Research Center for Perennial Rice Engineering and Technology of Yunnan, School of Agriculture, Yunnan University, Kunming, Yunnan, 650091, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteWen Wang 3State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, 650223 Kunming, Yunnan, China4Center for Ecological and Environmental Sciences, Key Laboratory for Space Bioscience & Biotechnology, Northwestern Polytechnical University, 710072 Xi’an, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteFor correspondence: liushanlin@grandomics.com wwang@mail.kiz.ac.cnShanlin Liu 1GrandOmics Biosciences, Beijing, 102200, China5Department of Entomology, College of Plant Protection, China Agricultural University, 100193 Beijing, ChinaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteORCID record for Shanlin LiuFor correspondence: liushanlin@grandomics.com wwang@mail.kiz.ac.cn

    A version of this preprint has been published in the Open Access journal GigaScience (see paper https://doi.org/10.1093/gigascience/giaa123 ), where the paper and peer reviews are published openly under a CC-BY 4.0 license.

    These peer reviews were as follows:

    Reviewer 1: http://dx.doi.org/10.5524/REVIEW.102552 Reviewer 2: http://dx.doi.org/10.5524/REVIEW.102553 Reviewer 3: http://dx.doi.org/10.5524/REVIEW.102554