A fully phased accurate assembly of an individual human genome
Peter A. Audano,
Mitchell R. Vollger,
William T. Harvey,
Katherine M. Munson,
Human Genome Structural Variation Consortium,
Peter M. Lansdorp,
Scott E. Devine,
Ashley D. Sanders,
Mark J.P. Chaisson,
Jan O. Korbel,
E. E. Eichler,
Posted 26 Nov 2019
bioRxiv DOI: 10.1101/855049
Posted 26 Nov 2019
The prevailing genome assembly paradigm is to produce consensus sequences that "collapse" parental haplotypes into a consensus sequence. Here, we leverage the chromosome-wide phasing and scaffolding capabilities of single-cell strand sequencing (Strand-seq) and combine them with high-fidelity (HiFi) long sequencing reads, in a novel reference-free workflow for diploid de novo genome assembly. Employing this strategy, we produce completely phased de novo genome assemblies separately for each haplotype of a single individual of Puerto Rican origin (HG00733) in the absence of parental data. The assemblies are accurate (QV > 40), highly contiguous (contig N50 > 25 Mbp) with low switch error rates (0.4%) providing fully phased single-nucleotide variants (SNVs), indels, and structural variants (SVs). A comparison of Oxford Nanopore and PacBio phased assemblies identifies 150 regions that are preferential sites of contig breaks irrespective of sequencing technology or phasing algorithms.
- Downloaded 1,932 times
- Download rankings, all-time:
- Site-wide: 4,098 out of 94,912
- In bioinformatics: 734 out of 8,837
- Year to date:
- Site-wide: 2,023 out of 94,912
- Since beginning of last month:
- Site-wide: 2,713 out of 94,912
Downloads over time
Distribution of downloads per paper, site-wide
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!