Rxivist logo

Nanopore sequencing enables near-complete de novo assembly of Saccharomyces cerevisiae reference strain CEN.PK113-7D

By Alex N. Salazar, Arthur R Gorter de Vries, Marcel van den Broek, Melanie Wijsman, Pilar de la Torre Cort├ęs, Anja Brickwedde, Nick Brouwers, Jean-Marc Daran, Thomas Abeel

Posted 14 Aug 2017
bioRxiv DOI: 10.1101/175984 (published DOI: 10.1093/femsyr/fox074)

The haploid Saccharomyces cerevisiae strain CEN.PK113-7D is a popular model system for metabolic engineering and systems biology research. Current genome assemblies are based on short-read sequencing data scaffolded based on homology to strain S288C. However, these assemblies contain large sequence gaps, particularly in subtelomeric regions, and the assumption of perfect homology to S288C for scaffolding introduces bias. In this study, we obtained a near-complete genome assembly of CEN.PK113-7D using only Oxford Nanopore Technology's MinION sequencing platform. 15 of the 16 chromosomes, the mitochondrial genome, and the 2-micron plasmid are assembled in single contigs and all but one chromosome starts or ends in a telomere cap. This improved genome assembly contains 770 Kbp of added sequence containing 248 gene annotations in comparison to the previous assembly of CEN.PK113-7D. Many of these genes encode functions determining fitness in specific growth conditions and are therefore highly relevant for various industrial applications. Furthermore, we discovered a translocation between chromosomes III and VIII which caused misidentification of a MAL locus in the previous CEN.PK113-7D assembly. This study demonstrates the power of long-read sequencing by providing a high-quality reference assembly and annotation of CEN.PK113-7D and places a caveat on assumed genome stability of microorganisms.

Download data

  • Downloaded 1,221 times
  • Download rankings, all-time:
    • Site-wide: 10,042 out of 105,557
    • In genomics: 1,421 out of 6,382
  • Year to date:
    • Site-wide: 91,541 out of 105,557
  • Since beginning of last month:
    • Site-wide: 59,947 out of 105,557

Altmetric data


Downloads over time

Distribution of downloads per paper, site-wide


PanLingua

Sign up for the Rxivist weekly newsletter! (Click here for more details.)


News