Rxivist logo

The architecture of SARS-CoV-2 transcriptome

By Dongwan Kim, Joo-Yeon Lee, Jeong-Sun Yang, Jun Won Kim, V. Narry Kim, Hyeshik Chang

Posted 14 Mar 2020
bioRxiv DOI: 10.1101/2020.03.12.988865 (published DOI: 10.1016/j.cell.2020.04.011)

SARS-CoV-2 is a betacoronavirus that is responsible for the COVID-19 pandemic. The genome of SARS-CoV-2 was reported recently, but its transcriptomic architecture is unknown. Utilizing two complementary sequencing techniques, we here present a high-resolution map of the SARS-CoV-2 transcriptome and epitranscriptome. DNA nanoball sequencing shows that the transcriptome is highly complex owing to numerous recombination events, both canonical and noncanonical. In addition to the genomic RNA and subgenomic RNAs common in all coronaviruses, SARS-CoV-2 produces a large number of transcripts encoding unknown ORFs with fusion, deletion, and/or frameshift. Using nanopore direct RNA sequencing, we further find at least 41 RNA modification sites on viral transcripts, with the most frequent motif being AAGAA. Modified RNAs have shorter poly(A) tails than unmodified RNAs, suggesting a link between the internal modification and the 3′ tail. Functional investigation of the unknown ORFs and RNA modifications discovered in this study will open new directions to our understanding of the life cycle and pathogenicity of SARS-CoV-2. Highlights

Download data

  • Downloaded 13,982 times
  • Download rankings, all-time:
    • Site-wide: 144 out of 92,253
    • In genomics: 24 out of 5,819
  • Year to date:
    • Site-wide: 67 out of 92,253
  • Since beginning of last month:
    • Site-wide: 525 out of 92,253

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)