
Characterization of SARS-CoV-2 viral diversity within and across hosts

By Palash Sashittal, Yunan Luo, Jian Peng, Mohammed El-Kebir

Posted 13 May 2020
bioRxiv DOI: 10.1101/2020.05.07.083410

In light of the current COVID-19 pandemic, there is an urgent need to accurately infer the evolutionary and transmission history of the virus to inform real-time outbreak management, public health policies and mitigation strategies. Current phylogenetic and phylodynamic approaches typically use consensus sequences, essentially assuming the presence of a single viral strain per host. Here, we analyze 621 bulk RNA sequencing samples and 7,540 consensus sequences from COVID-19 patients, and identify multiple strains of the virus, SARS-CoV-2, in four major clades that are prevalent within and across hosts. In particular, we find evidence for (i) within-host diversity across phylogenetic clades, (ii) putative cases of recombination, multi-strain and/or superinfections as well as (iii) distinct strain profiles across geographical locations and time. Our findings and algorithms will facilitate more detailed evolutionary analyses and contact tracing that specifically account for within-host viral diversity in the ongoing COVID-19 pandemic as well as future pandemics.

Competing Interest Statement

The authors have declared no competing interest.

Download data

  • Downloaded 1,853 times
  • Download rankings, all-time:
    • Site-wide: 4,961 out of 103,749
    • In bioinformatics: 877 out of 9,474
  • Year to date:
    • Site-wide: 1,128 out of 103,749
  • Since beginning of last month:
    • Site-wide: 7,773 out of 103,749
