Rxivist logo

Cumulus: a cloud-based data analysis framework for large-scale single-cell and single-nucleus RNA-seq

By Bo Li, Joshua Gould, Yiming Yang, Siranush Sarkizova, Marcin Tabaka, Orr Ashenberg, Yanay Rosen, Michal Slyper, Monika S. Kowalczyk, Alexandra-Chloe Villani, Timothy Tickle, Nir Hacohen, Orit Rozenblatt-Rosen, Aviv Regev

Posted 30 Oct 2019
bioRxiv DOI: 10.1101/823682

Massively parallel single-cell and single-nucleus RNA-seq (sc/snRNA-seq) have opened the way to systematic tissue atlases in health and disease, but as the scale of data generation is growing, so does the need for computational pipelines for scaled analysis. Here, we developed Cumulus, a cloud-based framework for analyzing large scale sc/snRNA-seq datasets. Cumulus combines the power of cloud computing with improvements in algorithm implementations to achieve high scalability, low cost, user-friendliness, and integrated support for a comprehensive set of features. We benchmark Cumulus on the Human Cell Atlas Census of Immune Cells dataset of bone marrow cells and show that it substantially improves efficiency over conventional frameworks, while maintaining or improving the quality of results, enabling large-scale studies.

Download data

  • Downloaded 1,793 times
  • Download rankings, all-time:
    • Site-wide: 8,803
    • In bioinformatics: 1,069
  • Year to date:
    • Site-wide: 17,507
  • Since beginning of last month:
    • Site-wide: 17,178

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)