Rxivist logo

DiMSum: an error model and pipeline for analyzing deep mutational scanning data and diagnosing common experimental pathologies

By Andre Faure, Jörn M. Schmiedel, Pablo Baeza-Centurion, Ben Lehner

Posted 26 Jun 2020
bioRxiv DOI: 10.1101/2020.06.25.171421 (published DOI: 10.1186/s13059-020-02091-3)

Deep mutational scanning (DMS) enables multiplexed measurement of the effects of thousands of variants of proteins, RNAs and regulatory elements. Here, we present a customizable pipeline - DiMSum - that represents an end-to-end solution for obtaining variant fitness and error estimates from raw sequencing data. A key innovation of DiMSum is the use of an interpretable error model that captures the main sources of variability arising in DMS workflows, outperforming previous methods. DiMSum is available as an R/Bioconda package and provides summary reports to help researchers diagnose common DMS pathologies and take remedial steps in their analyses. ### Competing Interest Statement The authors have declared no competing interest.

Download data

  • Downloaded 326 times
  • Download rankings, all-time:
    • Site-wide: 58,886 out of 106,159
    • In bioinformatics: 6,469 out of 9,474
  • Year to date:
    • Site-wide: 16,396 out of 106,159
  • Since beginning of last month:
    • Site-wide: 43,046 out of 106,159

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)