Rxivist logo

Estimating the rate of cell type degeneration from epigenetic sequencing of cell-free DNA

By Christa Caggiano, Barbara Celona, Fleur Garton, Joel Mefford, Brian Black, Catherine Lomen-Hoerth, Andrew Dahl, Noah Zaitlen

Posted 16 Jan 2020
bioRxiv DOI: 10.1101/2020.01.15.907022

Circulating cell-free DNA (cfDNA) in the bloodstream originates from dying cells and is a promising non-invasive biomarker for cell death. Here, we develop a method to accurately estimate the relative abundances of cell types contributing to cfDNA. We leverage the distinct DNA methylation profile of each cell type throughout the body. Decomposing the cfDNA mixture is difficult, as fragments from relevant cell types may only be present in a small amount. We propose an algorithm, CelFiE, that estimates cell type proportion from both whole-genome cfDNA input and reference data. CelFiE accommodates low coverage data, does not rely on CpG site curation, and estimates contributions from multiple unknown cell types that are not available in reference data. In simulations, we show that CelFiE can accurately estimate known and unknown cell type of origin of cfDNA mixtures in low coverage and noisy data. Simulations also demonstrate that we can effectively estimate cfDNA originating from rare cell types composing less than 0.01% of the total cfDNA. To validate CelFiE, we use a positive control: cfDNA extracted from pregnant and non-pregnant women. CelFiE estimates a large placenta component specifically in pregnant women (p=9.1 x 10E-5). Finally, we use CelFiE to decompose cfDNA from ALS patients and age-matched controls. We find increased cfDNA concentrations in ALS patients (p=3.0 x 10E-3). Specifically, CelFiE estimates increased skeletal muscle component in the cfDNA of ALS patients (p=2.6 x 10E-3), which is consistent with muscle impairment characterizing ALS. Quantification of skeletal muscle death in ALS is novel, and overall suggests that CelFiE may be a useful tool for biomarker discovery and monitoring of disease progression.

Download data

  • Downloaded 873 times
  • Download rankings, all-time:
    • Site-wide: 22,600
    • In genomics: 2,238
  • Year to date:
    • Site-wide: 47,210
  • Since beginning of last month:
    • Site-wide: 23,141

Altmetric data


Downloads over time

Distribution of downloads per paper, site-wide


PanLingua

Sign up for the Rxivist weekly newsletter! (Click here for more details.)


News