Population sequencing data reveal a compendium of mutational processes in human germline
Vladimir B. Seplyarskiy,
Ruslan A. Soldatov,
Ryan J. McGinty,
Jakob M. Goldmann,
Esteban G. Burchard,
Patrick T. Ellinor,
Stephen T McGarvey,
Braxton D Mitchell,
Vasan S. Ramachandran,
Scott T Weiss,
Donna K. Arnett,
Jerome I. Rotter,
Jennifer A. Brody,
Yii-Der Ida Chen,
Lisa de las Fuentes,
Stephen S Rich,
Ani W. Manichaikul,
Josyf C Mychaleckyj,
Nicholette D Palmer,
Jennifer A. Smith,
Sharon LR Kardia,
Patricia A. Peyser,
Lawrence F Bielak,
Timothy D. O’Connor,
Leslie S Emery,
NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium,
TOPMed Population Genetics Working Group,
Wendy S.W. Wong,
Peter V. Kharchenko,
Shamil R. Sunyaev
Posted 11 Jan 2020
bioRxiv DOI: 10.1101/2020.01.10.893024
Posted 11 Jan 2020
Mechanistic processes underlying human germline mutations remain largely unknown. Variation in mutation rate and spectra along the genome is informative about the biological mechanisms. We statistically decompose this variation into separate processes using a blind source separation technique. The analysis of a large-scale whole genome sequencing dataset (TOPMed) reveals nine processes that explain the variation in mutation properties between loci. Seven of these processes lend themselves to a biological interpretation. One process is driven by bulky DNA lesions that resolve asymmetrically with respect to transcription and replication. Two processes independently track direction of replication fork and replication timing. We identify a mutagenic effect of active demethylation primarily acting in regulatory regions. We also demonstrate that a recently discovered mutagenic process specific to oocytes can be localized solely from population sequencing data. This process is spread across all chromosomes and is highly asymmetric with respect to the direction of transcription, suggesting a major role of DNA damage.
- Downloaded 972 times
- Download rankings, all-time:
- Site-wide: 11,918 out of 88,857
- In genetics: 799 out of 4,603
- Year to date:
- Site-wide: 1,447 out of 88,857
- Since beginning of last month:
- Site-wide: 13,110 out of 88,857
Downloads over time
Distribution of downloads per paper, site-wide
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!