Quantitative analysis of population-scale family trees using millions of relatives
Daniel G. MacArthur,
Alkes L. Price,
Posted 07 Feb 2017
bioRxiv DOI: 10.1101/106427 (published DOI: 10.1126/science.aam9309)
Posted 07 Feb 2017
Family trees have vast applications in multiple fields from genetics to anthropology and economics. However, the collection of extended family trees is tedious and usually relies on resources with limited geographical scope and complex data usage restrictions. Here, we collected 86 million profiles from publicly-available online data from genealogy enthusiasts. After extensive cleaning and validation, we obtained population-scale family trees, including a single pedigree of 13 million individuals. We leveraged the data to partition the genetic architecture of longevity by inspecting millions of relative pairs and to provide insights to population genetics theories on the dispersion of families. We also report a simple digital procedure to overlay other datasets with our resource in order to empower studies with population-scale genealogical data.
- Downloaded 18,972 times
- Download rankings, all-time:
- Site-wide: 75 out of 85,062
- In genomics: 11 out of 5,478
- Year to date:
- Site-wide: 4,513 out of 85,062
- Since beginning of last month:
- Site-wide: 10,091 out of 85,062
Downloads over time
Distribution of downloads per paper, site-wide
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!