Generalizability of Polygenic Risk Scores for Breast Cancer in the Multiethnic eMERGE Study
Wendy K. Chung,
Ali G Ghravi,
Katherine D Crew,
Teri A Manolio,
Gail P Jarvik,
Ann E Justice,
Alanna K Rahm,
Stephanie M Fullerton,
Jordan W Smoller,
Eric B Larson,
Paul K. Crane,
Mary Beth Terry,
Posted 21 Aug 2020
medRxiv DOI: 10.1101/2020.08.17.20176685
Posted 21 Aug 2020
Background: The majority of polygenic risk scores (PRS) for breast cancer have been developed and validated using cohorts of European ancestry (EA). Less is known about the generalizability of these PRS in other ancestral groups. Methods: The Electronic Medical Records and Genomics (eMERGE) network cohort dataset was used to evaluate the performance of seven previously developed PRS (three EA-based PRSs, and four non-EA based PRSs) in three major ancestral groups. Each PRS was separately evaluated in EA (cases: 3939; controls: 28840), African ancestry (AA) (cases: 121; controls: 1173) and self-reported LatinX ancestry (LA) (cases: 92; controls: 1363) women. We assessed the association between breast cancer risk and each PRS, adjusting forage, study site, breast cancer family history, and first three ancestry informative principal components. Results: EA-based PRSs were significantly associated with breast cancer risk in EA women per one SD increase (odds ratio [OR]=1.45, 95% confidence interval [CI]=1.40-1.51), and LA women (OR=1.41, 95% CI=1.13-1.77), but not AA women (OR=1.13, 95% CI=0.92-1.40). There was no statistically significant association for the non-EA PRSs in all ancestry groups, including an LA-based PRS and an AA-based PRS. Conclusion: We evaluated EA-derived PRS for estimating breast cancer risk using the eMERGE dataset and found they generalized well in LA women but not in AA women. For non-EA based PRSs, we did not replicate previously reported associations for the respective ancestries in the eMERGE cohort. Our results highlight the need to improve representation of diverse population groups, particularly AA women, in research cohorts.
- Downloaded 214 times
- Download rankings, all-time:
- Site-wide: 103,636
- In genetic and genomic medicine: 433
- Year to date:
- Site-wide: 48,907
- Since beginning of last month:
- Site-wide: 39,751
Downloads over time
Distribution of downloads per paper, site-wide
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!