Predicting Alcohol Dependence from Multi-Site Brain Structural Measures
By
Sage Hahn,
Scott Mackey,
Janna Cousijn,
John J. Foxe,
Robert Hester,
Kent Hutchinson,
Ozlem Korucuoglu,
Edythe London,
Valentina Lorenzetti,
Maartje Luijten,
Reza Momenan,
Catherine Orr,
Martin Paulus,
Lianne Schmaal,
Rajita Sinha,
Zsuzsika Sjoerds,
Dan Stein,
Elliot Stein,
Ruth J. van Holst,
Dick Veltman,
Reinout W Wiers,
Murat Yucel,
Paul M. Thompson,
Patricia Conrod,
Nicholas Allgaier,
Hugh Garavan
Posted 23 Jan 2020
medRxiv DOI: 10.1101/2020.01.17.20016873
BackgroundThe search for neuroimaging biomarkers of alcohol use disorder (AUD) has primarily been restricted to significance testing in small datasets of low diversity. To identify neurobiological markers beyond individual differences, it may be useful to develop classification models for AUD. The ever-increasing quantity of neuroimaging data demands methods that are robust to the complexities of multi-site designs and are generalizable to data from new scanners. MethodsThis study represents a mega-analysis of previously published datasets from 2,034 AUD and comparison participants spanning 27 sites, coordinated by the ENIGMA Addiction Working Group. Data were grouped into a training set including 1,652 participants (692 AUD, 24 sites), and test set with 382 participants (146 AUD, 3 sites). A battery of machine learning classifiers was evaluated using repeated random cross-validation (CV) and leave-site-out CV. Area under the receiver operating characteristic curve (AUC) was our base metric of performance. ResultsMulti-objective evolutionary search was conducted to identify sparse, generalizable, and high performing subsets of brain measurements. Cortical thickness in the left superior frontal gyrus and right lateral orbitofrontal cortex, cortical surface area in the right transverse temporal gyrus, and left putamen volume, appeared most frequently across searches. Restricting a regularized logistic regression model to these four features yielded a test-set AUC of .768. ConclusionsDeveloping classification models on multi-site data with varied underlying class distributions poses unique challenges. Supplementing datasets with controls from new sites and performing feature selection increases generalizability. Four features identified by evolutionary search may serve as specific biomarkers for individuals with current AUD.
Download data
- Downloaded 544 times
- Download rankings, all-time:
- Site-wide: 47,839
- In addiction medicine: 14
- Year to date:
- Site-wide: 101,738
- Since beginning of last month:
- Site-wide: 102,008
Altmetric data
Downloads over time
Distribution of downloads per paper, site-wide
PanLingua
News
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!