Challenges and recommendations to improve installability and archival stability of omics computational tools
By
Serghei Mangul,
Thiago Mosqueiro,
Richard J. Abdill,
Dat Duong,
Keith Mitchell,
Varuni Sarwal,
Brian L. Hill,
Jaqueline Brito,
Russell Jared Littman,
Benjamin Statz,
Angela Ka-Mei Lam,
Gargi Dayama,
Laura Grieneisen,
Lana S. Martin,
Jonathan Flint,
Eleazar Eskin,
Ran Blekhman
Posted 25 Oct 2018
bioRxiv DOI: 10.1101/452532
(published DOI: 10.1371/journal.pbio.3000333)
Developing new software tools for analysis of large-scale biological data is a key component of advancing modern biomedical research. Scientific reproduction of published findings requires running computational tools on data generated by such studies, yet little attention is presently allocated to the installability and archival stability of computational software tools. Scientific journals require data and code sharing, but none currently require authors to guarantee the continuing functionality of newly published tools. We have estimated the archival stability of computational biology software tools by performing an empirical analysis of the internet presence for 36,702 omics software resources published from 2005 to 2017. We found that almost 28% of all resources are currently not accessible through URLs published in the paper they first appeared in. Among the 98 software tools selected for our installability test, 51% were deemed “easy to install,” and 28% of the tools failed to be installed at all due to problems in the implementation. Moreover, for papers introducing new software, we found that the number of citations significantly increased when authors provided an easy installation process. We propose for incorporation into journal policy several practical solutions for increasing the widespread installability and archival stability of published bioinformatics software.
Download data
- Downloaded 5,331 times
- Download rankings, all-time:
- Site-wide: 1,714
- In bioinformatics: 137
- Year to date:
- Site-wide: None
- Since beginning of last month:
- Site-wide: None
Altmetric data
Downloads over time
Distribution of downloads per paper, site-wide
PanLingua
News
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!