COPO: a metadata platform for brokering FAIR data in the life sciences

By Anthony Etuk, Felix Shaw, Alejandra Gonzalez-Beltran, David Johnson, Marie-Angélique Laporte, Philippe Rocca-Serra, Elizabeth Arnaud, Medha Devare, Paul J Kersey, Susanna-Assunta Sansone, Robert P Davey

Posted 26 Sep 2019
bioRxiv DOI: 10.1101/782771

Scientific innovation is increasingly reliant on data and computational resources. Much of today's life science research involves generating, processing, and reusing heterogeneous datasets that are growing exponentially in size. Demand for technical experts (data scientists and bioinformaticians) to process these data is at an all-time high, but these experts are not typically trained in good data management practices. That said, we have come a long way in the last decade, with funders, publishers, and researchers themselves making the case for open, interoperable data as a key component of an open science philosophy. In response, recognition of the FAIR Principles (that data should be Findable, Accessible, Interoperable and Reusable) has become commonplace. However, both technical and cultural challenges for the implementation of these principles still exist when storing, managing, analysing and disseminating both legacy and new data. COPO is a computational system that attempts to address some of these challenges by enabling scientists to describe their research objects (raw or processed data, publications, samples, images, etc.) using community-sanctioned metadata sets and vocabularies, and then use public or institutional repositories to share them with the wider scientific community. COPO encourages data generators to adhere to appropriate metadata standards when publishing research objects, using semantic terms to add meaning to them and specify relationships between them. This allows data consumers, be they people or machines, to find, aggregate, and analyse data which would otherwise be private or invisible. COPO builds upon existing standards to push the state of the art in scientific data dissemination whilst minimising the burden of data publication and sharing. Availability: COPO is entirely open source and freely available on GitHub at https://github.com/collaborative-open-plant-omics. A public instance of the platform for use by the community, as well as more information, can be found at copo-project.org.

Download data

  • Downloaded 343 times
  • Download rankings, all-time:
    • Site-wide: 56,264 out of 103,749
    • In bioinformatics: 6,266 out of 9,474
  • Year to date:
    • Site-wide: 36,254 out of 103,749
  • Since beginning of last month:
    • Site-wide: 57,759 out of 103,749
