Rxivist logo

Kipoi: accelerating the community exchange and reuse of predictive models for genomics

By Žiga Avsec, Roman Kreuzhuber, Johnny Israeli, Nancy Xu, Jun Cheng, Avanti Shrikumar, Abhimanyu Banerjee, Daniel S Kim, L Urban, Anshul Kundaje, Oliver Stegle, Julien Gagneur

Posted 24 Jul 2018
bioRxiv DOI: 10.1101/375345

Advanced machine learning models applied to large-scale genomics datasets hold the promise to be major drivers for genome science. Once trained, such models can serve as a tool to probe the relationships between data modalities, including the effect of genetic variants on phenotype. However, lack of standardization and limited accessibility of trained models have hampered their impact in practice. To address this, we present Kipoi, a collaborative initiative to define standards and to foster reuse of trained models in genomics. Already, the Kipoi repository contains over 2,000 trained models that cover canonical prediction tasks in transcriptional and post-transcriptional gene regulation. The Kipoi model standard grants automated software installation and provides unified interfaces to apply and interpret models. We illustrate Kipoi through canonical use cases, including model benchmarking, transfer learning, variant effect prediction, and building new models from existing ones. By providing a unified framework to archive, share, access, use, and build on models developed by the community, Kipoi will foster the dissemination and use of machine learning models in genomics.

Download data

  • Downloaded 3,335 times
  • Download rankings, all-time:
    • Site-wide: 1,388 out of 83,751
    • In bioinformatics: 252 out of 8,033
  • Year to date:
    • Site-wide: 3,549 out of 83,751
  • Since beginning of last month:
    • Site-wide: 6,226 out of 83,751

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)