
Comprehensive identification and characterization of conserved small ORFs in animals

By Sebastian D. Mackowiak, Henrik Zauber, Chris Bielow, Denise Thiel, Kamila Kutz, Lorenzo Calviello, Guido Mastrobuoni, Nikolaus Rajewsky, Stefan Kempa, Matthias Selbach, Benedikt Obermayer

Posted 09 Apr 2015
bioRxiv DOI: 10.1101/017772 (published DOI: 10.1186/s13059-015-0742-x)

There is increasing evidence that non-annotated short open reading frames (sORFs) can encode functional micropeptides, but computational identification remains challenging. We extend our previously published method and predict conserved sORFs in human, mouse, zebrafish, fruit fly and the nematode C. elegans. By isolating specific conservation signatures indicative of purifying selection on the encoded amino acid sequence, we identify about 2,000 novel sORFs in the untranslated regions of canonical mRNAs or on transcripts annotated as non-coding. Predicted sORFs show stronger conservation signatures than those identified in previous studies and are sometimes conserved over large evolutionary distances. Encoded peptides have little homology to known proteins and are enriched in disordered regions and short interaction motifs. Published ribosome profiling data indicate translation for more than 100 novel sORFs, and mass spectrometry data provide peptidomic evidence for more than 70 novel candidates. We thus provide a catalog of conserved micropeptides for functional validation in vivo.
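
To make the notion of an sORF concrete, the sketch below enumerates candidate short ORFs on the forward strand of a transcript sequence. It is an illustration only and not the authors' pipeline: it does no conservation scoring, and the function name `find_sorfs` and the length bounds `min_codons` and `max_codons` are assumptions chosen for the example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_sorfs(seq, min_codons=10, max_codons=100):
    """List ATG-to-stop ORFs on the forward strand within a length window.

    Returns (start, end, frame) tuples. start is 0-based, end is exclusive
    and includes the stop codon. Length is counted in codons, including the
    ATG but excluding the stop codon. The default bounds are illustrative
    assumptions, not values taken from the paper.
    """
    seq = seq.upper()
    hits = []
    for frame in range(3):
        start = None  # position of the first ATG since the last stop codon
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                n_codons = (i - start) // 3
                if min_codons <= n_codons <= max_codons:
                    hits.append((start, i + 3, frame))
                start = None  # resume scanning after this stop codon
    return hits


if __name__ == "__main__":
    # Toy transcript: 2 nt of leader, ATG, 11 alanine codons, stop, 2 nt trailer.
    toy = "CC" + "ATG" + "GCT" * 11 + "TAA" + "GG"
    print(find_sorfs(toy))
```

Candidates enumerated this way would still need to be scored for conservation across species, which is the step the abstract describes and which this sketch does not attempt.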

Download data

  • Downloaded 974 times
  • Download rankings, all-time:
    • Site-wide: 14,478 out of 105,403
    • In genomics: 1,888 out of 6,382
  • Year to date:
    • Site-wide: 63,196 out of 105,403
  • Since beginning of last month:
    • Site-wide: 33,335 out of 105,403
