Rxivist logo

Contig annotation tool CAT robustly classifies assembled metagenomic contigs and long sequences

By Diego D. Cambuy, Felipe H. Coutinho, Bas E. Dutilh

Posted 01 Sep 2016
bioRxiv DOI: 10.1101/072868 (published DOI: 10.1186/s13059-019-1817-x)

In modern-day metagenomics, there is an increasing need for robust taxonomic annotation of long DNA sequences from unknown micro-organisms. Long metagenomic sequences may be derived from assembly of short-read metagenomes, or from long-read single molecule sequencing. Here we introduce CAT, a pipeline for robust taxonomic classification of long DNA sequences. We show that CAT correctly classifies contigs at different taxonomic levels, even in simulated metagenomic datasets that are very distantly related from the sequences in the database. CAT is implemented in Python and the required scripts can be freely downloaded from Github.

Download data

  • Downloaded 1,454 times
  • Download rankings, all-time:
    • Site-wide: 9,680 out of 117,931
    • In bioinformatics: 1,258 out of 9,553
  • Year to date:
    • Site-wide: 25,548 out of 117,931
  • Since beginning of last month:
    • Site-wide: 30,338 out of 117,931

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)