Systematic evaluation of statistical methods for identifying looping interactions in 5C data

By Thomas G. Gilgenast, Jennifer E. Phillips-Cremins

Posted 11 Oct 2017
bioRxiv DOI: 10.1101/201681 (published DOI: 10.1016/j.cels.2019.02.006)

Chromosome-Conformation-Capture-Carbon-Copy (5C) is a molecular technology based on proximity ligation that enables high-resolution, high-coverage inquiry of long-range chromatin looping interactions. Computational pipelines for analyzing 5C data involve a series of interdependent normalization procedures and statistical methods that markedly influence downstream biological results. A detailed analysis of the trade-offs inherent to all stages of 5C analysis has not been reported, but is essential for understanding the biological basis of looping. Here, we provide a comparative assessment of method performance at each step in the 5C analysis pipeline, including sequencing depth and library complexity correction, bias mitigation, spatial noise reduction, distance-dependent expected and variance estimation, modeling, and loop detection. We present a detailed discussion of methodological advantages and disadvantages at each step and provide a full suite of algorithms, lib5C, to allow investigators to test the range of approaches on their own high-resolution 5C data. Principles learned from our comparative analyses will have broad impact on many other forms of Chromosome-Conformation-Capture-based data, including Hi-C, 4C, and Capture-C.

Download data

  • Downloaded 500 times
  • Download rankings, all-time:
    • Site-wide: 37,677 out of 103,764
    • In genomics: 3,571 out of 6,382
  • Year to date:
    • Site-wide: 94,757 out of 103,764
  • Since beginning of last month:
    • Site-wide: 97,116 out of 103,764

