Rxivist logo

A deep learning approach to predict the impact of non-coding sequence variants on 3D chromatin structure

By Tuan Trieu, Ekta Khurana

Posted 10 Jan 2019
bioRxiv DOI: 10.1101/516849

Three-dimensional structures of the genome play an important role in regulating the expression of genes. Non-coding variants have been shown to alter 3D genome structures to activate oncogenes in cancer. However, there is currently no method to predict the effect of DNA variants on 3D structures. We propose a deep learning method, DeepMILO, to learn DNA sequence features of CTCF/cohesin-mediated loops and to predict the effect of variants on these loops. DeepMILO consists of a convolutional and a recurrent neural network, and it can learn features beyond the presence of CTCF motifs and their orientations. Application of DeepMILO on a cohort of 241 malignant lymphoma patients with whole-genome sequences revealed CTCF/cohesin-mediated loops disrupted in multiple patients. These disrupted loops contain known cancer driver genes and novel genes. Our results show mutations at loop boundaries are associated with upregulation of the cancer driver gene BCL2 and may point to a possible new mechanism for its dysregulation via alteration of 3D loop structures.

Download data

  • Downloaded 1,305 times
  • Download rankings, all-time:
    • Site-wide: 7,376 out of 89,267
    • In bioinformatics: 1,297 out of 8,426
  • Year to date:
    • Site-wide: 24,858 out of 89,267
  • Since beginning of last month:
    • Site-wide: 55,788 out of 89,267

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)