A multi-platform reference for somatic structural variation detection
Jose Espejo Valle-Inclan,
Nicolle J.M. Besselink,
Ewart de Bruijn,
Daniel L. Cameron,
Stef van Lieshout,
Andy Wing Chun Pang,
Margaretha G.M. Roemer,
Markus J. van Roosmalen,
Aaron M. Wenger,
Remond J.A. Fijneman,
Wigard P. Kloosterman,
Posted 16 Oct 2020
bioRxiv DOI: 10.1101/2020.10.15.340497
Posted 16 Oct 2020
Accurate detection of somatic structural variation (SV) in cancer genomes remains a challenging problem. This is in part due to the lack of high-quality gold standard datasets that enable the benchmarking of experimental approaches and bioinformatic analysis pipelines for comprehensive somatic SV detection. Here, we approached this challenge by genome-wide somatic SV analysis of the paired melanoma and normal lymphoblastoid COLO829 cell lines using four different technologies: Illumina HiSeq, Oxford Nanopore, Pacific Biosciences and 10x Genomics. Based on the evidence from multiple technologies combined with extensive experimental validation, including Bionano optical mapping data and targeted detection of candidate breakpoint junctions, we compiled a comprehensive set of true somatic SVs, comprising all SV types. We demonstrate the utility of this resource by determining the SV detection performance of each technology as a function of tumor purity and sequence depth, highlighting the importance of assessing these parameters in cancer genomics projects and data analysis tool evaluation. The reference truth somatic SV dataset as well as the underlying raw multi-platform sequencing data are freely available and are an important resource for community somatic benchmarking efforts. ### Competing Interest Statement AWCP is an employee of Bionano Genomics. AMW is an employee and shareholder of Pacific Biosciences.
- Downloaded 175 times
- Download rankings, all-time:
- Site-wide: 85,001 out of 103,764
- In genomics: 5,738 out of 6,382
- Year to date:
- Site-wide: 37,972 out of 103,764
- Since beginning of last month:
- Site-wide: 2,262 out of 103,764
Downloads over time
Distribution of downloads per paper, site-wide
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!