fourSynergy: Ensemble-based interaction calling on 4C-seq data using gradient-free optimization

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Motivation

Chromatin organization plays a crucial role in gene regulation and is associated with various severe diseases like cancer. Since chromatin changes are potentially reversible, a deeper understanding of the alterations needs to be harnessed for the development of new therapies. Circular Chromosome Conformation Capture Sequencing (4C-seq) is a sequencing technique enabling the identification of chromatin interactions between genes and regulatory elements. This work aims to develop an ensemble algorithm that utilizes synergies among available 4C-seq tools, which in turn allows to achieve superior predictive performance in interaction calling.

Results

We employed existing 4C-seq algorithms using a weighted-voting approach. By optimizing the tool weights according to various predictive metrics using gradient-free optimization strategies, we demonstrate the potential of combining multiple 4C-seq analysis tools for interaction calling. Our results indicate that a weighted-voting based ensemble approach can outperform individual algorithms in various datasets. Although the optimal solutions differ across the 4C-seq datasets, we successfully identified global solutions that outperform the individual algorithms for all datasets analyzed.

Availability

https://github.com/sophiewind/fourSynergy , https://github.com/sophiewind/fourSynergy_pip

Contact

sophie.wind@uni-muenster.de

Supplementary information

Supplementary data are available at Journal Name online.

Article activity feed