HiCDOC: chromatin compartment prediction and differential analysis from Hi-C data with replicates

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Motivation

The spatial organization of the genome plays an essential role in regulating cellular functions, with A/B chromatin compartments reflecting broad differences in transcriptional and epigenetic activity. Hi-C enables genome-wide identification of such compartments, but robust differential analysis between groups of samples remains challenging. Existing approaches largely rely on Principal Component Analysis, which, applied on Hi-C matrices separately, requires heuristic sign choices to merge results and does not naturally incorporate replicates.

Results

Here we present HiCDOC, a Bioconductor package for the prediction and differential analysis of chromatin compartments from Hi-C data with replicates. HiCDOC uses constrained k -means clustering to jointly analyze multiple Hi-C matrices, incorporating replicate information to enhance robustness, and provides empirical statistical support for predicted compartment switches.

Applied to Hi-C datasets from human tissues and mouse cell lines, HiCDOC identified biologically relevant compartment changes supported by transcriptional differences. Comparisons with existing tools showed both overlap and complementarity, while a controlled benchmark with artificially introduced changes confirmed high sensitivity. Although extensively tested on pairwise comparisons, HiCDOC offers a flexible framework compatible with more complex designs and, in principle, with more than two compartment states.

By combining replicate-aware clustering, automatic A/B assignment across chromosomes, extensive quality control, and statistical evaluation, HiCDOC provides an alternative and complementary approach to PCA-based methods for compartment analysis. HiCDOC thus expands the methodological toolkit for exploring 3D genome dynamics and its role in cellular processes.

Availability

HiCDOC is implemented in R and C++, and is available on Bioconductor: https://bioconductor.org/packages/release/bioc/html/HiCDOC.html

Contact

sylvain.foissac@inrae.fr

Article activity feed