Scalable data harmonization for single-cell image-based profiling with CytoTable

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

High-content imaging (HCI) involves the automated acquisition and quantitative analysis of cell phenotypes from microscopy images. These studies often rely on screening, which can involve thousands of chemical or genetic perturbations that produce terabytes of microscopy data. To extract meaningful biological insights, this data must be processed into quantitative features through a technique known as image-based profiling. A major analytical bottleneck is curating the high-dimensional, single-cell data derived from varied image analysis tools. These datasets suffer from inconsistent schemas, inefficient file formats, and undocumented ontological relationships. These challenges reduce reproducibility and slow progress in downstream applications. To solve these issues, we introduce CytoTable 1 , a software package for harmonizing single-cell image-based profiling. CytoTable enables modular, portable, and cross-language data integration through a robust, reproducible, and scalable engine that harmonizes single-cell readouts from multiple image analysis tools, preparing for feature integration with software in the Cytomining ecosystem such as Pycytominer. 2

Article activity feed