ScAPE: A lightweight multitask learning baseline method to predict transcriptomic responses to perturbations

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Predicting biological responses to perturbations is essential for advancing drug development efforts. While high-throughput technologies have greatly enhanced our ability to study perturbation effects at the molecular level, accurately modeling these responses remains challenging due to the complex underlying molecular interactions, the vast dimensionality of cellular response, the limited amount of data, and technical limitations in data collection. Machine learning (ML) approaches have emerged as powerful tools for addressing these challenges. However, the observation in recent benchmarks that complex models do not consistently outperform simpler ones highlights the need for robust and efficient baselines for evaluating new methods. We present ScAPE (Single Cell Analysis of Perturbational Effects), an ML method for predicting drug-induced gene expression changes. ScAPE leverages only aggregated gene-level statistics across cell types and perturbations as input features, yet achieves good performance despite this simplicity. Its effectiveness was demonstrated in the Single-cell Perturbation Prediction NeurIPS 2023 challenge, where it ranked among the top-performing methods. We describe the core model and introduce extensions that enhance its utility as a baseline, including a multitask learning extension for simultaneous prediction of multiple transcriptional response metrics. A software API is provided to support adoption and integration into research pipelines. Benchmarking against other leading challenge entries and general-purpose ML approaches, using the original challenge metric, supports ScAPE as a simple, lightweight baseline for evaluating methods that predict perturbation effects at the gene-expression level. ScAPE is built using Keras 3, enabling support for different backends such as TensorFlow, JAX, or PyTorch, and freely available as an open-source package at github.com/scapeML/scape.

Article activity feed