A Computational Pipeline for Stratifying Autoimmune Patients Using Binary Antibody Data
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
This method describes a computational pipeline for stratifying autoimmune patient groups using exclusively binary autoantibody data. Our method addresses a methodological gap in computational immunology by providing a standardized framework for analyzing categorical serological data commonly found in electronic health records and resource-limited settings. The pipeline integrates three complementary analytical modules:
-
Module 1: Exploratory screening using statistical association tests.
-
Module 2: Quantification of overall immunological similarity and un-certainty.
-
Module 3: Prediction modeling and validation against chance.
We demonstrate the method’s utility by applying it to two autoimmune disorders. We were successful in recapitulating established clinical relationships in these two closely linked diseases. The pipeline is implemented in Python and includes detailed configuration options for custom disease groups, autoanti-body panels and stratification variables. This method enables researchers to extract meaningful immunological patterns from underutilized binary clinical data, serving as a hypothesis-generation tool to help drive impactful exploration.