Computational Pathology with Topological signatures and Visual Word Encoding
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Tissue analysis is considered the gold standard for the diagnosis of a wide spectrum of disorders. However, pathologists perform labor-intensive evaluations to ensure accurate results. Computational pathology has made significant advances in the development of task-specific predictive models. Nevertheless, traditional pixel- or texture-based features often fail to capture both local and global structural patterns together with their spatial organization. This study developed TopoBoW, a computational framework to objectively characterize morphological patterns using local and global microscopy image features. We developed TopoBoW by integrating Topological Data Analysis (TDA) and Bag-of-Visual-Words (BoVW), combining it with an attention-guided multi-layer perceptron (MLP) trained to distinguish between healthy and pathological muscle tissue. TDA captures global structural features with Betti curves derived from persistent homology, whereas BoVW encodes local textural patterns with SURF descriptors and histogram encoding. We also utilized visualizations to examine the statistical behavior of feature vectors across disease classes and healthy controls, evaluating their discriminative ability. We compare TopoBoW with several baseline and advanced models, including TDA, histogram of oriented gradients (HOG) with XGBoost and Attention-based MLP. TopoBoW demonstrated state-of-the-art performance on muscle tissue classification and outperformed all baselines in terms of all classification criteria, including accuracy, F1-score, and AUC. With its interpretable feature-based computational framework, TopoBoW can assist with pathological research, education, and interactive diagnostic workflows by integrating global structural and local textural information from images.