Enhanced Pooling for Weakly Supervised Gigapixel WSI Training Improves Classification and Lesion Localization
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
A robust artificial intelligence-assisted workflow for tumor assessment in pathology requires not only accurate classification but also precise lesion localization. While current weakly supervised learning methods significantly reduce the need for extensive annotations and leverage large quantities of annotation-free whole-slide images (WSIs) to enhance classification robustness, they often fall short in segmentation accuracy. We attribute this limitation to the optimization goals in classification, which tend to focus solely on the most representative features—an approach that is particularly inefficient for WSIs with gigapixel resolution. To address this challenge, we introduce a novel approach based on streaming convolution, an end-to-end method for WSI training. Our contributions include the Rectified LogSumExp pooling method and adaptive pseudo annotation generation for self-training, both designed to encourage models to learn from sub-representative features. Using only slide-level annotations from the CAMELYON16 dataset, our method achieves a significant improvement in metastasis localization, with a recall from 49.85% to 71.33% at a precision of 90%. This conclusion also holds true for a 3,024-LN dataset used in the assessment of lung cancer lymph node metastasis with a recall improved from 28.31% to 50.82%.