A Clinically Applicable and Generalizable Deep Learning Model for Anterior Mediastinal Tumors in CT Images Across Multiple Institutions

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Rare diseases are often difficult to diagnose, and their scarcity also makes it challenging to develop deep learning models for them due to lim- ited large-scale datasets. Anterior mediastinal tumors—including thymoma and thymic carcinoma—represent such rare entities. A few diagnostic support systems for these tumors have been proposed; however, no prior studies have tested them across multiple institutions, and clinically applicable and generalizable models remain lacking.A total of 711 computed tomography (CT) images were collected from 136 hospitals, each from a different patient with pathologically proven anterior mediastinal tumors (339 males, 372 females). Of these, 485 images were used for training, 62 for internal testing, and 164 for external testing. The external testing dataset comprised CT images from 121 unique institutions not involved in the other datasets. A 3D U-Net-based model was trained on the training dataset, and the model with the best performance on the internal testing dataset was selected. This model was then evaluated on the external testing dataset for its segmentation and detection performance across different institutions. Based on the reference standards provided by board-certified diagnostic radiologists, the trained model achieved average Dice scores of 0.82, Intersection over Union (IoU) of 0.72, Precision of 0.85, and Recall of 0.82 for tumor segmen- tation at the CT-image level. The free-response receiver operating characteristic curve—derived from lesion-wise IoU thresholds—demonstrated high sensitivity and a low false-positive rate for tumor detection. Even under a stricter IoU threshold of 0.50, the model maintained a sensitivity of 0.87 with only 0.61 false positives per scan. Our model achieved clinically applicable segmentation and detec- tion performance for anterior mediastinal tumors, demonstrating broad gener- alizability across 121 institutions and overcoming the data-scarcity challenges inherent to such rare diseases.

Article activity feed