Optimizing and Evaluating Robustness of AI for Brain Metastasis Detection and Segmentation via Loss Functions and Multi-dataset Training

Yiding Han
Piyush Pathak
Omar Awad
Abdallah S. R Mohamed
Vincent Ugarte
Boran Zhou
Daniel Allen Hamstra
Alfredo Enrique Echeverria
Hasan Al Mekdash
Zaid Ali Siddiqui
Baozhou Sun

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Purpose

Accurate detection and segmentation of brain metastases (BM) from MRI are critical for the appropriate management of cancer patients. This study investigates strategies to enhance the robustness of artificial intelligence (AI)-based BM detection and segmentation models.

Method

A DeepMedic-based network with a loss function, tunable with a sensitivity/specificity tradeoff weighting factor α- was trained on T1 post-contrast MRI datasets from two institutions (514 patients, 4520 lesions). Robustness was evaluated on an external dataset from a third institution dataset (91 patients, 397 lesions), featuring ground truth annotations from two physicians. We investigated the impact of loss function weighting factor, α and training dataset combinations. Detection performance (sensitivity, precision, F1 score) and segmentation accuracy (Dice similarity, and 95% Hausdorff distance (HD95)) were evaluated using one physician’s contours as the reference standard. The optimal AI model was then directly compared to the performance of the second physician.

Results

Varying α demonstrated a trade-off between sensitivity (higher α) and precision (lower α), with α=0.5 yielding the best F1 score (0.80 ± 0.04 vs. 0.78 ± 0.04 for α=0.95 and 0.72 ± 0.03 for α=0.99) on the external dataset. The optimally trained model achieved detection performance comparable to the physician (F1: AI=0.83 ± 0.04, Physician=0.83 ± 0.04), but slightly underperformed in segmentation (Dice: 0.79 ± 0.04 vs. AI=0.74 ± 0.03; HD95: 2.8 ± 0.14 mm vs. AI=3.18 ± 0.16 mm, p<0.05).

Conclusion

The derived optimal model achieves detection and segmentation performance comparable to an expert physician in a parallel comparison.

Version published to 10.1101/2025.08.22.25334255 on medRxiv
Sep 2, 2025

Deep Learning-Based Brain Tumor Segmentation Using 3D MRI Scans from the BraTS 2020 Dataset

This article has 3 authors:
1. N. Deena Nepolian
2. M. Mary Synthuja Jain Preetha
3. K. S. Vijula Grace
This article has no evaluationsLatest version Jan 28, 2026
Segmenting with Confidence: Uncertainty Quantification for Brain Tumor Imaging

This article has 8 authors:
1. Yassine Guennoun
2. Pierre Nedelec
3. Mark McArthur
4. Evan Bloch
5. Jinchi Wei
6. Leo Sugrue
7. Evan Calabrese
8. Andreas Rauschecker
This article has no evaluationsLatest version Jan 9, 2026
Deep Learning-Based MRI Segmentation for Non-Invasive Prediction of Microsatellite Instability in Endometrial Cancer: A Multicenter Study

This article has 10 authors:
1. Ke Wang
2. Xiaoli Song
3. Xiaoyi Liu
4. Xuqing Lin
5. Hongjian Luo
6. Xinyi Gou
7. Nan Hong
8. Yi Wang
9. Rong Zhou
10. Jin Cheng
This article has no evaluationsLatest version Dec 30, 2025

Discuss this preprint

Listed in

Abstract

Purpose

Method

Results

Conclusion

Article activity feed

Related articles

Deep Learning-Based Brain Tumor Segmentation Using 3D MRI Scans from the BraTS 2020 Dataset

Segmenting with Confidence: Uncertainty Quantification for Brain Tumor Imaging

Deep Learning-Based MRI Segmentation for Non-Invasive Prediction of Microsatellite Instability in Endometrial Cancer: A Multicenter Study