OrganSegBench: A Comprehensive Multi-Organ Benchmark for Segmentation Foundation Models with a Practical Synergy Pathway to Clinical Application

Chengyan Wang
Qing Li
Yizhe Zhang
Xin Guo
Haosen Zhang
Yan Li
Mo Yang
Yajing Zhang
Mengting Sun
Longyu Sun
Haoyang Zhang
Junhong Liu
Shuo Wang

Abstract

Segmentation Foundation Models (SFMs), despite their success in general computer vision, remain suboptimal for medical imaging, where clinical requirements like fairness and robustness are paramount. Existing benchmarks fail to address these needs and often rely on datasets with potential training data leakage. To bridge this gap, we introduce “OrganSegBench”, a comprehensive benchmark evaluation framework built on a new, high-quality data resource from 701 subjects with 16 annotated organs and detailed demographics. Using this resource, we systematically evaluated six state-of-the-art SFMs across five key dimensions: accuracy, generalization, robustness, fairness, and clinical utility. Our analysis uncovers a fundamental trade-off: the most accurate models are consistently the least fair, with no single model achieving excellence across all dimensions. To resolve this dilemma, we propose two ensemble strategies: training-free fusion and multi-source knowledge distillation. Notably, both approaches decisively outperformed every individual SFM across all evaluation dimensions, resolving the accuracy-fairness trade-off. These findings expose the inherent limitations of current monolithic SFMs and establish principled model synergy as a practical and superior pathway toward building safe, equitable, and clinically robust AI.
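The abstract does not say how the training-free fusion is carried out. One common way to combine several segmentation outputs without further training is per-pixel majority voting, sketched below as an illustration only. The function names `majority_vote` and `dice`, the tie-breaking rule, and the toy 2x2 label maps are assumptions made for this example and are not taken from the paper.

```python
from collections import Counter


def majority_vote(label_maps):
    """Fuse equal-shaped 2D label maps (lists of lists of int) per pixel.

    Assumption: ties go to the smallest label id. The paper's actual
    fusion rule is not described in the abstract.
    """
    if not label_maps:
        raise ValueError("need at least one label map")
    rows, cols = len(label_maps[0]), len(label_maps[0][0])
    fused = []
    for r in range(rows):
        fused_row = []
        for c in range(cols):
            # Count how many models predicted each label at this pixel.
            votes = Counter(m[r][c] for m in label_maps)
            top = max(votes.values())
            fused_row.append(min(lbl for lbl, n in votes.items() if n == top))
        fused.append(fused_row)
    return fused


def dice(pred, truth, label):
    """Dice overlap for one label: 2*|P and T| / (|P| + |T|)."""
    p = sum(v == label for row in pred for v in row)
    t = sum(v == label for row in truth for v in row)
    inter = sum(
        pv == label and tv == label
        for pred_row, truth_row in zip(pred, truth)
        for pv, tv in zip(pred_row, truth_row)
    )
    # Convention: if the label is absent from both maps, count as perfect.
    return 1.0 if p + t == 0 else 2.0 * inter / (p + t)


# Hypothetical outputs of three models on a 2x2 image (0 = background, 1 = organ).
model_a = [[0, 1], [1, 1]]
model_b = [[0, 1], [0, 1]]
model_c = [[0, 0], [1, 1]]
truth = [[0, 1], [1, 1]]

fused = majority_vote([model_a, model_b, model_c])
print(fused, dice(fused, truth, 1), dice(model_b, truth, 1))
```

In this toy case the fused map recovers a pixel that `model_b` misses and another that `model_c` misses. This shows only the mechanism of voting and is not evidence for the results reported above.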

Version published to 10.21203/rs.3.rs-7844536/v1 on Research Square
Oct 21, 2025

Related articles

Independent Benchmarking of Prompt-Based Medical Segmentation Models

This article has 8 authors:
1. Ayhan Can Erdur
2. Daniel Scholz
3. Josef A. Buchner
4. Denise Bernhardt
5. Stephanie E. Combs
6. Benedikt Wiestler
7. Daniel Rueckert
8. Jan C. Peeken
This article has no evaluations. Latest version: Oct 10, 2025

MSS-UNet : Mamba-Based Multi-directional Selective Scanning for Medical Image Segmentation

This article has 6 authors:
1. Jun Wu
2. Pengfei Zhan
3. Xinyi Zhu
4. Shuai Guo
5. Yu Chen
6. Li Yang
This article has no evaluations. Latest version: Oct 20, 2025

CAMUS-HeartNet: A Deep Meta-Ensemble Architecture for Accurate Cardiac Tissue Segmentation

This article has 1 author:
1. Alireza Rahi
This article has no evaluations. Latest version: Oct 19, 2025
