Utility of Same-Modality, Cross-Domain Transfer Learning for Malignant Bone Tumor Detection on Radiographs: A Multi-Faceted Performance Comparison with a Scratch-Trained Model

Joe Hasei
Ryuichi Nakahara
Yujiro Otsuka
Koichi Takeuchi
Yusuke Nakamura
Kunihiro Ikuta
Shuhei Osaki
Hironari Tamiya
Shinji Miwa
Shusa Ohshika
Shunji Nishimura
Naoaki Kahara
Aki Yoshida
Hiroya Kondo
Tomohiro Fujiwara
Toshiyuki Kunisada
Toshifumi Ozaki

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background/Objectives: Developing high-performance artificial intelligence (AI) models for rare diseases like malignant bone tumors is limited by scarce annotated data. This study evaluates same-modality cross-domain transfer learning by comparing an AI model pretrained on chest radiographs with a model trained from scratch for detecting malignant bone tumors on knee radiographs. Methods: Two YOLOv5-based detectors differed only in initialization (transfer vs. scratch). Both were trained/validated on institutional data and tested on an independent external set of 743 radiographs (268 malignant, 475 normal). The primary outcome was AUC; prespecified operating points were high-sensitivity (≥0.90), high-specificity (≥0.90), and Youden-optimal. Secondary analyses included PR/F1, calibration (Brier, slope), and decision curve analysis (DCA). Results: AUC was similar (YOLO-TL 0.954 [95% CI 0.937–0.970] vs. YOLO-SC 0.961 [0.948–0.973]; DeLong p = 0.53). At the high-sensitivity point (both sensitivity = 0.903), YOLO-TL achieved higher specificity (0.903 vs. 0.867; McNemar p = 0.037) and PPV (0.840 vs. 0.793; bootstrap p = 0.030), reducing ~17 false positives among 475 negatives. At the high-specificity point (~0.902–0.903 for both), YOLO-TL showed higher sensitivity (0.798 vs. 0.764; p = 0.0077). At the Youden-optimal point, sensitivity favored YOLO-TL (0.914 vs. 0.892; p = 0.041) with a non-significant specificity difference. Conclusions: Transfer learning may not improve overall AUC but can enhance practical performance at clinically crucial thresholds. By maintaining high detection rates while reducing false positives, the transfer learning model offers superior clinical utility. Same-modality cross-domain transfer learning is an efficient strategy for developing robust AI systems for rare diseases, supporting tools more readily acceptable in real-world screening workflows.

Version published to 10.3390/cancers17193144
Sep 27, 2025
Version published to 10.20944/preprints202508.2006.v1
Aug 27, 2025

Integrating Traditional Machine Learning and Deep Learning Methods for Enhanced Wilms Tumor Detection

This article has 2 authors:
1. Anirudh Anandarao
2. Bhadresh Amarnath
This article has no evaluationsLatest version Dec 31, 2025
Comparative Analysis of 2.5D Deep Learning, 2D Deep Learning, and Radiomics Models for Predicting Axillary Lymph Node Metastasis in Breast Cancer: A Multi-Center Study

This article has 8 authors:
1. Lingsong Meng
2. Xin Zhao
3. Yuxia Zhang
4. Lin Lu
5. Xiang Meng
6. Shuangyu Li
7. Fuming Shao
8. Xiaoan Zhang
This article has no evaluationsLatest version Jan 19, 2026
Application of Deep Learning Strategies in the Standardization and Diagnostic Efficiency Enhancement of Chest X-ray Imaging

This article has 5 authors:
1. Wen Chang Tseng
2. Yung-Cheng Wang
3. Wei-Chi Chen
4. Sen-Ping Lin
5. Kang-Ping Lin
This article has no evaluationsLatest version Dec 18, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Integrating Traditional Machine Learning and Deep Learning Methods for Enhanced Wilms Tumor Detection

Comparative Analysis of 2.5D Deep Learning, 2D Deep Learning, and Radiomics Models for Predicting Axillary Lymph Node Metastasis in Breast Cancer: A Multi-Center Study

Application of Deep Learning Strategies in the Standardization and Diagnostic Efficiency Enhancement of Chest X-ray Imaging