SHIELD: An AI Framework for Skeletal Health Intelligence and Early Lesion Detection to Improve Orthopedic Referrals

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Delays in the referral of patients with suspected metastatic bone disease (MBD) from radiology reports represent a critical challenge that can negatively impact patient outcomes. The conventional manual review process is often a significant bottleneck, leading to prolonged diagnostic timelines. We developed and validated SHIELD, an automated AI framework designed to accelerate and improve the accuracy of MBD referrals. We fine-tuned a RadBERT-RoBERTa model on a decade of radiology reports (N = 245 patients) from two academic medical centers to classify reports into three tiers: "No Referral," "Referral," and "Referral/High Risk." To ensure clinical utility and transparency, SHIELD incorporates a Large Language Model to generate natural-language explanations for its classifications. SHIELD demonstrated exceptional performance on a hold-out test set. It achieved 100% accuracy and an Area Under the Curve (AUC) of 1.00 in the primary binary task of distinguishing referral from non-referral cases. In the more granular three-class task, the model achieved an overall accuracy of 89.52%, with near-perfect performance in identifying "No Referral" reports (F1-score: 99.20%). Critically, the model operated in a clinically "fail-safe" manner, never misclassifying a high-risk case as requiring no referral. A retrospective timeline analysis revealed that SHIELD can reduce the referral period from a conventional average of 109.6 days to a computational time of 1–3 minutes. Proposed work provides high accuracy with a sophisticated explainability component using a large language model. Thus, SHIELD framework is a robust, explainable, and autonomous solution for triaging radiology reports. By drastically reducing administrative and diagnostic delays, it has the potential to significantly accelerate the clinical workflow, ensure timely specialist consultation, and ultimately improve the standard of care for patients with suspected MBD.

Article activity feed