What can reader studies of radiologist use of AI models teach us about adaptation? A signal detection modelling exploration

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

A significant component in regulatory approval of diagnostic artificial intelligence (AI) models is the use of reader study evidence, diagnostic performance studies of radiologists on a curated set of images. Retrospective MRMC (multi-reader, multi-case) reader studies of AI use are commonly conducted in the field of medical AI, though they tend to overestimate AI performance and underestimate human performance through the use of aggregated metrics. Further, there has been limited investigation into the impact of diagnostic AI models on radiological decision-making, with few if any cognitive modelling studies undertaken. This has serious implications, as the evidence base needed for implementation of models is lacking verified cognitive research. As such, this paper explores a lost opportunity to model diagnostic reader studies to ask: what can reader studies teach us about radiologist adaptation to AI models?In this study, we used a 2021 reader study of a commercially available Australian diagnostic system by Harrison.ai, consisting of 20 expert radiologists reading 1163 cases. We fit hierarchical signal detection models at the individual and pathology level to a reader study of 20 expert radiologists over 1163 cases and 127 pathologies. Our modelling results indicated that radiologists at the pathology and individual level had increased discriminability with AI, though with a more liberal response bias, leading to higher false positive rates. Further exploratory analysis based on these results indicated that disease coverage rates became homogenised with AI, as well as different AI-generated segmentations potentially influencing correct rejection rates. We provide real-world interpretations of these findings.

Article activity feed