A Proof-of-Concept Large Language Model Application to Support Clinical Trial Screening in Surgical Oncology

Samantha M. Lai
Alysala M. Malik
Tejas S. Sathe
Caitlin J. Silvestri
Gulam A. Manji
Michael D. Kluger

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Introduction

Clinical trials advance the forefront of medical knowledge and rely on consistent patient accrual for success. However, patient screening for clinical trials is resource intensive. There is a need to increase the scalability of trial recruitment while maintaining or improving upon the sensitivity of the current process. We hypothesized we could use a state-of-the-art large language model (LLM), prompt engineering, and publicly available clinical trial data to predict patient eligibility for trials from clinic notes. Here, we present pilot data demonstrating the accuracy of this tool in a cohort of patients being evaluated for pancreas cancer treatment.

Methods

Patients who were screened for clinical trials at a single institution were studied. An LLM application was developed using LangChain and the GPT-4o model to assist in clinical trial screening. Deidentified patient data from clinical notes and trial eligibility criteria from ClinicalTrials.gov were used as inputs. For each patient, the model determined inclusion or exclusion with respect to selected eligibility criteria as well as nine clinical trials. Model responses were graded programmatically against a human rater standard. Time elapsed and cost for running each analysis were recorded.

Results

Of the 24 patients in the test set, 19 were eligible for at least one trial. There were 43 eligible patient-trial matches in the data set. Our model correctly predicted 39 out of 42 (90.7%) of these matches. There were 105 individual eligibility criteria evaluated per patient for a total of 2520 binary criteria. GPT-4o agreed with the raters for 2,438 out of 2,520 (96.7%) binary eligibility criteria. Sensitivity to overall trial eligibility ranged from 87.5% to 100% for 8 out of 9 trials. Specificity ranged from 73.3% to 100% over all nine trials. The median cost for screening a patient was 0.67 USD (0.63-0.74). Median time elapsed was 137.66 seconds (130.04-146.04). Median total token usage across three assistants was 112,266.5 tokens (102,982.0-122,174.2).

Conclusion

Overall, this model showed high sensitivity and specificity in using minimally processed free-text clinical notes to screen patients for appropriate clinical trials using a fraction of the time and cost of existing screening mechanisms. Results showed promise with a small cohort, and future studies are needed to assess its accuracy with a larger sample of patients and trials. This study represents the frontier of pitting of emerging large language model technology against the historically unruly terrain of the electronic medical record, suggesting that the imperfection of free-text clinical notes only slightly hinders the performance of a general-use model compared to previous performance on preprocessed data. These findings highlight that using this tool directly on clinical notes could complement human screening efforts to improve patient accrual at a low time and monetary cost.

Version published to 10.1101/2024.09.20.24314053v2 on medRxiv
Oct 4, 2024
Version published to 10.1101/2024.09.20.24314053v1 on medRxiv
Sep 23, 2024

Enhancing Biomarker-Based Oncology Trial Matching Using Large Language Models

This article has 4 authors:
1. Nour Al Khoury
2. Maqsood Shaik
3. Ricardo Wurmus
4. Altuna Akalin
This article has no evaluationsLatest version Sep 19, 2024
Predicting Individual Responses in Phase I Oncology Trials Using Routinely Collected Clinical Biomarkers

This article has 13 authors:
1. Nivedita Bhadra
2. Marley Boyd
3. Sandra Smith
4. Janet Espirito
5. Jeffrey Trent
6. Christine Powell
7. Kati Koktavy
8. Nicholas Robert
9. Jennifer Frytak
10. Laura H. Goetz
11. Sunil Sharma
12. Daniel D. Von Hoff
13. Nicholas J. Schork
This article has no evaluationsLatest version Oct 16, 2024
From a genomic risk model to clinical trial implementation in a learning health system: the ProGRESS Study

This article has 65 authors:
1. Jason L Vassy
2. Anna M Dornisch
3. Roshan Karunamuni
4. Michael Gatzen
5. Christopher J Kachulis
6. Niall J Lennon
7. Charles A Brunette
8. Morgan E Danowski
9. Richard L Hauger
10. Isla P Garraway
11. Adam S Kibel
12. Kyung Min Lee
13. Julie A Lynch
14. Kara N Maxwell
15. Brent S Rose
16. Craig C Teerlink
17. George J Xu
18. Sean E Hofherr
19. Katherine A Lafferty
20. Katie Larkin
21. Edyta Malolepsza
22. Candace J Patterson
23. Diana M Toledo
24. Jenny L Donovan
25. Freddie Hamdy
26. Richard M Martin
27. David E Neal
28. Emma L Turner
29. Ole A Andreassen
30. Anders M Dale
31. Ian G Mills
32. Jyotsna Batra
33. Judith Clements
34. Olivier Cussenot
35. Cezary Cybulski
36. Rosalind A Eeles
37. Jay H Fowke
38. Eli Marie Grindedal
39. Robert J Hamilton
40. Jasmine Lim
41. Yong-Jie Lu
42. Robert J MacInnis
43. Christiane Maier
44. Lorelei A Mucci
45. Luc Multigner
46. Susan L Neuhausen
47. Sune F Nielsen
48. Marie-Élise Parent
49. Jong Y Park
50. Gyorgy Petrovics
51. Anna Plym
52. Azad Razack
53. Barry S Rosenstein
54. Johanna Schleutker
55. Karina Dalsgaard Sørensen
56. Ruth C Travis
57. Ana Vega
58. Catharine M L West
59. Fredrik Wiklund
60. Wei Zheng
61. Profile Steering Committee
62. IMPACT Study Steering Committee and Collaborators
63. PRACTICAL Consortium
64. Million Veteran Program
65. Tyler M Seibert
This article has no evaluationsLatest version Nov 4, 2024

Listed in

Abstract

Introduction

Methods

Results

Conclusion

Article activity feed

Related articles

Enhancing Biomarker-Based Oncology Trial Matching Using Large Language Models

Predicting Individual Responses in Phase I Oncology Trials Using Routinely Collected Clinical Biomarkers

From a genomic risk model to clinical trial implementation in a learning health system: the ProGRESS Study