Enhancing Biomarker-Based Oncology Trial Matching Using Large Language Models

Nour Al Khoury
Maqsood Shaik
Ricardo Wurmus
Altuna Akalin

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Clinical trials are an essential component of drug development for new cancer treatments, yet the information required to determine a patient’s eligibility for enrollment is scattered in large amounts of unstructured text. Genomic biomarkers are especially important in precision medicine and targeted therapies, making them essential for matching patients to appropriate trials. Large language models (LLMs) offer a promising solution for extracting this information from clinical trial data, aiding both physicians and patients in identifying suitable matches. In this study, we explore various LLM strategies for extracting genetic biomarkers from oncology trials to improve patient enrollment rates. Our results show that open-source language models, when applied out-of-the-box, effectively capture complex logical expressions and structure genomic biomarkers in disjunctive normal form, outperforming closed-source models such as GPT-4 and GPT-3.5-Turbo. Furthermore, fine-tuning these open-source models with additional data significantly enhances their performance.

Version published to 10.1101/2024.09.13.612922 on bioRxiv
Sep 19, 2024

PRESSnet: a novel framework for patient stratification and biomarker discovery using clinical knowledge graphs

This article has 11 authors:
1. Jake Cohen-Setton
2. Shruti Shikhare
3. Ioannis Kagiampakis
4. Domingo Salazar
5. Miguel Goncalves
6. Elizabeth Coker
7. Sanddhya Jayabalan
8. Damian Bikiel
9. Ben Sidders
10. Etai Jacob
11. Krishna Bulusu
This article has no evaluationsLatest version Dec 15, 2025
Assessing the Impact of Comprehensive Genomic Profiling on Therapeutic Selection for Advanced Solid Tumors in Portugal

This article has 23 authors:
1. Nuno Tavares
2. Pedro Simões
3. Raquel Lopes-Brás
4. Teresa R. Pacheco
5. Sara Damaso
6. Andre Mansinho
7. Leonor Abreu Ribeiro
8. Gonçalo Nogueira-Costa
9. Catarina Abreu
10. Tiago Barroso
11. Nuno Bonito
12. Rita Figueiró
13. Bogdana Darmits
14. Sara Loureiro Melo
15. Tania Rodrigues
16. Helena Guedes
17. Edgar Pratas
18. Diogo Alpuim Costa
19. Frederico Ferreira Filipe
20. Daniela Macedo
21. Ana Cavaco
22. Marina Pavanello
23. Luis Costa
This article has no evaluationsLatest version Jan 23, 2026
12-Gene Signature for Prediction of Chemotherapy Response in Gastric Cancer

This article has 2 authors:
1. Nhan Tran
2. Minh Nam Nguyen
This article has no evaluationsLatest version Jan 16, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

PRESSnet: a novel framework for patient stratification and biomarker discovery using clinical knowledge graphs

Assessing the Impact of Comprehensive Genomic Profiling on Therapeutic Selection for Advanced Solid Tumors in Portugal

12-Gene Signature for Prediction of Chemotherapy Response in Gastric Cancer