Large Language Models for Mining Biobank-Derived Insights into Health and Disease

Manuel Corpas
Alfredo Iacoangeli

Abstract

Large Language Models (LLMs) offer transformative potential for analysing biobank-derived datasets, facilitating knowledge extraction, patient stratification, and predictive modelling. This study benchmarks multiple LLMs in retrieving biomedical insights from a leading biobank, the UK Biobank. UK Biobank-related literature is used as the gold standard for assessing the coverage and retrieval of some of the best-known LLMs, including GPT, Claude, Gemini, Mistral, Llama and DeepSeek. The findings highlight each model’s strengths and limitations, emphasising challenges in data heterogeneity and accessibility. We suggest that future research should take advantage of the power of LLMs for enhanced precision in biobank knowledge extraction.

Version published to 10.21203/rs.3.rs-6098960/v1 on Research Square
Mar 10, 2025

Related articles

Artificial Intelligence in Biomedical Data Analysis: A Comparative Assessment of Large Language Models for Automated Clinical Trial Interpretation and Statistical Evaluation

This article has 6 authors:
1. Ravikumar Komandur
2. Jon McDunn
3. Nikita Nair
4. Babacar Fall
5. Adam P. Dicker
6. Sean Khozin
This article has no evaluations. Latest version: Feb 14, 2025

Improving the Robustness of Large Language Models in Extracting Social Determinants of Health

This article has 2 authors:
1. Jiashu Chen
2. Chase Simmons
This article has no evaluations. Latest version: Mar 24, 2025

Large language models improve transferability of electronic health record-based predictions across countries and coding systems

This article has 6 authors:
1. Matthias Kirchler
2. Matteo Ferro
3. Veronica Lorenzini
4. FinnGen
5. Christoph Lippert
6. Andrea Ganna
This article has no evaluations. Latest version: Feb 4, 2025
