Replicating a High-Impact Scientific Publication Using Systems of Large Language Models

Abstract

Publications focused on scientific discoveries derived from analyzing large biological datasets typically follow the cycle of hypothesis generation, experimentation, and data interpretation. Reproducing the findings of such papers is crucial for confirming the validity of the scientific, statistical, and computational methods employed, and it provides a foundation for new research. Using a multi-agent system composed of Large Language Models (LLMs), including both text- and code-generation agents built on OpenAI’s platform, our study attempts to reproduce the methodology and findings of a high-impact publication that investigated the expression of viral-entry-associated genes using single-cell RNA sequencing (scRNA-seq). The LLM system was critically evaluated against the analysis results of the original study, highlighting its ability to perform simple statistical analysis tasks and literature reviews that establish the purpose of the analyses. However, we also identified significant challenges, including nondeterminism in code generation, difficulties in data procurement, limitations imposed by context length, and bias inherited from the models’ training data. By addressing these challenges and expanding the system’s capabilities, we intend to contribute to the goal of automating scientific research for efficiency, reproducibility, and transparency, and to drive the discussion on the role of AI in scientific discovery.
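The text-agent/code-agent division of labor described in the abstract can be sketched as a simple orchestration loop. This is an illustrative outline only, not the authors' implementation: both agents are stubbed with plain functions (in the real system each would be an LLM call, e.g. via OpenAI's API), and all names, steps, and goals below are hypothetical.

```python
# Hypothetical sketch of a two-agent pipeline: a "text" agent plans analysis
# steps and a "code" agent generates code for each step. Both agents are
# stubbed here so the control flow runs offline; a real system would replace
# each stub body with an LLM API call.
from dataclasses import dataclass


@dataclass
class Step:
    description: str  # what the text agent planned
    code: str         # what the code agent produced for that plan


def text_agent(goal: str) -> list[str]:
    # Stub planner: in the real system, an LLM decomposes the research
    # goal into ordered analysis steps.
    return [
        f"load scRNA-seq count matrix relevant to: {goal}",
        f"run differential-expression statistics for: {goal}",
    ]


def code_agent(step: str) -> str:
    # Stub coder: in the real system, an LLM emits executable analysis
    # code for the given step (a source of the nondeterminism noted above).
    return f"# generated code implementing: {step}"


def run_pipeline(goal: str) -> list[Step]:
    # Orchestrator: plan with the text agent, then hand each step to the
    # code agent, collecting (plan, code) pairs for review or execution.
    return [Step(s, code_agent(s)) for s in text_agent(goal)]


steps = run_pipeline("viral-entry-associated genes")
```

In this shape, the orchestrator is the natural place to add the reproducibility checks the abstract calls for, such as comparing each step's output against the original study's reported results before proceeding.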