Generating Bounded Linear Temporal Logic in Systems Biology with Large Language Models

Abstract

In computational modeling, Bounded Linear Temporal Logic (BLTL) is a valuable formalism for describing and verifying the temporal behavior of biological systems. However, translating natural language (NL) descriptions of system behaviors into accurate BLTL properties remains a labor-intensive task that requires deep expertise in both logic syntax and semantic translation. With the advent of large language models (LLMs), automating this translation has become a promising direction. In this work, we propose an accurate and flexible NL-BLTL transformation framework based on transfer learning. Our approach consists of three stages: (1) synthetic data generation, in which we construct a large-scale NL-BLTL dataset; (2) pre-training, in which we fine-tune LLMs on the synthetic dataset to strengthen their ability to characterize logical structure and BLTL specifications; and (3) fine-tuning, in which we adapt the pre-trained models to a naïve T-cell dataset with manual NL-BLTL annotations. We evaluate the fine-tuned models on the naïve T-cell test set and further assess their generalizability, using comprehensive metrics, on an unseen NL-BLTL dataset drawn from a pancreatic cancer environment. Experimental results show that models pre-trained on the synthetic data and fine-tuned on real-world annotations outperform both out-of-the-box LLMs, such as GPT-4, and models trained directly on the naïve T-cell dataset without pre-training, demonstrating the effectiveness of our framework.
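Stage (1) above can be sketched with a small template-based generator that pairs natural-language sentences with BLTL formulas. This is only an illustrative assumption about how such a synthetic dataset might be produced, not the paper's actual pipeline; the templates, species names (IL2, TCR, Foxp3), and formula syntax are hypothetical placeholders.

```python
import random

# Hypothetical sketch of synthetic NL-BLTL pair generation.
# Templates, variable names, and the BLTL surface syntax are
# illustrative assumptions, not taken from the paper.

TEMPLATES = [
    # (natural-language template, BLTL template)
    ("{var} eventually exceeds {thr} within {k} steps",
     "F<={k} ({var} > {thr})"),      # bounded "eventually"
    ("{var} stays below {thr} for the next {k} steps",
     "G<={k} ({var} < {thr})"),      # bounded "always"
]

def synthesize_pair(rng):
    """Return one (NL sentence, BLTL formula) training pair."""
    nl_tmpl, bltl_tmpl = rng.choice(TEMPLATES)
    fields = {
        "var": rng.choice(["IL2", "TCR", "Foxp3"]),  # example species
        "thr": rng.choice([0.2, 0.5, 0.8]),          # example thresholds
        "k": rng.randint(1, 20),                     # time bound
    }
    return nl_tmpl.format(**fields), bltl_tmpl.format(**fields)

def build_dataset(n, seed=0):
    """Generate n reproducible NL-BLTL pairs for pre-training."""
    rng = random.Random(seed)
    return [synthesize_pair(rng) for _ in range(n)]
```

In practice, the NL side of each pair would likely also be paraphrased (e.g., by an LLM) so the pre-training data covers varied phrasings of the same temporal property.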