Can Large Language Models “Read” Biological Sequences? A Systematic Evaluation of In-Context Learning for Antibody Characterization
Abstract
Large language models (LLMs) can learn new tasks through in-context learning (ICL), but it is unknown whether this ability reliably transfers to biological sequence classification. Here, we systematically evaluate how demonstration selection, shot count, and prompting strategy affect performance across 20 general-purpose LLMs. Using antibody characterization as a representative test case, we compare zero-shot, few-shot, and chain-of-thought (CoT) ICL on three classification tasks: humanness, antigen specificity, and isotype. Our results reveal a clear performance hierarchy: while zero-shot prompting performs near chance, few-shot prompting with randomly selected demonstrations improves performance, showing that LLMs can perform ICL on biological sequences with minimal supervision. However, LLMs match the accuracy of protein language model (pLM)-based classifiers only when given label-diverse demonstrations drawn from antibodies similar to the query sequence. To leverage this insight, we introduce Sim-ICL, a framework that automatically retrieves such demonstrations. Using only 32-shot prompting, Sim-ICL achieves performance competitive with pLM-based classifiers, matching or outperforming them on two of the three tasks. Furthermore, reasoning-oriented prompts yield marginal gains and often produce fluent but biologically incorrect rationales, suggesting that current CoT explanations function as after-the-fact rationalizations rather than capturing mechanistic determinants of antibody properties. From these experiments, we derive practical design principles for ICL on biological sequences: use similarity-based, label-diverse demonstrations and modest shot counts, and treat reasoning prompts primarily as post hoc narratives rather than drivers of performance. Sim-ICL implements these principles in a streamlined, prompt-based framework for antibody sequence classification and, in principle, could be adapted to other biological sequence tasks.
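The retrieval strategy behind Sim-ICL can be illustrated with a minimal sketch. The abstract specifies only the core idea: select demonstrations that are similar to the query sequence while remaining diverse in their labels, then assemble them into a few-shot prompt. The similarity metric below (Jaccard overlap of 3-mers), the per-label balancing scheme, and all function names are illustrative assumptions, not the paper's actual implementation.

```python
def kmer_set(seq, k=3):
    """All overlapping k-mers of an amino-acid sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def similarity(a, b, k=3):
    # Jaccard similarity over k-mers -- a simple alignment-free stand-in;
    # the paper's actual similarity measure is not given in the abstract.
    sa, sb = kmer_set(a, k), kmer_set(b, k)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0

def select_demonstrations(query, pool, shots=32):
    """Pick the `shots` most query-similar examples, balanced across labels.

    `pool` is a list of (sequence, label) pairs.
    """
    by_label = {}
    for seq, label in pool:
        by_label.setdefault(label, []).append(seq)
    per_label = max(1, shots // len(by_label))  # enforce label diversity
    demos = []
    for label, seqs in by_label.items():
        ranked = sorted(seqs, key=lambda s: similarity(query, s), reverse=True)
        demos.extend((s, label) for s in ranked[:per_label])
    return demos[:shots]

def build_prompt(query, demos):
    """Format retrieved demonstrations as a few-shot classification prompt."""
    lines = ["Classify the antibody sequence.\n"]
    for seq, label in demos:
        lines.append(f"Sequence: {seq}\nLabel: {label}\n")
    lines.append(f"Sequence: {query}\nLabel:")
    return "\n".join(lines)
```

Under this reading, the only task-specific choices are the similarity function and the labeled pool; the same retrieval-then-prompt loop would apply to the humanness, antigen-specificity, and isotype tasks alike.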