Lense: Optimizing data preprocessing in single-cell omics using LLMs

Jingyun Liu
Zhicheng Ji

Abstract

Data preprocessing is critical for single-cell omics analyses, but default pipelines often underperform on diverse datasets, especially from emerging platforms like spatial transcriptomics. We introduce Lense, a language-model-guided method that automatically selects optimal preprocessing by comparing plots that visualize low-dimensional representations across pipeline variants. Integrated with Seurat, Lense streamlines analysis and improves preprocessing robustness without requiring manual tuning.
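The abstract describes the selection idea only at a high level, so the following is a minimal sketch of how such a loop could be organized, not the authors' implementation. It expands a grid of preprocessing settings into pipeline variants, renders one plot per variant, and keeps whichever plot a judge prefers. Everything named here is an assumption for illustration: the grid keys and values, the `render_plot` callable (which would run a pipeline and plot its low-dimensional representation), the `judge` callable (a stand-in for the language model), and the sequential pairwise comparison scheme itself.

```python
from itertools import product
from typing import Any, Callable, Dict, List

Variant = Dict[str, Any]

# Hypothetical parameter grid; the names and values are illustrative only
# and are not taken from the preprint.
GRID: Dict[str, List[Any]] = {
    "normalization": ["log", "sctransform"],
    "n_variable_features": [2000, 3000],
    "n_pcs": [10, 30],
}


def enumerate_variants(grid: Dict[str, List[Any]]) -> List[Variant]:
    """Expand a parameter grid into every combination of settings."""
    keys = list(grid)
    return [dict(zip(keys, values)) for values in product(*(grid[k] for k in keys))]


def select_variant(
    variants: List[Variant],
    render_plot: Callable[[Variant], Any],
    judge: Callable[[Any, Any], int],
) -> Variant:
    """Choose one variant by sequential pairwise comparison of plots.

    render_plot: hypothetical callable that runs a variant and returns its
        plot (for example, a path to an image of the embedding).
    judge: hypothetical stand-in for the language model; returns 0 if the
        first plot is preferred and 1 if the second is preferred.
    """
    if not variants:
        raise ValueError("need at least one variant")
    best = variants[0]
    best_plot = render_plot(best)
    for challenger in variants[1:]:
        challenger_plot = render_plot(challenger)
        # The current best is replaced only when the judge prefers the challenger.
        if judge(best_plot, challenger_plot) == 1:
            best, best_plot = challenger, challenger_plot
    return best
```

In practice `render_plot` would call the analysis toolkit and `judge` would send both images to a model. Because both are passed in as callables, the selection loop can be exercised with simple stubs before any model is involved.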

Biographical Note

Jingyun Liu is a Master’s student in the Department of Biostatistics and Bioinformatics at Duke University. Dr. Zhicheng Ji is a tenure-track Assistant Professor in the Department of Biostatistics and Bioinformatics at Duke University. His research focuses on artificial intelligence and statistical modeling for single-cell genomics, spatial genomics, and biomedical imaging.

Version published to 10.64898/2026.05.07.723465 on bioRxiv
May 11, 2026

Related articles

Unlocking Multi-Sample Differential Expression for Spatial Transcriptomics Data with TESSERA

This article has 4 authors:
1. Florica Constantine
2. Zoltan Laszik
3. Sandrine Dudoit
4. Elizabeth Purdom
This article has no evaluations. Latest version: Apr 30, 2026

H2O: A Foundation Model Bridging Histopathology to Spatial Multi-Omics Profiling

This article has 12 authors:
1. Yunjie Gu
2. Zihan Wu
3. Rui Yan
4. Zhikang Wang
5. Yuan Li
6. Senlin Lin
7. Yan Cui
8. Haoran Lai
9. Xin Luo
10. Shaohua Kevin Zhou
11. Zhiyuan Yuan
12. Jianhua Yao
This article has no evaluations. Latest version: Apr 24, 2026

A Context-Aware Single-Cell Proteomics Analysis pipeline

This article has 8 authors:
1. Carla Salomó Coll
2. Agata N Makar
3. Alejandro J Brenes
4. Joseph Inns
5. Matthias Trost
6. Neil Rajan
7. Simon Wilkinson
8. Alex von Kriegsheim
This article has no evaluations. Latest version: Apr 7, 2026