sciLaMA: A Single-Cell Representation Learning Framework to Leverage Prior Knowledge from Large Language Models

Hongru Hu
Shuwen Zhang
Yongin Choi
Venkat S. Malladi
Gerald Quon

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Single-cell RNA sequencing (scRNA-seq) enables high-resolution exploration of cellular diversity and gene regulation, yet analyzing such data remains challenging due to technical and methodological limitations. Existing task-specific deep generative models like Variational Auto-Encoder (VAE) and its variants struggle to incorporate external biological knowledge, while transformer-based foundational large Language Models (LLMs or large LaMs) face limitations in computational cost and applicability to tabular gene expression data. Here, we introduce sciLaMA (single-cell interpretable Language Model Adapter), a novel representation learning framework that bridges these gaps by integrating static gene embeddings from multimodal LaMs with scRNA-seq tabular data through a paired-VAE architecture. Our approach generates context-aware representations for both cells and genes and outperforms state-of-the-art methods in key single-cell downstream tasks, including batch effect correction, cell clustering, and cell-state-specific gene marker and module identification, while maintaining computational efficiency. sciLaMA offers a computationally efficient, unified framework for comprehensive single-cell data analysis and biologically interpretable gene module discovery.

Version published to 10.1101/2025.01.28.635153v1 on bioRxiv
Feb 3, 2025

TabVI: Leveraging Lightweight Transformer Architectures to Learn Biologically Meaningful Cellular Representations

This article has 6 authors:
1. Aditi Chandrashekar
2. Rohan Gala
3. Andreas Tjärnberg
4. Saniya Khullar
5. Grace Huynh
6. Mariano Gabitto
This article has no evaluationsLatest version Feb 17, 2025
scGraphETM: Graph-Based Deep Learning Approach for Unraveling Cell Type-Specific Gene Regulatory Networks from Single-Cell Multi-Omics Data

This article has 5 authors:
1. Wenqi Dong
2. Manqi Zhou
3. Boyu Han
4. Fei Wang
5. Yue Li
This article has no evaluationsLatest version Jan 27, 2025
RegFormer: A Single-Cell Foundation Model Powered by Gene Regulatory Hierarchies

This article has 15 authors:
1. Luni Hu
2. Ping Qiu
3. Hua Qin
4. Lei Cao
5. Wenjian Jiang
6. Boyu Feng
7. Yilin Zhang
8. Qianqian Chen
9. Yanbang Shang
10. Tianyi Xia
11. Ziqing Deng
12. Xun Xu
13. Shuangsang Fang
14. Yuxiang Li
15. Yong Zhang
This article has no evaluationsLatest version Feb 11, 2025

Listed in

Abstract

Article activity feed

Related articles

TabVI: Leveraging Lightweight Transformer Architectures to Learn Biologically Meaningful Cellular Representations

scGraphETM: Graph-Based Deep Learning Approach for Unraveling Cell Type-Specific Gene Regulatory Networks from Single-Cell Multi-Omics Data

RegFormer: A Single-Cell Foundation Model Powered by Gene Regulatory Hierarchies