Geranium: Multimodal Retrieval of Genomics Data Visualizations

Huyen N. Nguyen
Sehi L'Yi
Thomas Chris Smits
Shanghua Gao
Marinka Zitnik
Nils Gehlenborg

Abstract

Effective visualization is essential for interpreting genomics data, yet researchers often face challenges in finding relevant, reusable examples. Existing tools offer limited support for searching the vast landscape of genomics visualizations, making the process of authoring new visualizations time-consuming and inefficient. To address this gap, we introduce Geranium, a data visualization retrieval system for searching and authoring genomics visualizations. Geranium supports multimodal retrieval, enabling users to query with images, text, or grammar-based specifications. Retrieved examples serve as scaffolds for authoring, providing templates that researchers can adapt with their own data, thereby streamlining the mechanics of visualization construction. Geranium integrates three embedding methods to combine specialized and general knowledge: grammar-based embeddings tailored to genomics visualizations, multimodal embeddings from a biomedical vision-language foundation model, and text embeddings from a fine-tuned large language model. For each visualization, we construct a multimodal representation that includes a Gosling specification, a pixel-based rendering, and natural language descriptions. We evaluate embedding strategies to maximize top-k retrieval accuracy and conduct user studies with domain collaborators to gather feedback on usability. Our collection comprises 3,200 visualizations across 50 categories, ranging from single-view to coordinated multi-view designs and supporting applications from single-cell epigenomics to structural variation analysis.
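The retrieval and evaluation steps described in the abstract can be illustrated with a minimal sketch. It is not Geranium's implementation. The record layout, the function names, the modality keys (`"text"`, `"grammar"`), and the use of cosine similarity are all assumptions made for illustration, and the three-dimensional toy vectors stand in for real embeddings.

```python
import math
from dataclasses import dataclass


@dataclass
class VisRecord:
    """One collection entry. Field names are hypothetical, not Geranium's schema."""
    vis_id: str
    category: str
    embeddings: dict  # modality name -> embedding vector (list of floats)


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve_top_k(query_vec, modality, collection, k):
    """Return the k records most similar to the query in one embedding space."""
    scored = [
        (cosine(query_vec, rec.embeddings[modality]), rec)
        for rec in collection
        if modality in rec.embeddings
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [rec for _, rec in scored[:k]]


def top_k_accuracy(queries, modality, collection, k):
    """Fraction of (query_vec, expected_id) pairs whose expected id is in the top k."""
    if not queries:
        return 0.0
    hits = sum(
        1
        for vec, expected_id in queries
        if any(r.vis_id == expected_id
               for r in retrieve_top_k(vec, modality, collection, k))
    )
    return hits / len(queries)


# Toy collection: ids, categories, and vectors are made up for this example.
collection = [
    VisRecord("circos-01", "circular", {"text": [1.0, 0.0, 0.0], "grammar": [0.9, 0.1, 0.0]}),
    VisRecord("heatmap-01", "matrix", {"text": [0.0, 1.0, 0.0], "grammar": [0.0, 1.0, 0.1]}),
    VisRecord("ideogram-01", "linear", {"text": [0.0, 0.0, 1.0], "grammar": [0.1, 0.0, 0.9]}),
]

# Each query pairs an embedded query vector with the id it should retrieve.
queries = [
    ([0.8, 0.2, 0.0], "circos-01"),
    ([0.0, 0.3, 0.9], "heatmap-01"),
]

best = retrieve_top_k([0.8, 0.2, 0.0], "text", collection, k=1)
acc_at_1 = top_k_accuracy(queries, "text", collection, k=1)
acc_at_2 = top_k_accuracy(queries, "text", collection, k=2)
```

In this sketch each modality is searched in its own embedding space. How Geranium actually combines its three embedding methods is not specified in the abstract, so no fusion step is shown.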

Version published to 10.31219/osf.io/zatw9_v6 on OSF Preprints
Dec 27, 2025
Version published to 10.31219/osf.io/zatw9_v5 on OSF Preprints
Sep 26, 2025
Version published to 10.31219/osf.io/zatw9_v4 on OSF Preprints
Sep 21, 2025
Version published to 10.31219/osf.io/zatw9_v3 on OSF Preprints
Sep 21, 2025
Version published to 10.31219/osf.io/zatw9_v2 on OSF Preprints
Aug 13, 2025
Version published to 10.31219/osf.io/zatw9_v1 on OSF Preprints
Apr 18, 2025

Related articles

DQVis Dataset: Natural Language to Biomedical Visualization

This article has 5 authors:
1. Devin Lange
2. Pengwei Sui
3. Shanghua Gao
4. Marinka Zitnik
5. Nils Gehlenborg
This article has no evaluations. Latest version: Dec 15, 2025

LLMAgent4Bio: LLM Agents for Biological Intelligence Across Genomics, Proteomics, Spatial Biology, and Biomedicine

This article has 9 authors:
1. Sajib Acharjee Dip
2. Dipanwita Mallick
3. Uddip Acharjee Shuvo
4. Shovito Barua Soummo
5. Fazle Rafsani
6. Bikash Kumar Paul
7. Nazifa Ahmed Moumi
8. Shafayat Ahmed
9. Liqing Zhang
This article has no evaluations. Latest version: Dec 16, 2025

Integrating Microbiome Data Visualization into FAIRDatabase using Edge Functions

This article has 3 authors:
1. Roman van Eldijk
2. Shivam Kumar
3. Vivek Sheraton M
This article has no evaluations. Latest version: Jan 27, 2026
