GraphMana: graph-native data management for population genomics projects

Ehsan Estaji
Shi-Wei Zhao
Zhao-Yang Chen
Shuai Nie
Jian-Feng Mao

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Population genomics projects rely on fragmented file-based workflows that lose provenance and require full reprocessing when samples are added. Graph-Mana stores variant data in a graph database as packed genotype arrays with pre-computed population statistics, enabling incremental sample addition, provenance tracking, cohort management, and export to 17 formats. Two access paths serve different needs: a FAST PATH reading population-level arrays in O ( K ) time and a FULL PATH unpacking per-sample genotypes in O ( N ) time. On human 1000 Genomes data (3,202 samples, 70.7M variants), Graph-Mana completed a 46-operation lifecycle in 98 minutes from a single persistent database.

Version published to 10.64898/2026.04.11.717925 on bioRxiv
Apr 14, 2026

GraphPop: graph-native computation decouples population genomics complexity from sample count

This article has 5 authors:
1. Ehsan Estaji
2. Shi-Wei Zhao
3. Zhao-Yang Chen
4. Shuai Nie
5. Jian-Feng Mao
This article has no evaluationsLatest version Apr 14, 2026
vartracker: an end-to-end tool for pathogen longitudinal variant analysis and visualisation

This article has 2 authors:
1. Charles S.P. Foster
2. William D. Rawlinson
This article has no evaluationsLatest version May 8, 2026
General, orders-of-magnitude faster whole-genome analysis with genotype representation graphs

This article has 4 authors:
1. Drew DeHaas
2. Chris Adonizio
3. Ziqing Pan
4. Xinzhu Wei
This article has no evaluationsLatest version Apr 11, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

GraphPop: graph-native computation decouples population genomics complexity from sample count

vartracker: an end-to-end tool for pathogen longitudinal variant analysis and visualisation

General, orders-of-magnitude faster whole-genome analysis with genotype representation graphs