The Clinical Genomic Variation Landscape

Wesley A. Goar
Daniel Puthawala
Kori Kuzma
Anastasia Bratulin
Austin A. Antoniou
Jeremy A. Arbesfeld
Lawrence Babb
Kyle Ferriter
Terry O’Neill
James S. Stevenson
Kathryn Perry
Matthew Cannon
Jiachen Liu
Xuelu Liu
Brian Walsh
Savanna Funk
William C. Ray
Bimal P. Chaudhari
Heidi L. Rehm
Alex H. Wagner

Abstract

Interpreting genomic variation requires analysts to collate and process information from disparate genomic evidence resources to discern the contributions of variants to diseases and drug responses. Differences in variant representation across these evidence repositories, including nomenclature (e.g., HGVS, SPDI), reference sequence context (e.g., GRCh37 or GRCh38 genome assemblies), sequence annotation sources (e.g., RefSeq or Ensembl), and aggregate variant concepts (e.g., canonical alleles), collectively make it difficult to determine whether (and how) genomic variants are associated with clinical outcomes. We evaluated these challenges across established genomic knowledge resources, including content from the CIViC, Molecular Oncology Almanac, and ClinVar knowledgebases, as compared against real-world small variant and copy number variant (CNV) data. We used these findings to develop a suite of variant normalization methods to address these challenges. We present our findings, an analysis of the remaining gaps in the representation of variation data, and recommendations for the continued development of genomic knowledge standards to close them.

Version published to 10.1101/2025.11.04.25339115 on medRxiv
Nov 6, 2025

Related articles

Understanding Pathways in Bioinformatics, Genomics, and Health Applications

This article has 1 author:
1. Diptarup Mallick
This article has no evaluations
Latest version Jan 19, 2026

Decoding Complex Genotype-Phenotype Interactions by Discretizing the Genome

This article has 6 authors:
1. Jędrzej Kubica
2. Hetvi Jethwani
3. Krzysztof H. Banecki
4. Mauricio Moldes
5. Dariusz Plewczynski
6. Ben Busby
This article has no evaluations
Latest version Dec 17, 2025

Benchmarking RNA-seq Tools for Real-World Diagnostic Applications

This article has 15 authors:
1. Sarah Silverstein
2. Kaushik Ganapathy
3. Sandra Donkervoort
4. Veronique Bolduc
5. Ying Hu
6. Justin Moy
7. Prech Uapinyoying
8. Svetlana Gorokhova
9. Vijay Ganesh
10. Ben Weisburd
11. Rotem OrBach
12. A. Reghan Foley
13. Pejman Mohammadi
14. David Adams
15. Carsten Bonnemann
This article has no evaluations
Latest version Jan 29, 2026
