VAREANT : a bioinformatics application for gene variant reduction and annotation

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The analysis of high-quality genomic variant data may offer a more complete understanding of the human genome, enabling researchers to identify novel biomarkers, stratify patients based on disease risk factors, and decipher underlying biological pathways. Although the availability of genomic data has sharply increased in recent years, the accessibility of bioinformatic tools to aid in its preparation is still lacking. Limitations with processing genomic data primarily include its large volume, associated computational and storage costs, and difficulty in identifying targeted and relevant information. Here, we present VAREANT , an accessible and configurable bioinformatic tool to support the preparation of variant data into a usable analysis-ready format. Designed to simplify the data pre-processing workflow, VAREANT enables the curation of targeted variant datasets. VAREANT is comprised of three standalone modules: (1) Pre- processing, (2) Variant Annotation, (3) Artificial Intelligence (AI) / Machine Learning (ML) Data Preparation. Pre-processing supports the fine-grained filtering of complex variant datasets to eliminate extraneous data. Variant Annotation allows for the addition of variant metadata from public annotation databases for subsequent analysis and interpretation. AI/ML Data Preparation supports the user in creating AI/ML-ready datasets suitable for immediate analysis with minimal pre-processing required. We have successfully tested and validated our tool on numerous variable-sized datasets and implemented VAREANT in two case studies involving patients with CVDs. Efficiently extracting relevant variants into an AI/ML ready format using tools like VAREANT has important scientific implications, such as producing targeted and high-quality datasets and helping reduce overall computational costs.

Article activity feed