IDEAL-GENOM: Integrated Downstream Analytical Pipeline for Genomic Analysis
Abstract
Background: Since the release of the first human genome draft, there has been an exponential increase in the development of technologies offering unprecedented opportunities to unravel the genetic underpinnings of rare and complex diseases. Despite this progress, challenges persist, including the need for efficient computational tools to analyse large genomic datasets. The availability of clinical information, together with cost-effective arrays that can scan hundreds of thousands of patients and healthy subjects to identify underlying risk variants, necessitates tools that can streamline and automate the processing of millions of genetic markers.

Results: To address this challenge, we present IDEAL-GENOM (Integrated Downstream Analytical Pipeline for Genomic Analysis), a Python-based framework designed to streamline the analytical workflow commonly implemented in genome-wide association studies (GWAS). IDEAL-GENOM integrates widely used tools such as PLINK and GCTA with custom-developed functionalities, and enables reproducible results through parameter sharing. Initial testing indicates that the framework reduces quality control processing time by 10% while requiring minimal user intervention. Additionally, IDEAL-GENOM supports both local and cloud-based environments, enhancing its accessibility and scalability.

Conclusions: The IDEAL-GENOM pipeline offers a simplified GWAS workflow that allows both beginners and advanced users to leverage its built-in functionalities to perform GWAS analyses of complex diseases.
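
The idea of reproducibility through parameter sharing mentioned in the Results can be illustrated with a minimal Python sketch. This is not IDEAL-GENOM's actual interface: the function names, the parameter dictionary, and the threshold values below are all hypothetical and chosen only for illustration. The sketch assumes that quality control thresholds are kept in a single serialisable dictionary and turned into a PLINK command line using the standard PLINK filters `--maf`, `--geno`, `--mind`, and `--hwe`. The command is only built and printed, not executed, so PLINK does not need to be installed.

```python
import json
import shlex

# Hypothetical shared QC parameters. The thresholds are illustrative only
# and are not taken from IDEAL-GENOM.
DEFAULT_QC_PARAMS = {"maf": 0.01, "geno": 0.1, "mind": 0.2, "hwe": 5e-6}


def build_plink_qc_command(input_prefix, output_prefix, params):
    """Return a PLINK argument list that applies the given QC filters.

    Each key in `params` is assumed to be the name of a PLINK filter flag
    (without the leading dashes) that takes a single numeric threshold.
    """
    cmd = ["plink", "--bfile", input_prefix]
    # Sort the keys so the same parameter set always yields the same command.
    for flag in sorted(params):
        cmd += [f"--{flag}", str(params[flag])]
    cmd += ["--make-bed", "--out", output_prefix]
    return cmd


def dump_params(params):
    """Serialise the parameters deterministically so they can be shared."""
    return json.dumps(params, sort_keys=True, indent=2)


if __name__ == "__main__":
    # Round-trip through JSON to mimic loading a parameter file shared by a
    # collaborator, then build the identical command from it.
    shared = json.loads(dump_params(DEFAULT_QC_PARAMS))
    command = build_plink_qc_command("study", "study_qc", shared)
    print(shlex.join(command))
```

Under these assumptions, two analysts who exchange the JSON text produced by `dump_params` would construct the same PLINK invocation. To actually run it, the returned list could be passed to `subprocess.run`.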