H3AGWAS : A portable workflow for Genome Wide Association Studies
Abstract
Background
Genome-wide association studies (GWAS) are a powerful method to detect associations between variants and phenotypes. A GWAS requires several complex computations with large datasets, and many steps may need to be repeated with varying parameter. Manual running of these analyses can be tedious, error-prone and hard to reproduce.
Results
The H3AGWAS workflow from the Pan-African Bioinformatics Network for H3Africa is a powerful, scalable and portable workflow implementing pre-association analysis, implementation of various association testing methods and postassociation analysis of results.
Conclusions
The workflow is scalable — laptop to cluster to cloud (e.g., SLURM, AWS Batch, Azure). All required software is containerised and can run under Docker on Singularity.