A Nextflow pipeline for molecular quantitative trait loci mapping in small sample size datasets with an application in Atlantic salmon
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background
Molecular quantitative trait loci (molQTL) mapping, particularly for gene expression and chromatin accessibility, provides crucial insights into the regulatory and functional potential of genetic variation. While significant progress has been made in humans and model organisms, aquatic genomics remains underexplored due to the large sample sizes typically required for statistical power.
Results
In this work, we enhance the scalability, reproducibility, and accessibility of the well-established RASQUAL method, which efficiently detects molQTLs in small datasets, by leveraging the Nextflow workflow framework. This adaptation, named nf-RASQUAL, supports fully automated QTL mapping and incorporates a robust, comprehensive multiple-testing correction process. We apply the pipeline to a comprehensive multi-omics dataset from Atlantic salmon, identifying numerous significant expression and chromatin accessibility QTLs across multiple tissues. Our analysis reveals that a large proportion of lead variants for these loci reside in non-coding regions, with caQTL lead SNPs more likely to disrupt transcription factor motifs. Additionally, the enriched colocalization of eQTL and caQTL lead SNPs in brain, liver, and gonad tissues suggests shared regulatory mechanisms.
Conclusions
These findings highlight the scalability and utility of nf-RASQUAL for advancing genetic regulation research in aquaculture, facilitating molQTL studies in less-explored species, and improving our understanding of molecular phenotypes shaped by genetic diversity.