The Illusion of Polygenicity in Pool-seq Genetic Mapping studies: Insufficient Power Can Mask Simple Genetic Architectures

Anthony D. Long
Katherine M. Hanson
Stuart J Macdonald

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Pool-seq (pooled sequencing) combines DNA from multiple individuals prior to sequencing, enabling population-level allele frequency estimation without individual genotyping. When employed in Genome Wide Association Studies (GWAS) pool-seq faces a fundamental power limitation in that errors on allele frequency estimates are proportional to sequence coverage. Although this power limitation is widely appreciated, pool-seq GWAS lacking unambiguous hits are often interpreted as showing a highly polygenic genetic architecture. We illustrate the limitation of inferring architecture from Manhattan plots using empirical data from a Drosophila zinc resistance mapping study. Despite achieving an average of >700× sequencing coverage in case and control pools, a directly ascertained SNP-based GWAS failed to reveal clear evidence for major-effect loci. A unique feature of the dataset is that an advanced intercross multiparent population, with known founders, was employed as the base population for the GWAS. We leverage this unique population structure to carry out a second GWAS using imputed haplotype frequency estimates, which in contrast revealed localized regions of major effect. A third reanalysis of the same data using imputed SNP genotypes derived from the founder haplotype frequency estimates uncovered a similar major gene architecture. The key difference between approaches lies in statistical power: directly ascertained SNP counts have errors proportional to sequencing coverage whereas known founder imputation-based approaches can be considerably more accurate. This work highlights that insufficiently powered GWAS studies can mask simple genetic architectures and create the illusion of polygenicity through statistical noise alone.

Version published to 10.1101/2025.07.23.666414 on bioRxiv
Jul 24, 2025

Bayesian fine-mapping pinpoints candidate genes and pleiotropic loci of production traits from a chicken backcrossing scheme

This article has 8 authors:
1. Chi Mei Sun
2. Johannes Geibel
3. Henner Simianer
4. Björn Andersson
5. David Cavero
6. Rudolf Preisinger
7. Steffen Weigend
8. Christian Reimer
This article has no evaluationsLatest version Jan 13, 2026
A Map of Multi-omics Quantitative Trait Loci in a Chinese Population Reveals Regulatory Variations and Disease Links

This article has 18 authors:
1. Peilin Jia
2. Peng Yang
3. Shuhua Li
4. Qiwen Zheng
5. Xinxuan Liu
6. Siyu Pan
7. Yaning Zhang
8. Tianzi Liu
9. Sin Man Lam
10. Hongen Kang
11. Xiuli Zhu
12. Shiqi Lin
13. Zhanjie Fang
14. Yin-Ying Wang
15. Jian Wang
16. Guanghou Shui
17. Fan Liu
18. Changqing Zeng
This article has no evaluationsLatest version Jan 6, 2026
Genetic estimates of relatedness: Established practices and new opportunities through low coverage whole genome sequencing

This article has 8 authors:
1. Annika Freudiger
2. Natalie Kestel
3. Vladimir Jovanovic
4. Mariana Madruga de Brito
5. Angelina Ruiz-Lambides
6. Katja Nowick
7. Anja Widdig
8. Harald Ringbauer
This article has no evaluationsLatest version Jan 23, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Bayesian fine-mapping pinpoints candidate genes and pleiotropic loci of production traits from a chicken backcrossing scheme

A Map of Multi-omics Quantitative Trait Loci in a Chinese Population Reveals Regulatory Variations and Disease Links

Genetic estimates of relatedness: Established practices and new opportunities through low coverage whole genome sequencing