Evaluating the Effectiveness of Data Reduction Techniques in QTL Mapping

Caroline Keller
Celine Caseys
Daniel J. Kliebenstein

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Data reduction methods are frequently employed in large genomics and phenomics studies to extract core patterns, reduce dimensionality, and alleviate multiple testing effects. Principal component analysis (PCA), in particular, identifies the components that capture the most variance within omics datasets. While data reduction can simplify complex datasets, it remains unclear how the use of PCA impacts downstream analyses such as quantitative trait loci (QTL) or genome-wide association (GWA) approaches and their biological interpretation. In QTL studies, an alternative to data reduction is the use of post-hoc data summarization approaches, such as hotspot analysis, which involves mapping individual traits and consolidating results based on shared genomic locations. To evaluate how different analytical approaches may alter the biological insights derived from multi-dimensional QTL datasets, we compared individual trait hotspots with PCA-based QTL mapping using transcriptomic and metabolomic data from a structured recombinant inbred line population. Interestingly, these two approaches identified different genomic regions and genetic architectures. These findings suggest that mapping PCA-reduced data does not merely streamline analyses but may generate a fundamentally different view of the underlying genetic architecture compared to individual trait mapping and hotspot analysis. Thus, the use of PCA and other data reduction techniques prior to QTL or GWAS mapping should be carefully considered to ensure alignment with the specific biological question being addressed.

Version published to 10.1101/2025.08.29.673132 on bioRxiv
Sep 4, 2025

Local genomic estimates provide a powerful framework for haplotype discovery

This article has 7 authors:
1. Will Shaffer
2. Victor Papin
3. Seema Yadav
4. Kai P. Voss-Fels
5. Lee T. Hickey
6. Ben J. Hayes
7. Eric G. Dinglasan
This article has no evaluationsLatest version Sep 2, 2025
fSuSiE enables fine-mapping of QTLs from genome-scale molecular profiles

This article has 9 authors:
1. William R. P. Denault
2. Hao Sun
3. Peter Carbonetto
4. Anjing Liu
5. Philip L. De Jager
6. David Bennett
7. The Alzheimer’s Disease Functional Genomics Consortium
8. Gao Wang
9. Matthew Stephens
This article has no evaluationsLatest version Aug 17, 2025
On the analysis of genetic association with long-read sequencing data

This article has 3 authors:
1. Gengming He
2. Stephen W. Scherer
3. Lisa J. Strug
This article has no evaluationsLatest version Sep 3, 2025

Listed in

Abstract

Article activity feed

Related articles

Local genomic estimates provide a powerful framework for haplotype discovery

fSuSiE enables fine-mapping of QTLs from genome-scale molecular profiles

On the analysis of genetic association with long-read sequencing data