When does accounting for gene-environment interactions improve complex trait prediction? A case study with Drosophila lifespan

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Gene-environment interactions (G × E) have been shown to explain a non-negligible proportion of variance for a plethora of complex traits in different species, including livestock, plants, and humans. While several studies have shown that including G × E can improve prediction accuracy in agricultural species, no increase in accuracy has been observed in human studies. In this work, we sought to investigate the reasons for the contradictory results between agricultural species and humans. Model organisms are useful for studying G × E, since environments can be controlled and genotypes can be replicated across environments. Thus, we used data from a previous study in Drosophila melanogaster , where the authors measured lifespan in different environments and found evidence of G × E. We devised three different cross-validation (CV) scenarios that mimic the relationships between reference and test populations observed in agriculture and human studies, and fitted a few statistical models with and without including G × E. The results showed that G × E explained 8% of lifespan variance. Despite that, models accounting for G × E improved prediction accuracy only in CV scenarios where the same genotypes are observed in both the reference and test populations. While these scenarios are common in agriculture, where individuals of the same family or variety appear in both populations, they are not encountered in human studies, where individuals are unrelated. Thus, our work clarifies in which prediction scenarios we can expect improvements by incorporating G × E into statistical models, and provides an explanation for previous results found in studies involving human populations.

Article activity feed