Direct inference of the distribution of fitness effects of spontaneous mutations from recombinant inbred C. elegans mutation accumulation lines

Timothy A. Crombie
Moein Rajaei
Ayush S. Saxena
Lindsay M. Johnson
Sayran Saber
Robyn E. Tanny
José Miguel Ponciano
Erik C. Andersen
Juannan Zhou
Charles F. Baer

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Evaluated articles (Arcadia Science)

Abstract

The distribution of fitness effects (DFE) of new mutations plays a central role in evolutionary biology. Estimates of the DFE from experimental Mutation Accumulation (MA) lines are compromised by the complete linkage disequilibrium (LD) between mutations in different lines. To reduce LD, we constructed two sets of recombinant inbred lines from a cross of two C. elegans MA lines. One set of lines (“RIAILs”) was intercrossed for ten generations prior to ten generations of selfing; the second set of lines (“RILs”) omitted the intercrossing. Residual LD in the RIAILs is much less than in the RILs, which affects the inferred DFE when the sets of lines are analyzed separately. The best-fit model estimated from all lines (RIAILs + RILs) infers a large fraction of mutations with positive effects (∼40%); models that constrain mutations to have negative effects fit much worse. The conclusion is the same using only the RILs. For the RIAILs, however, models that constrain mutations to have negative effects fit nearly as well as models that allow positive effects. When mutations in high LD are pooled into haplotypes, the inferred DFE becomes increasingly negative-skewed and leptokurtic. We conclude that the conventional wisdom - most mutations have effects near zero, a handful of mutations have effects that are substantially negative and mutations with positive effects are very rare – is likely correct, and that unless it can be shown otherwise, estimates of the DFE that infer a substantial fraction of mutations with positive effects are likely confounded by LD.

Arcadia Science
Aug 2, 2024

Here we employ a classical line-cross strategy with MA lines, to break down the linkage disequilibrium among the accumulated mutations. We then combine whole-genome sequencing with high-throughput competitive fitness assays to estimate the DFE of a set of 169 spontaneous mutations.

I greatly enjoyed reading this paper. True experimental estimates of the DFE in MA studies are super valuable and provide a very interesting comparison for pop-gen based DFE methods as pointed out by the authors.

Read the original source
Arcadia Science
Aug 2, 2024

The magnitude of the raw difference is typically much larger than that of the posterior effects. The difference is likely caused by LD, in that the raw difference of a single mutation contains contributions from other linked mutations, which may inflate the estimates.

Could you constrain this analysis to mutations that are in LE with other de-novo mutations to test this hypothesis?

Read the original source
Arcadia Science
Aug 2, 2024

Averaged over all RI(AI)Ls, accounting for variation among assay blocks and removing two outlying lines, the regression of W on number of mutations is not significantly different from 0 (slope = −0.0051, F1,509=1.83, P>0.17), although the trend suggests that mutations are deleterious, on average.

Is there a chance that false negative mutations (i.e. incorrectly unobserved events in the MA lines) could contribute to this result?

Read the original source
Arcadia Science
Aug 2, 2024

The simplest way to infer the mutational effect at a locus is to calculate the mean value of all lines with a mutant allele and all lines with an ancestral allele at that locus; the difference is the raw difference (uRAW) of the mutation at that locus. As a sanity check, we plotted the inferred Bayesian posterior effect against the raw difference; ideally, the correlation should be +1. The correlations were positive, but well below 1 in all three cases (Figure 4). The magnitude of the raw difference is typically much larger than that of the posterior effects. The difference is likely caused by LD, in that the raw difference of a single mutation contains contributions from other linked mutations, which may inflate the estimates.

Two quick thoughts for further sanity checks. 1) Does this regression look any different for SNPs vs indels? …

The simplest way to infer the mutational effect at a locus is to calculate the mean value of all lines with a mutant allele and all lines with an ancestral allele at that locus; the difference is the raw difference (uRAW) of the mutation at that locus. As a sanity check, we plotted the inferred Bayesian posterior effect against the raw difference; ideally, the correlation should be +1. The correlations were positive, but well below 1 in all three cases (Figure 4). The magnitude of the raw difference is typically much larger than that of the posterior effects. The difference is likely caused by LD, in that the raw difference of a single mutation contains contributions from other linked mutations, which may inflate the estimates.

Two quick thoughts for further sanity checks. 1) Does this regression look any different for SNPs vs indels? 2) Do the individual mutation specific effects conform to expectations one might have based on the functional annotations available for these mutational events?

Read the original source
Version published to 10.1101/2024.05.08.593038 on bioRxiv
May 10, 2024

Heritabilities and genetic correlations of Drosophila melanogaster locomotory behaviour traits: a high-throughput phenotyping approach

This article has 12 authors:
1. Szymon Drobniak
2. Erian Macartney
3. Samantha Burke
4. Patrice Pottier
5. Catharina Vendl
6. Zine Hamoudi
7. Chloe Hart
8. Radiah Ahmed
9. Yong Qi Lin
10. Malgorzata Lagisz
11. Gregory Neely
12. Shinichi Nakagawa
This article has no evaluationsLatest version Mar 23, 2026
Genetic Analysis of Genomic and Methylomic Variation and Construction of Multi-Trait Mutant Library in Rice Carried on Chang'e-5

This article has 15 authors:
1. Kai Sun
2. Jinrui Li
3. Zhe Zhao
4. Shiyi Chen
5. Yuanyuan Guo
6. Ying Ling
7. Yongfen Wang
8. Zhikai Han
9. Zengtong Luo
10. Wuming Xiao
11. Jiafeng Wang
12. Guili Yang
13. Yongzhu Liu
14. Tao Guo
15. Chun Chen
This article has no evaluationsLatest version Feb 13, 2026
From Mendelian Inheritance to Machine Learning: The Evolution of Genetic Improvement in Rabbits: A Review

This article has 3 authors:
1. Samuel Ayeh Ofori
2. Richard Asante Botwe
3. Bismark Yeboah
This article has no evaluationsLatest version Feb 2, 2026

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Heritabilities and genetic correlations of Drosophila melanogaster locomotory behaviour traits: a high-throughput phenotyping approach

Genetic Analysis of Genomic and Methylomic Variation and Construction of Multi-Trait Mutant Library in Rice Carried on Chang'e-5

From Mendelian Inheritance to Machine Learning: The Evolution of Genetic Improvement in Rabbits: A Review