The genotype‐phenotype landscape of an allosteric protein

This article has been Reviewed by the following groups

Read the full article See related articles

Abstract

No abstract available

Article activity feed

  1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

    Learn more at Review Commons


    Reply to the reviewers

    Reviewer #1

    The authors study allostery with a beautiful genotype-phenotype experiment to study the fitness landscape of an allosteric lac repressor protein. The authors make a mutational library using error prone pcr and measure the impact on antibiotic resistance protein expression at varying levels of ligand, IPTG, expression. After measuring the impact of mutations authors fill-in the missing data using a neural net model. This type of dose response is not standard in the field, but the richness of their data and the discovery of the "band pass" phenomena prove its worth here splendidly.

    Using this mixed experimental/predicted data the authors explore how each mutation alters the different parameters of a hill equation fit of a dose response curve. Using higher order mutational space the authors look at how mutations can qualitatively switch phenotypes to inverted or band-stop dose-response curves. To validate and further explore a band-stop novel phenotype, the authors focused on a triple mutant and made all combinations of the 3 mutations. The authors find that only one mutation alone alters the dose-response and only in combination does a band-stop behavior present itself. Overall this paper is a fantastic data heavy dive into the allosteric fitness landscape of protein.

    Overall, the data presented in this paper is thoroughly collected and analyzed making the conclusions well-based. We do not think additional experiments nor substantial changes are needed apart from including basic experimental details and more biophysical rationale/speculation as discussed in further detail below.

    The authors do a genotype-phenotype experiment that requires extensive deep sequencing experiments. However, right now quite a bit of basic statistics on the sequencing is missing. Baseline library quality is somewhat shown in supplementary fig 2 but the figure is hard to interpret. It would be good to have a table that states how many of all possible mutations at different mutation depths (single, double, etc) there are. Similarly, sequencing statistics are missing- it would be useful to know how many reads were acquired and how much sequencing depth that corresponds to. This is particularly important for barcode assignment to phenotype in the long-read sequencing. In addition, a synonymous mutation comparison is mentioned but in my reading that data is not presented in the supplemental figures section.

    We thank the reviewer for this succinct summary of the manuscript and the results. We appreciate the reviewer identifying data of interest that were not included in the original manuscript. We agree that this information is necessary to consider the results. Specific changes are summarized in the comments below.

    The paper is very much written from an "old school" allostery perspective with static end point structures that are mutually exclusive - eg. p5l10 "relative ligand-binding affinity between the two conformations" - however, an ensemble of conformations is likely needed to explain their data. This is especially true for the bandpass and inverted phenotypes they observe. The work by Hilser et al is of particular importance in this area. We would invite the authors to speculate more freely about the molecular origins of their findings.

    We agree with the suggestions to adopt a modern allosteric perspective. We have changed the language throughout the manuscript to align with the ensemble model of allostery. We continue to frame results using the Monad-Wyman-Changeaux model, which reliably predicts LacI activity from biophysical parameters and is not exclusive of more modern models of allostery.

    **Minor**

    There are a number of small modifications. In general this paper is very technical and could use with some explanation and discussion for relevance to make the manuscript more approachable for a broader audience.

    P1L23: Ligand binding at one site causes a conformational change that affects the activity of another > not necessarily true - and related to using more "modern" statistical mechanical language for describing allostery.

    We agree with the reviewer’s comment. We have addressed this comment by adopting language in line with more modern view of allostery, for example:

    “With allosteric regulation, ligand binding at one site on a biomolecule changes the activity of another, often distal, site. Switching between active and inactive states provides a sense-and-response function that defines the allosteric phenotype.”

    P2L20: The core experiment of this paper is a selection using a mutational library. In the main body the authors mention the library was created using mutagenic pcr but leave it at that. More details on what sort of mutagenic pcr was used in the main body would be useful. According to the methods error prone pcr was used. Why use er-pcr vs deep point mutational libraries? Presumably to sample higher order phenotype? Rationale should be included. Were there preliminary experiments that helped calibrate the mutation level?

    We agree that justifying the decision to use error-prone PCR for library construction would be helpful. To explain this decision, we have added to the main text to explain this decision and to reflect on the consequences.

    “We used error-prone PCR across the full lacI CDS to investigate the effects of higher-order substitutions spread across the entire LacI sequence and structure.”

    And

    Novel phenotypes emerged at mutational distances greater than one amino acid substitution, highlighting the value in sampling a broader genotype space with higher-order mutations. Furthermore, the untargeted, random mutagenesis approach used here was critical for finding these novel phenotypes, as the genotypes required for these novel phenotypes were unpredictable.”

    P2L20: Baseline library statistics would be great in a table for coverage, diversity, etc especially as this was done by error prone pcr vs a more saturated library generation method. This is present in sup fig2 but it's a bit complicated.

    To more clearly convey the diversity within the library, we have included a heatmap of amino acid substitution counts found within the library (Supplementary Fig. 4). Additionally, we have added Supplementary Table 1, which lists the distribution of mutational distances of LacI variants found within the library, and the corresponding coverage of all possible mutations for each mutational distance.

    P2L26: How were FACS gates drawn? This is in support fig17 - should be pointed to here.

    We agree that a better description of the FACS process would be helpful. To address this we have included Supplementary Fig. 2, showing flow cytometry measurements of the library before and after FACS. Additionally, we have extended the description of the FACS process:

    __“The initial library had a bimodal distribution of G____­0, as indicated by flow cytometry results, with a mode at low fluorescence (near G____­0 of wildtype LacI), and mode at higher gene expression. To generate a library in which most of the LacI variants could function as allosteric repressors, we used fluorescence activated cell sorting (FACS) to select the portion of the library with low fluorescence in the absence of ligand, gating at the bifurcation of the two modes (Sony SH800S Cell Sorter, Supplementary Fig. 2).”

    __

    P3L4: Where is the figure/data for the synonymous SNP mutations? This should be in the supplement.

    We agree this data is necessary to support the claim that LacI function was not impacted by synonymous mutations. We have included a new Supplementary Fig. 9, which shows the distribution of Hill equation parameters for LacI variants that code for the wild-type amino acid sequence, but with non-identical coding DNA sequences. Additionally, we included the results of a statistical analysis in the main text, this analysis compared all synonymous sequences in the library:

    “____We compared the distributions of the resulting Hill equation parameters between two sets of variants: 39 variants with exactly the wild-type coding DNA sequence for LacI (but with different DNA barcodes) and 310 variants with synonymous nucleotide changes (i.e. the wild-type amino acid sequence, but a non-wild-type DNA coding sequence). __Using the Kolmogorov-Smirnov test, we found no significant differences between the two sets (p-values of 0.71, 0.40, 0.28, and 0.17 for G0, G∞, EC50, and n respectively, Supplementary Fig. 9).” __

    P3L20: The authors use a ML learning deep neural network to predict variant that were not covered in the screen. However, the library generation method is using error prone pcr meaning there could multiple mutations resulting in the same amino acid change. The models performance was determined by looking at withheld data however error prone pcr could result in multiple nonsynomymous mutations of the same amino acid. For testing were mutations truly withheld or was there overlap? Because several mutations are being represented by different codon combinations. Was the withheld data for the machine learning withholding specific substitutions?

    We thank the reviewer for identifying the need to clarify this critical data analysis. Data was held-out at the amino acid level, and so no overlap between the training and testing datasets occurred. We have clarified the description of the method in the main text:

    __“We calculated RMSE using only held-out data not used in the model training, and the split between held-out data and training data was chosen so that all variants with a specific amino acid sequence appear in only one of the two sets.” __


    In addition, higher order protein interactions are complicated and idiosyncratic. I am surprised how well the neural net performs on higher order substitutions.

    P4L4: Authors find mutations at the dimer/tetramer interfaces but don't mention whether polymerization is required. is dimerization required for dna binding? Tetramerization?

    We agree with the reviewer that, overall, a description of LacI structure and function would improve messaging the reported results. As such, we have added Supplementary Table 2, which defines the structural features discussed throughout the manuscript. Additionally, we have strived to describe the relevant structural and functional role of specific amino acids that are discussed in the text. Finally, we have also added a paragraph to the main text that summarizes the structure and function of LacI.

    “The LacI protein has 360 amino acids arranged into three structural domains____22–24____. The first 62 N-terminal amino acids form the DNA-binding domain, comprising a helix-turn-helix DNA-binding motif and a hinge that connects the DNA-binding motif and the core domain. The core domain, comprising amino acid positions 63-324, is divided into two structural subdomains: the N-terminal core and the C-terminal core. The full core domain forms the ligand-binding pocket, core-pivot region, and dimer interface. The tetramerization domain comprises the final 30 amino acids and includes a flexible linker and an 18 amino acid α-helix (Fig. 3, Supplementary Table 2). Naturally, LacI functions as a dimer of dimers: Two LacI monomers form a symmetric dimer that further assembles into a tetramer (a dimer of dimers).”

    P4L8: Substitutions near the dimer interface both impact g0 and ec50, which authors say is consistent with a change in the allosteric constant. Can authors explain their thinking more in the paper to make it easier to follow? Are the any mutations in this area that only impact g0 or ec50 alone? Why may these specific residues modify dimerization?

    We agree that a more in-depth discussion on the possible mechanisms behind these phenotypic changes would improve the manuscript. We have added discussion throughout the subsection “Effects of amino acid substitutions on LacI phenotype,” we believe this added discussion improve the manuscript and clarify the relationship between the observed allosteric phenotypes and the molecular mechanisms behind them. W

    Overall, we have made a number of changes in the manuscript that we hope will address these concerns.

    P4L8: The authors discuss the allosteric constant extensively within the paper but do not explain it. It would be helpful to have an explanation of this to improve readability. This explanation should include the statistical mechanical basis of it and some speculation about the ways it manifests biophysically.

    The allosteric constant is a critical concept, and we agree that it must be defined and discussed clearly throughout the manuscript. We have greatly expanded the discussion of the effects of single amino acid substitutions, and in the process we give examples of biochemical changes in the protein, and how they may affect the allosteric constant. We think this added text improves the manuscript and helps clarify the allosteric constant and the biomolecular processes that affect it.

    P4L1-16: Authors see mutations in the dimerization region that impact either G0 and Gsaturated in combination with Ec50 but not g0 and gsaturated together. Maybe we do not fully understand the hill equation but why are there no mutations that impact both g0 and gsaturated seen in support fig 13c? Why would mutations in the same region potentially impacting dimerization impact either g0 or gsaturated? What might be the mechanism behind divergent responses?

    It is important to recognize that the dimer interface does not just support the formation of dimers. There are many points of contact along the dimer interface that change when LacI switches between the active and inactive states. So, the dimer interface also helps regulate the balance between the active and inactive states. Our results show that different substitutions near the dimer interface can push this balance either toward the active or inactive states to varying degrees. We’ve added text throughout the description of single-substitutions effects to give specific examples and added a new paragraph at the end of that section to provide additional discussion and context. With regard to the more specific question of changes to both G0 and Ginf, the models indicate that simultaneous changes to those Hill Equation parameters requires an unusual combination of biophysical changes. To clarify this point, we added a short paragraph to the text:

    __ “None of the single amino substitutions measured in the library simultaneously decrease ____G∞ and increase G0 (Supplementary Fig. 20c). This is not surprising, since substitutions that shift the biophysics to favor the active state tend to decrease G∞ while those that favor the inactive state tend to increase G0, and the biophysical models____2,14,15____ indicate that only a combination of parameter changes can cause both modifications to the dose-response. The library did, however, contain several multi-substitution variants with simultaneously decrease ____G∞ and increase G0. These inverted variants, and their associated substitutions are discussed below.”__


    P4L29: for interpretability it would be good to explain what log-additive effect means in the context of allostery.

    We agree that this information would be useful to the reader and have added additional text to explain log-additivity. We thank the reviewer for pointing out this oversight.

    “Combining multiple substitutions in a single protein almost always has a log-additive effect on EC50. That is, the proportional effects of two individual amino acid substitutions on the EC50 can be multiplied together. For example, if substitution A results in a 3-fold change, and substitution B results in a 2-fold change, the double substitution, AB, behaving log-additively, results in a 6-fold change____.”

    P4L34-P5L19: This section is wonderful. Really cool results and interesting structural overlap!

    P5L34 Helix 9 of the protein is mentioned but it's functional relevance is not. This is common throughout the paper - it would be useful for there to be an overview somewhere to help the reader contextualize the results with known structural role of these elements.

    We agree with the reviewer that this information would help to contextualize the results. We have made a number of changes to address this. First, we have added Supplementary Table 2, which describes the structural features of LacI. Second, we have added a paragraph overviewing the structure and function of LacI. Third, we have expanded the section “the effects of individual amino acid substitutions on the function of LacI” to discuss the structural or biochemical impact of specific substitutions. We thank the reviewer for this suggestion.

    P5L39: The authors identified a triple mutant with the band-stop phenotype then made all combination of the triple mutant. Of particular interest is R195H/G265D which is nearly the same as the triple mutant. It would be nice if the positions of each of these mutations and have some discussion to begin to rationalize this phenotype, even if to point out how far apart they are and that there is no easy structural rationale!

    We appreciate the reviewer highlighting this area of interest. We have added structural information to Fig. 6, which indicates to position of the amino acid substitutions that result in the band-stop phenotype, as well as a small discussion in the main text:

    “To further investigate the band-stop phenotype, we chose a strong band-stop LacI variant with only three amino acid substitutions (R195H/G265D/A337D). These three positions are distributed distally on the periphery of the C-terminal core domain, and the role that each of these substitutions plays in the emergence of the band-stop phenotype is unclear.”

    P6L9: There should be more discussion of the significance of this work directly compared to what is known. For instance, negative cooperativity is mentioned as an explanation for bi-phasic dose response but this idea is not explained. Why would the relevant free energy changes be more entropic? Another example is the reverse-TetR phenotype observed by Hillen et al.

    We agree that more discussion is necessary to frame the results reported in the manuscript. To address this, we have added additional discussion throughout the manuscript that relates the results to the current understanding of allostery. Also, in the Conclusion, we added specific examples that lead us to link the ideas of bi-phasic dose response, negative cooperativity, and entropy/disorder. We believe these additions have improved the manuscript and we thank the reviewer for this suggestion.

    P6L28: The authors mention that phenotypes exist with genotypes that are discoverable with genotype-phenotype landscapes. This study due to the constraints of error prone pcr were somewhat limited. How big is the phenotypic landscape? Is it worth doing a more systematic study? What is the optimal experimental design: Single mutations, doubles, random - where is there the most information. How far can you drift before your machine learning model breaks down? How robust would it be to indels?

    The reviewer raises some excellent questions here, some of which are appropriate subjects for future work. The optimal experimental design depends on the objective: If the goal is to understand every possible mutation, a systematic site-saturation approach would be more appropriate. However, the landscape of a natural protein is limited by its wild-type DNA coding sequence, and so some substitutions are inaccessible (due to the arrangement of the codon table). The approach we took allowed to us characterize most of the accessible amino acid substitutions, while also allowing us to identify novel functions that would not have been identified with other approaches. We have added a little to the main text to discuss this (below). With regard to the DNN model, in the manuscript (SI Fig. 14), we show how the predictive accuracy degrades with mutational distance from the wild-type. It is possible that the type of DNN that we used could handle indels, since it effectively encodes each variant as a set of step-wise changes from the wild-type. But as with all machine-learning methods, it would require training with a dataset that included indels.

    “Novel phenotypes emerged at mutational distances greater than one amino acid substitution, highlighting the value in sampling a broader genotype space with higher-order mutations. Furthermore, the untargeted, random mutagenesis approach used here was critical for finding these novel phenotypes, as the genotypes required for these novel phenotypes were unpredictable.”

    Figures:

    Sup figs 3-7: The comparison of library-based results and single mutants is a great example of how to validate genotype-phenotype experiments!

    Thank you.

    Supp fig 5.: Missing figure number.

    We appreciate the reviewer catching this error and have attempted to properly label all figures and tables in this revision. Thank you.

    Supp fig7: G0 appears to have very poor fit between library vs single mutant version. Why might this be? R^2 would likely be better to report here as opposed to RMSE as RMSE is sensitize to the magnitude of the data such that you cannot directly compare RMSE of say 'n' to G0.

    We agree that these are important discussion points and have addressed this concern with an expanded discussion in the main text, as well as the addition of coefficient of correlation (R^2) in the caption for Figure 2 (previously supplementary figure 7). We believe these additions contribute meaningfully to the manuscript, and they address the concerns of the reviewer. The additional text reads:

    “We compared the Hill equation parameters from the library-scale measurement to those same parameters determined from flow cytometry measurements for each of the chemically synthesized LacI variants (Fig. 2). This served as a check of the new library-scale method’s overall ability to measure dose-response curves with quantitative accuracy. The accuracy for each Hill equation parameter in the library-scale measurement was: 4-fold for G0, 1.5-fold for G∞, 1.8-fold for EC50, and ± 0.28 for n. For G0, G∞, and EC50, we calculated the accuracy as: ____, where ____ is the root-mean-square difference between the logarithm of each parameter from the library-scale and cytometry measurements. For n, we calculated the accuracy simply as the root-mean-square difference between the library-scale and cytometry results. The accuracy for the gene expression levels (G0 and G∞) was better at higher gene expression levels (typical for G∞) than at low gene expression levels (typical for G0), which is expected based on the non-linearity of the fitness impact of tetracycline (Supplementary Figs. 10-11). Measurements of the Hill coefficient, n, had high relative uncertainties for both barcode-sequencing and flow cytometry, and so the parameter n was not used in any quantitative analysis.”

    Sup fig13c: it is somewhat surprising that mutations only appear to effect g0 and not gsaturated. This implies that basal and saturated activity are not coupled. Is this expected? Why or why not?

    This comment is partially addressed with a response above (P4L1-16). Coupled gene expression increases do occur, especially with substitutions at the start codon that result in fewer copies of LacI in the cell. In this instance, both G0 and G∞ are increased. Otherwise, changes to multiple biophysical parameters are required to increase both G0 and G∞.

    Reviewer #1 (Significance (Required)):

    Allostery is hard to comprehend because it involves many interacting residues propagating information across a protein. The Monod-Wyman-Changeux (MWC) and Koshland, Nemethy, and Filmer (KNF) models have been a long standing framework to explain much of allostery, however recent formulations have focused on the role of the conformational ensemble and a grounding in statistical mechanics. This manuscript focuses on the functional impact of mutations and therefore contribution of the amino acids to regulation. The authors unbiased approach of combining a dose-response curve and mutational library generation let them fit every mutant to a hill equation. This approach let the authors identify the allosteric phenotype of all measured mutations! The authors found inverted phenotypes which happen in homologs of this protein but most interesting is the strange and idiosyncratic 'Band-stop' phenotype. The band-stop phenotype is bi-phasic that will hopefully be followed up with further studies to explain the mechanism. This manuscript is a fascinating exploration of the adaptability of allosteric landscapes with just a handful of mutations.

    Genotype-phenotype experiments allow sampling immense mutational space to study complex phenotypes such as allostery. However, a challenge with these experiments is that allostery and other complicated phenomena come from immense fitness landscapes altering different parameters of the hill equation. The authors approach of using a simple error prone pcr library combined with many ligand concentrations allowed them to sample a very large space somewhat sparsely. However, they were able to predict this data by training and using a neural net model. I think this is a clever way to fill in the gaps that are inherent to somewhat sparse sampling from error prone pcr. The experimental design of the dose response is especially elegant and a great model for how to do these experiments.

    With some small improvements for readability, this manuscript will surely find broad interest to the genotype-phenotype, protein science, allostery, structural biology, and biophysics fields.

    We were prompted to do this by Review Commons and are posting our submitted review here:

    Willow Coyote-Maestas has relevant expertise in high throughput screening, protein engineering, genotype-phenotype experiments, protein allostery, dating mining, and machine learning.

    James Fraser has expertise in structural biology, genotype-phenotype experiments, protein allostery, protein dynamics, protein evolution, etc.

    Reviewer #2 (Evidence, reproducibility and clarity (Required)):

    The authors use deep mutational scanning to infer the dose-response curves of ~60,000 variants of the LacI repressor and so provide an unprecedently systematic dataset of how mutations affect an allosteric protein. Overall this is an interesting dataset that highlights the potential of mutational scanning for rapidly identifying diverse variants of proteins with desired or unexpected activities for synthetic biology/bioengineering. The relatively common inverted phenotypes and their sequence diversity is interesting, as is the identification of several hundred genotypes with non-sigmoidal band-stop dose-response curves and their enrichment in specific protein regions. A weakness of the study is that some of the parameter estimates seem to have high uncertainty and this is not clearly presented or the impact on the conclusions analysed. A second shortcoming is that there is little mechanistic insight beyond the enrichments of mutations with different effects in different regions of the protein. But as a first overview of the diversity of mutational effects on the dose-response curve of an allosteric protein, this is an important dataset and analysis.

    **Comments**

    **Data quality and reproducibility**

    "The flow cytometry results confirmed both the qualitative and quantitative accuracy of the new method (Supplementary Figs. 3-7)"

    • There need to be quantitative measures of accuracy in the text here for the different parameters.

    We believe this comment is addressed along with the following two comments.

    • Sup fig 7 panels should be main text panels - they are vital for understanding the data quality In particular, the G0 parameter estimates from the library appear to have a lower bound ie they provide no information below a cytometry Go of ~10^4. This is an important caveat and needs to be highlighted in the main text. The Hill parameter (n) estimate for wt (dark gray) replicate barcodes is extremely variable - why is this?
    • In general there is not a clear enough presentation of the uncertainty and biases in the parameter estimations which seem to be rather different for the 4 parameters. Only the EC50 parameter seems to correlate very well with the independent measurements.

    We thank the reviewer for identifying a need for more information on the accuracy of this method. So, we have moved Supplementary Fig. 7 to the main text (Fig 2 in the revised manuscript) and have added coefficients of correlation to each Hill equation parameter in that figure caption. Furthermore, we have added new data (Supplementary Fig. 11), which shows the uncertainty associated with different gene expression levels. Finally, we have added a discussion on the accuracy of this method for each parameter of the Hill equation to the main text. Estimation of the Hill coefficient (n) from data is often highly uncertain and variable, because that parameter estimate can be highly sensitive to random measurement errors at a single point on the curve. The estimate for the wild type appears to be highly variable because the plot contains 53 replicate measurements. So, the plotted variability represents approximately 2 standard deviations. The spread of wild-type results in the plot is consistent with the stated RMSE for the Hill coefficient. Furthermore, the Hill coefficient is not used in any of the additional quantitative analysis in our manuscript, partially because of its relatively high measurement uncertainty, but also because, based on the biophysical models, it is not as informative of the underlying biophysical changes.

    “We compared the Hill equation parameters from the library-scale measurement to those same parameters determined from flow cytometry measurements for each of the chemically synthesized LacI variants (Fig. 2). This served as a check of the new library-scale method’s overall ability to measure dose-response curves with quantitative accuracy. The accuracy for each Hill equation parameter in the library-scale measurement was: 4-fold for G0, 1.5-fold for G∞, 1.8-fold for EC50, and ± 0.28 for n. For G0, G∞, and EC50, we calculated the accuracy as: "exp" ["RMSE" ("ln" ("x" ))], where "RMSE" ("ln" ("x" )) is the root-mean-square difference between the logarithm of each parameter from the library-scale and cytometry measurements. For n, we calculated the accuracy simply as the root-mean-square difference between the library-scale and cytometry results. The accuracy for the gene expression levels (G0 and G∞) was better at higher gene expression levels (typical for G∞) than at low gene expression levels (typical for G0), which is expected based on the non-linearity of the fitness impact of tetracycline (Supplementary Figs. 10-11). Measurements of the Hill coefficient, n, had high relative uncertainties for both barcode-sequencing and flow cytometry, and so the parameter n was not used in any quantitative analysis.”

    • The genotypes in the mutagenesis library contain a mean of 4.4 aa substitutions and the authors us a neural network to estimate 3 of the Hill equation parameters (with uncertainties) for the 1991/2110 of the single aa mutations. It would be useful to have an independent experimental evaluation of the reliability of these inferred single aa mutational effects by performing facs on a panel of single aa mutants (using single aa mutants in sup fig 3-7, if there are any, or newly constructed mutants).

    We agree that the predictive performance of the DNN requires experimental validation. We evaluated the performance by withholding data from 20% of the library, including nearly 200 variants with single amino acid substitutions, and then compared the predicted effect of those substitutions to the measured effect. The results of this test are reported in Supplementary Fig. 14. Additionally, we have adjusted the main text to more clearly explain the evaluation process.

    __“To evaluate the accuracy of the model predictions, we used the root-mean-square error (RMSE) for the model predictions compared with the measurement results. We calculated RMSE using only held-out data not used in the model training, and the split between held-out data and training data was chosen so that all variants with a specific amino acid sequence appear in only one of the two sets.” ____ __

    • fig3/"Combining multiple substitutions in a single protein almost always has a log-additive effect on EC50." How additive are the other 2 parameters? this analysis should also be presented in fig 3. If they are not as additive is it simply because of lower accuracy of the measurements? If the mutational effects are largely additive, then a simple linear model (rather than the DNN) could be used to estimate the single mutant effects from the multiple mutant genotypes.

    We agree with the reviewer that exploring the log-additivity of the Hill equation parameters is informative, and have included Supplementary Figure 21, which displays this information. Furthermore, we expanded the discussion of log-additivity on all three parameters in the main text:

    “Combining multiple substitutions in a single protein almost always has a log-additive effect on EC50. That is, the proportional effects of two individual amino acid substitutions on the EC50 can be multiplied together. For example, if substitution A results in a 3-fold change, and substitution B results in a 2-fold change, the double substitution, AB, behaving log-additively, results in a 6-fold change. Only 0.57% (12 of 2101) of double amino acid substitutions in the measured data have EC50 values that differ from the log-additive effects of the single substitutions by more than 2.5-fold (Fig. 4). This result, combined with the wide distribution of residues that affect EC50, reinforces the view that allostery is a distributed biophysical phenomenon controlled by a free energy balance with additive contributions from many residues and interactions, a mechanism proposed previously1,39 and supported by other recent studies17, rather than a process driven by the propagation of local, contiguous structural rearrangements along a defined pathway.

    A similar analysis of log-additivity for G0 and G∞ is complicated by the more limited range of measured values for those parameters, the smaller number of substitutions that cause large shifts in G0 or G∞, and the higher relative measurement uncertainty at low G(L). However, the effects of multiple substitutions on G0 and G∞ are also consistent with log-additivity for almost every measured double substitution variant (Supplementary Fig. 21).”

    **Presentation/clarity of text and figures**

    • The main text implies that the DNN is trained to predict 3 parameters of the Hill equation but not the Hill coefficient (n). This should be clarified / justified in the main text.

    We agree that the decision to exclude the parameter ‘n’ requires explanation in the main text. To address this, we have added to the main text:

    “Measurements of the Hill coefficient, n, had high relative uncertainties for both barcode-sequencing and flow cytometry, and so the parameter n was not used in any quantitative analysis.”

    and

    “We trained the model to predict the Hill equation parameters G0, G∞, and EC50 (Supplementary Fig. 13), the three Hill equation parameters that were determined with relatively low uncertainty by the library-scale measurement.”

    • The DNN needs to be better explained and justified in the main text for a general audience. How do simpler additive models perform for phenotypic prediction / parameter inference?

    We agree with the reviewer that the DNN needs to be justified in the main text. As part of the revision plan, we propose to compare the predictive performance of the DNN to an additive model.

    • Ref 14. analyses a much smaller set of mutants in the same protein but using an explicit biophysical model. It would be helpful to have a more extensive comparison with the approach and conclusions to this previous study.

    Throughout the manuscript, we frame the results and discussion in terms of the referenced biophysical model. Using the model, we describe the biophysical effects that a substitution may have on LacI, based on observed changes to function associated with that substitution. We also comment briefly on the limitations of this model when applied to the extensive dataset presented here.

    “Most of the non-silent substitutions discussed above are more likely to affect the allosteric constant than either the ligand or operator affinities. Within the biophysical model, those affinities are specific to either the active or inactive state of LacI, i.e. they are defined conditionally, assuming that the protein is in the appropriate state. So, almost by definition, substitutions that affect the ligand-binding or operator-binding affinities (as defined in the models) must be at positions that are close to the ligand-binding site or within the DNA-binding domain. Substitutions that modify the ability of the LacI protein to access either the active state or inactive state, by definition, affect the allosteric constant. This includes, for example, substitutions that disrupt dimer formation (dissociated monomers are in the inactive state), substitutions that lock the dimer rigidly into either the active or inactive state, or substitutions that more subtly affect the balance between the active and inactive states. Thus, because there are many more positions far from the ligand- and DNA- binding regions than close to those regions, there are many more opportunities for substitutions to affect the allosteric constant than the other biophysical parameters. Note that this analysis assumes that substitutions don’t perturb the LacI structure too much, so that the active and inactive states remain somehow similar to the wild-type states. Our results suggest that this is not always the case: consider, for example, the substitutions at positions ____K84 and M98 discussed above and the substitutions resulting in the inverted and band-stop phenotypes discussed below.”

    • Enrichments need statistical tests to know how unexpected that results are e.g. p5 line 12 "67% of strongly inverted variants have substitutions near the ligand-binding pocket"

    We agree that this information is necessary to interpret the results. We have included p-values (previously reported only in the Methods section) throughout the main text of the manuscript.

    The publication by Poelwijk et al. was considered extensively when planning this work, and failing to cite that manuscript would have been tremendously unjust. We have included it, as well as a few additional references that have identified and discussed inverted LacI variants. We sincerely thank the reviewer for identifying this oversight.

    • What mechanisms do the authors envisage that could produce the band-stop dose response curves? There is likely previous theoretical work that could be cited here. In general there is little discussion of the biophysical mechanisms that could underlie the various mutational effects.

    We agree with the reviewer, that discussing the biophysical mechanisms that underlie many of the reported mutations is important to understand the results. We have expanded the subsection “Effects of amino acid substitutions on LacI phenotype” to include discussion on several of the key substitutions (or groups of substitutions) and their potential biophysical effects. Additionally, we consider mechanism that may underlie the band-stop sensor, and propose one model that could explain the band-stop phenotype:

    “In particular, the biphasic dose-response of the band-stop variants suggests negative cooperativity: that is, successive ligand binding steps have reduced ligand binding affinity. Negative cooperativity has been shown to be required for biphasic dose-response curves____42,43____. The biphasic dose-response and apparent negative cooperativity are also reminiscent of systems where protein disorder and dynamics have been shown to play an important role in allosteric function____1____, including catabolite activator protein (CAP)____44,45____ and the Doc/Phd toxin-antitoxin system____46____. This suggests that entropic changes may also be important for the band-stop phenotype. A potential mechanism is that band-stop LacI variants have two distinct inactive states: an inactive monomeric state and an inactive dimeric state. In the absence of ligand, inactive monomers may dominate the population. Then, at intermediate ligand concentrations, ligand binding stabilizes dimerization of LacI into an active state which can bind to the DNA operator and repress transcription. When a second ligand binds to the dimer, it returns to an inactive dimeric state, similar to wildtype LacI. This mechanism, and other possible mechanisms, do not match the MWC model of allostery or its extensions____2,13–15____ and require a more comprehensive study and understanding of the ensemble of states in which these band-stop LacI variants exist.”

    • "This result, combined with the wide distribution of residues that affect EC50, suggests that LacI allostery is controlled by a free energy balance with additive contributions from many residues and interactions." 'additive contributions and interactions' covers all possible models of vastly different complexity i.e. this sentence is rather meaningless.

    We have attempted to contextualize this statement by adding additional discussion and references. We hope these additions give more meaning to this section.

    “____This result, combined with the wide distribution of residues that affect EC50, reinforces the view that allostery is a distributed biophysical phenomenon controlled by a free energy balance with additive contributions from many residues and interactions, a mechanism proposed previously____1,39____ and supported by other recent studies____17____, rather than a process driven by the propagation of local, contiguous structural rearrangements along a defined pathway.”

    • fig 4 c and d compress a lot of information into one figure and I found this figure confusing. It may be clearer to have multiple panels with each panel presenting one aspect. It is also not clear to me what the small circular nodes exactly represent, especially when you have one smaller node connected to two polygonal nodes, and why they don't have the same colour scale as the polygonal nodes.

    We agree with the reviewer that figure 4 (or Figure 5 in the revised manuscript) contains a lot of information. The purpose of this figure is to convey the structural and genetic diversity among the sets of inverted variants and band-stop variants. We designed this figure to convey this point at two levels: a brief overview, where the diversity is apparent by quickly considering the figure, and at a more informative level, with some quantitative data and structurally relevant points highlighted. We have modified the caption slightly, in an effort to improve clarity.

    • line 25 - 'causes a conformational change' -> 'energetic change' (allostery does not always involve conformational change

    We thank the reviewer for this comment and have adopted a more modern language describe allostery throughout the manuscript.

    • sup fig 5 legend misses '5'

    We thank the reviewer for pointing this out, we have attempted to number all figures and tables more carefully.

    • sup fig 7. pls add correlation coefficients to these plots (and move to main text figures).

    We agree that this information is of interest and have included this data as main text Figure 2. In addition, we have included coefficients of correlation in the caption of this figure.

    • Reference 21 is just a title and pubmed link

    We thank the reviewer for identifying this error, we have corrected this in the references.

    • "fitness per hour" -> growth rate

    To ensure that this connection is clearly established, when we introduce fitness for the first time, we clarify that it relates to growth rate:

    “Consequently, in the presence of tetracycline, the LacI dose-response modulates cellular fitness (i.e. growth rate) based on the concentration of the input ligand isopropyl-β-D-thiogalactoside (IPTG).”

    Also, we define ‘fitness’ in the Methods section:

    “The experimental approach for this work was designed to maintain bacterial cultures in exponential growth phase for the full duration of the measurements. So, in all analysis, the Malthusian definition of fitness was used, i.e. fitness is the exponential growth rate____58____.”

    • page 6 line 28 - "discoverable only via large-scale landscape measurements" - directed evolution approaches can also discover such genotypes (see e.g. Poelwijk /Tans paper). Please re-phrase.

    We agree with the reviewer and have adjusted the main text accordingly.

    “____Overall, our findings suggest that a surprising diversity of useful and potentially novel allosteric phenotypes exist with genotypes that are readily discoverable via large-scale landscape measurements.”

    • pls define jargon the first time it is used e.g. band-stop and band-pass

    We agree that all unconventional terms should be explicitly defined when used, and we have attempted to define the band-pass and band-stop dose-response curves more clearly in the main text:

    “These include examples of LacI variants with band-stop dose-response curves (i.e. variants with high-low-high gene expression; e.g. Fig. 1e, Supplementary Fig. 7), and LacI variants with band-pass dose-response curves (i.e. variants with low-high-low gene expression; e.g. Supplementary Fig. 8).”

    **Methods/data availability/ experimental and analysis reproducibility:**

    The way that growth rate is calculated on page 17 equation 1- This section is confusing. Please be explicit about how you accounted for the lag phase, what the lag phase was, and total population growth during this time. In addition, please report the growth curves from the wells of the four plates, the final OD600 of the pooled samples, and exact timings of when the samples were removed from 37 degree incubation in a table. These are critical for calculating growth rate in individual clones downstream.

    We thank the reviewer for identifying the need to clarify this section of text. The ‘lag’ in this section referred to a delay before tetracycline began impacting the growth rate of cells. To address this, we have changed ‘lag’ in this context to ‘delay.’ Furthermore, we have attempted to clarify precisely the cause of this delay, and how we accounted for it in calculating growth rates:

    For samples grown with tetracycline, the tetracycline was only added to the culture media for Growth Plates 2‑4. Because of the mode of action of tetracycline (inhibition of translation), there was a delay in its effect on cell fitness: Immediately after diluting cells into Growth Plate 2 (the first plate with tetracycline), the cells still had a normal level of proteins needed for growth and proliferation and they continued to grow at nearly the same rate as without tetracycline. Over time, as the level of proteins required for cell growth decreased due to tetracycline, the growth rate of the cells decreased. Accordingly, the analysis accounts for the variation in cell fitness (growth rate) as a function of time after the cells were exposed to tetracycline. With the assumption that the fitness is approximately proportional to the number of proteins needed for growth, the fitness as a function of time is taken to approach the new value with an exponential decay:

    (3)

    where μitet is the steady-state fitness with tetracycline, and α is a transition rate. The transition rate was kept fixed at α = log(5), determined from a small-scale calibration measurement. Note that at the tetracycline concentration used during the library-scale measurement (20 µg/mL), μitet was greater than zero even at the lowest G(L) levels (Supplementary Fig. 10). From Eq. (3), the number of cells in each Growth Plate for samples grown with tetracycline is:

    • What were the upper and lower bounds of the measurements? (LacI deletion vs Tet deletion / autofluoresence phenotype - true 100% and true 0% activity). Knowing and reporting these bounds will also allow easier comparison between datasets in the future.

    We agree that knowing the limitations of the measurement are important for contextualizing the results. To address this point, we have included Supplementary Fig. 11, which shows the uncertainty of the measurement across gene expression levels.

    Please clarify whether there was only 1 biological replicate (because the plates were pooled before sequencing)? Or if there were replicates present an analysis of reproducibility.

    We thank the reviewer for pointing out the ambiguity in the original manuscript. The library-scale measurement reported here was completed once, the 24 growth conditions were spread across 96 wells, so each condition occupied 4 wells. The 4 wells were combined prior to DNA extraction. We have clarified this process in the methods by removing ‘duplicate’:

    “Growth Plate 2 contained the same IPTG gradient as Growth Plate 1 with the addition of tetracycline (20 µg/mL) to alternating rows in the plate, resulting in 24 chemical environments, with each environment spread across 4 wells.”

    Despite there being only a single library-scale measurement, the accuracy and reliability of the results are supported by many distinct biological replicates within the library (i.e. LacI variants with the same amino acid sequence but with different barcodes, see new Supplementary Fig. 9), as well as over 100 orthogonal dose-response curve measurements completed with flow cytometry (Figure 2). We believe these support the reproducibility of the work and we have included statistical analysis on the accuracy of the library-scale measurement results.

    “To test the accuracy of the new method for library-scale dose-response curve measurements, we independently verified the results for over 100 LacI variants from the library. For each verification measurement, we chemically synthesized the coding DNA sequence for a single variant and inserted it into a plasmid where LacI regulates the expression of a fluorescent protein. We transformed the plasmid into E. coli and measured the resulting dose-response curve with flow cytometry (e.g. Fig. 1e). We compared the Hill equation parameters from the library-scale measurement to those same parameters determined from flow cytometry measurements for each of the chemically synthesized LacI variants (Fig. 2). This served as a check of the new library-scale method’s overall ability to measure dose-response curves with quantitative accuracy. The accuracy for each Hill equation parameter in the library-scale measurement was: 4-fold for G0, 1.5-fold for G∞, 1.8-fold for EC50, and ± 0.28 for n. For G0, G∞, and EC50, we calculated the accuracy as: "exp" ["RMSE" ("ln" ("x" ))], where "RMSE" ("ln" ("x" )) is the root-mean-square difference between the logarithm of each parameter from the library-scale and cytometry measurements. For n, we calculated the accuracy simply as the root-mean-square difference between the library-scale and cytometry results (Supplementary Fig. 7).”

    • Please provide supplementary tables of the data (in addition to the raw sequencing files). Both a table summarising the growth rates, inferred parameter values and uncertainties for genotypes and a second table with the barcode sequence counts across timepoints and associated experimental data.

    We agree that access to this information is critical. Due to the size of the associated data, we have made this data available for download in a public repository. We direct readers to the repository information in the “Data Availability” statement:

    __“The raw sequence data for long-read and short-read DNA sequencing have been deposited in the NCBI Sequence Read Archive and are available under the project accession number PRJNA643436. Plasmid sequences have been deposited in the NCBI Genbank under accession codes MT702633, and MT702634, for pTY1 and pVER, respectively. __

    __The processed data table containing comprehensive data and information for each LacI variant in the library is publicly available via the NIST Science Data Portal, with the identifier ark:/88434/mds2-2259 (https://data.nist.gov/od/id/mds2-2259 or https://doi.org/10.18434/M32259). The data table includes the DNA barcode sequences, the barcode read counts, the time points used for the libarary-scale measurement, fitness estimates for each barcoded variant across the 24 chemical environments, the results of both Bayesian inference models (including posterior medians, covariances, and 0.05, 0.25, 0.75, and 0.95 posterior quantiles), the LacI CDS and amino acid sequence for each barcoded variant (as determined by long-read sequencing), the number of LacI CDS reads in the long-read sequencing dataset for each barcoded variant, and the number of unintended mutations in other regions of the plasmid (from the long-read sequencing data). __

    Code Availability

    __All custom data analysis code is available at https://github.com/djross22/nist_lacI_landscape_analysis.” __

    Reviewer #2 (Significance (Required)):

    The authors present an unprecedently systematic dataset of how mutations affect an allosteric protein. This illustrates the potential of mutational scanning for rapidly identifying diverse variants of allosteric proteins / regulators with desired or unexpected activities for synthetic biology/bioengineering.

    Previous studies have identified inverted dose-response curve for a lacI phenotypes https://www.cell.com/fulltext/S0092-8674(11)00710-0 but using directed evolution i.e. they were not comprehensive in nature.

    The audience of this study would be protein engineers, the allostery field, synthetic biologists and the mutation scanning community and evolutionary biologists interested in fitness landscapes.

    My relevant expertise is in deep mutational scanning and genotype-phenotype landscapes, including work on allosteric proteins and computational methods.

    Reviewer #3 (Evidence, reproducibility and clarity (Required)):

    In this interesting manuscript the authors developed in ingenious high throughput screening approach which utilizes DNA barcoding to select variants of LacI proteins with different allosteric profiles for IPTG control using E. coli fitness (growth rate) in a range of antibiotic concentrations as a readout thus providing a genotype-phenotype map for this enzyme. The authors used library of 10^5-10^ variants of LacI expressed from a plasmid and screened for distinct IPTG activation profiles under different conditions including several antibiotic stressors. As a result they identified various patterns of activation including normal (sigmoidal increase), inverted (decrease) and unusual stop-band where the dependence of growth on [IPTG] is non-monotonic. The study is well-conceived, well executed and provides statistically significant results. The key advance provided by this work is that it allows to identify specific mutations in LacI connected with one of three allosteric profiles. The paper is clearly written all protocols are explained and it can be reproduced in a lab that possesses proper expertise in genetics.

    Reviewer #3 (Significance (Required)):

    The significance of this work is that it discovered libraries of LacI variants which give rise to distinct profiles of allosteric control of activation of specific genes (in this case antibiotic resistance) by the Lac mechanism. The barcoding technology allowed to identify specific mutations which are (presumably) causal of changes in the way how allosteric activation of LacI by IPTG works. As such it provides a rich highly resolved dataset of LacI variants for further exploration and analysis.

    Alongside with these strengths several weaknesses should also be noted:

    1. First and foremost the paper does not provide any molecular-level biophysical insights into the impact of various types of mutations on molecular properties of LacI. Do the mutations change binding affinity to IPTG? Binding side? Communication dynamics? Stability? The diagrams of connectivity for the stop-band mutations (Fig.4) do not provide much help as they do not tell much which molecular properties of LacI are affected by mutations and why certain mutations have specific effect on allostery. A molecular level exploration would make this paper much stronger.

    We address this comment with comment (2), below.

    1. In the same vein a theoretical MD study would be quite illuminating in answering the key unanswered question of this work: Why do mutations have various and pronounced effects of allosteric regulation by LacI?. I think publication of this work should not be conditioned on such study but again adding would make the work much stronger.

    We appreciate the reviewer’s comments and agree that investigating the molecular mechanisms driving the phenotypic changes identified in this work is a compelling proposition. Throughout the manuscript, we identify positions and specific amino acid substitutions that affect the measurable function of LacI, and occasionally discuss the biophysical effects that may underly these changes. We have expanded the discussion to include possible molecular-level effects.

    The dataset reported here identifies many potential candidates for molecular-level study, either computationally or experimentally. However, this manuscript is scoped to report a large-scale method to measure the genotype-phenotype landscape of an allosteric protein, and a limited investigation into the emergence of novel phenotypes that are identified in the landscape.

    1. Lastly a recent study PNAS v.116 pp.11265-74 (2019) explored a library of variants of E. coli Adenylate Kinase and showed the relationship between allosteric effects due to substrate inhibition and stability of the protein. Perhaps a similar relationship can explored in this case of LacI.

    We thank the reviewer for highlighting this publication. We agree with the reviewer that similar effects may play a role in the activity of LacI. Establishing such a relationship would require additional experimentation, and, we think, is outside the scope of the submitted manuscript. Although, we hope follow-up studies using this dataset will investigate this phenomenon and other related mechanisms, that may underlie the band-stop phenotype and other observed effects.

  2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

    Learn more at Review Commons


    Referee #3

    Evidence, reproducibility and clarity

    In this interesting manuscript the authors developed in ingenious high throughput screening approach which utilizes DNA barcoding to select variants of LacI proteins with different allosteric profiles for IPTG control using E. coli fitness (growth rate) in a range of antibiotic concentrations as a readout thus providing a genotype-phenotype map for this enzyme. The authors used library of 10^5-10^ variants of LacI expressed from a plasmid and screened for distinct IPTG activation profiles under different conditions including several antibiotic stressors. As a result they identified various patterns of activation including normal (sigmoidal increase), inverted (decrease) and unusual stop-band where the dependence of growth on [IPTG] is non-monotonic. The study is well-conceived, well executed and provides statistically significant results. The key advance provided by this work is that it allows to identify specific mutations in LacI connected with one of three allosteric profiles. The paper is clearly written all protocols are explained and it can be reproduced in a lab that possesses proper expertise in genetics.

    Significance

    The significance of this work is that it discovered libraries of LacI variants which give rise to distinct profiles of allosteric control of activation of specific genes (in this case antibiotic resistance) by the Lac mechanism. The barcoding technology allowed to identify specific mutations which are (presumably) causal of changes in the way how allosteric activation of LacI by IPTG works. As such it provides a rich highly resolved dataset of LacI variants for further exploration and analysis.

    Alongside with these strengths several weaknesses should also be noted:

    1. First and foremost the paper does not provide any molecular-level biophysical insights into the impact of various types of mutations on molecular properties of LacI. Do the mutations change binding affinity to IPTG? Binding side? Communication dynamics? Stability? The diagrams of connectivity for the stop-band mutations (Fig.4) do not provide much help as they do not tell much which molecular properties of LacI are affected by mutations and why certain mutations have specific effect on allostery. A molecular level exploration would make this paper much stronger.
    2. In the same vein a theoretical MD study would be quite illuminating in answering the key unanswered question of this work: Why do mutations have various and pronounced effects of allosteric regulation by LacI?. I think publication of this work should not be conditioned on sucgh study but again adding would make the work much stronger.
    3. Lastly a recent study PNAS v.116 pp.11265-74 (2019) explored a library of variants of E. coli Adenylate Kinase and showed the relationship between allosteric effects due to substrate inhibition and stability of the protein. Perhaps a similar relationship can explored in this case of LacI.
  3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

    Learn more at Review Commons


    Referee #2

    Evidence, reproducibility and clarity

    The authors use deep mutational scanning to infer the dose-response curves of ~60,000 variants of the LacI repressor and so provide an unprecedently systematic dataset of how mutations affect an allosteric protein. Overall this is an interesting dataset that highlights the potential of mutational scanning for rapidly identifying diverse variants of proteins with desired or unexpected activities for synthetic biology/bioengineering. The relatively common inverted phenotypes and their sequence diversity is interesting, as is the identification of several hundred genotypes with non-sigmoidal band-stop dose-response curves and their enrichment in specific protein regions. A weakness of the study is that some of the parameter estimates seem to have high uncertainty and this is not clearly presented or the impact on the conclusions analysed. A second shortcoming is that there is little mechanistic insight beyond the enrichments of mutations with different effects in different regions of the protein. But as a first overview of the diversity of mutational effects on the dose-response curve of an allosteric protein, this is an important dataset and analysis.

    Comments

    Data quality and reproducibility

    "The flow cytometry results confirmed both the qualitative and quantitative accuracy of the new method (Supplementary Figs. 3-7)"

    • There need to be quantitative measures of accuracy in the text here for the different parameters.
    • Sup fig 7 panels should be main text panels - they are vital for understanding the data quality In particular, the G0 parameter estimates from the library appear to have a lower bound ie they provide no information below a cytometry Go of ~10^4. This is an important caveat and needs to be highlighted in the main text. The Hill parameter (n) estimate for wt (dark gray) replicate barcodes is extremely variable - why is this?
    • In general there is not a clear enough presentation of the uncertainty and biases in the parameter estimations which seem to be rather different for the 4 parameters. Only the EC50 parameter seems to correlate very well with the independent measurements.
    • The genotypes in the mutagenesis library contain a mean of 4.4 aa substitutions and the authors us a neural network to estimate 3 of the Hill equation parameters (with uncertainties) for the 1991/2110 of the single aa mutations. It would be useful to have an independent experimental evaluation of the reliability of these inferred single aa mutational effects by performing facs on a panel of single aa mutants (using single aa mutants in sup fig 3-7, if there are any, or newly constructed mutants).
    • fig3/"Combining multiple substitutions in a single protein almost always has a log-additive effect on EC50." How additive are the other 2 parameters? this analysis should also be presented in fig 3. If they are not as additive is it simply because of lower accuracy of the measurements? If the mutational effects are largely additive, then a simple linear model (rather than the DNN) could be used to estimate the single mutant effects from the multiple mutant genotypes.

    Presentation/clarity of text and figures

    • The main text implies that the DNN is trained to predict 3 parameters of the Hill equation but not the Hill coefficient (n). This should be clarified / justified in the main text.
    • The DNN needs to be better explained and justified in the main text for a general audience. How do simpler additive models perform for phenotypic prediction / parameter inference?
    • Ref 14. analyses a much smaller set of mutants in the same protein but using an explicit biophysical model. It would be helpful to have a more extensive comparison with the approach and conclusions o this previous study.
    • Enrichments need statistical tests to know how unexpected that results are e.g. p5 line 12 "67% of strongly inverted variants have substitutions near the ligand-binding pocket"
    • missing citation: Poelwijk et al 2011 https://www.cell.com/fulltext/S0092-8674(11)00710-0 previously reported an inverted dose-response curve for a lacI mutant.
    • What mechanisms do the authors envisage that could produce the band-stop dose response curves? There is likely previous theoretical work that could be cited here. In general there is little discussion of the biophysical mechanisms that could underlie the various mutational effects.
    • "This result, combined with the wide distribution of residues that affect EC50, suggests that LacI allostery is controlled by a free energy balance with additive contributions from many residues and interactions." 'additive contributions and interactions' covers all possible models of vastly different complexity i.e. this sentence is rather meaningless.
    • fig 4 c and d compress a lot of information into one figure and I found this figure confusing. It may be clearer to have multiple panels with each panel presenting one aspect. It is also not clear to me what the small circular nodes exactly represent, especially when you have one smaller node connected to two polygonal nodes, and why they don't have the same colour scale as the polygonal nodes.
    • line 25 - 'causes a conformational change' -> 'energetic change' (allostery does not always involve conformational change
    • sup fig 5 legend misses '5'
    • sup fig 7. pls add correlation coefficients to these plots (and move to main text figures).
    • Reference 21 is just a title and pubmed link
    • "fitness per hour" -> growth rate
    • page 6 line 28 - "discoverable only via large-scale landscape measurements" - directed evolution approaches can also discover such genotypes (see e.g. Poelwijk /Tans paper). Please re-phrase.
    • pls define jargon the first time it is used e.g. band-stop and band-pass

    Methods/data availability/ experimental and analysis reproducibility:

    • The way that growth rate is calculated on page 17 equation 1- This section is confusing. Please be explicit about how you accounted for the lag phase, what the lag phase was, and total population growth during this time. In addition, please report the growth curves from the wells of the four plates, the final OD600 of the pooled samples, and exact timings of when the samples were removed from 37 degree incubation in a table. These are critical for calculating growth rate in individual clones downstream.
    • What were the upper and lower bounds of the measurements? (LacI deletion vs Tet deletion / autofluoresence phenotype - true 100% and true 0% activity). Knowing and reporting these bounds will also allow easier comparison between datasets in the future.
    • Please clarify whether there was only 1 biological replicate (because the plates were pooled before sequencing)? Or if there were replicates present an analysis of reproducibility.
    • Please provide supplementary tables of the data (in addition to the raw sequencing files). Both a table summarising the growth rates, inferred parameter values and uncertainties for genotypes and a second table with the barcode sequence counts across timepoints and associated experimental data.

    Significance

    The authors present an unprecedently systematic dataset of how mutations affect an allosteric protein. This illustrates the potential of mutational scanning for rapidly identifying diverse variants of allosteric proteins / regulators with desired or unexpected activities for synthetic biology/bioengineering.

    Previous studies have identified inverted dose-response curve for a lacI phenotypes https://www.cell.com/fulltext/S0092-8674(11)00710-0 but using directed evolution i.e. they were not comprehensive in nature.

    The audience of this study would be protein engineers, the allostery field, synthetic biologists and the mutation scanning community and evolutionary biologists interested in fitness landscapes.

    My relevant expertise is in deep mutational scanning and genotype-phenotype landscapes, including work on allosteric proteins and computational methods.

  4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

    Learn more at Review Commons


    Referee #1

    Evidence, reproducibility and clarity

    The authors study allostery with a beautiful genotype-phenotype experiment to study the fitness landscape of an allosteric lac repressor protein. The authors make a mutational library using error prone pcr and measure the impact on antibiotic resistance protein expression at varying levels of ligand, IPTG, expression. After measuring the impact of mutations authors fill-in the missing data using a neural net model. This type of dose response is not standard in the field, but the richness of their data and the discovery of the "band pass" phenomena prove its worth here splendidly.

    Using this mixed experimental/predicted data the authors explore how each mutation alters the different parameters of a hill equation fit of a dose response curve. Using higher order mutational space the authors look at how mutations can qualitatively switch phenotypes to inverted or band-stop dose-response curves. To validate and further explore a band-stop novel phenotype, the authors focused on a triple mutant and made all combinations of the 3 mutations. The authors find that only one mutation alone alters the dose-response and only in combination does a band-stop behavior present itself. Overall this paper is a fantastic data heavy dive into the allosteric fitness landscape of protein.

    Major

    Overall, the data presented in this paper is thoroughly collected and analyzed making the conclusions well-based. We do not think additional experiments nor substantial changes are needed apart from including basic experimental details and more biophysical rationale/speculation as discussed in further detail below.

    The authors do a genotype-phenotype experiment that requires extensive deep sequencing experiments. However, right now quite a bit of basic statistics on the sequencing is missing. Baseline library quality is somewhat shown in supplementary fig 2 but the figure is hard to interpret. It would be good to have a table that states how many of all possible mutations at different mutation depths (single, double, etc) there are. Similarly, sequencing statistics are missing- it would be useful to know how many reads were acquired and how much sequencing depth that corresponds to. This is particularly important for barcode assignment to phenotype in the long-read sequencing. In addition, a synonymous mutation comparison is mentioned but in my reading that data is not presented in the supplemental figures section.

    The paper is very much written from an "old school" allostery perspective with static end point structures that are mutually exclusive - eg. p5l10 "relative ligand-binding affinity between the two conformations" - however, an ensemble of conformations is likely needed to explain their data. This is especially true for the bandpass and inverted phenotypes they observe. The work by Hilser et al is of particular importance in this area. We would invite the authors to speculate more freely about the molecular origins of their findings.

    Minor

    There are a number of small modifications. In general this paper is very technical and could use with some explanation and discussion for relevance to make the manuscript more approachable for a broader audience.

    P1L23: Ligand binding at one site causes a conformational changes that affects the activity of another > not necessarily true - and related to using more "modern" statistical mechanical language for describing allostery.

    P2L20: The core experiment of this paper is a selection using a mutational library. In the main body the authors mention the library was created using mutagenic pcr but leave it at that. More details on what sort of mutagenic pcr was used in the main body would be useful. According to the methods error prone pcr was used. Why use er-pcr vs deep point mutational libraries? Presumably to sample higher order phenotype? Rationale should be included. Were there preliminary experiments that helped calibrate the mutation level?

    P2L20: Baseline library statistics would be great in a table for coverage, diversity, etc especially as this was done by error prone pcr vs a more saturated library generation method. This is present in sup fig2 but it's a bit complicated.

    P2L26: How were FACS gates drawn? This is in support fig17 - should be pointed to here.

    P3L4: Where is the figure/data for the synonymous SNP mutations? This should be in the supplement.

    P3L20: The authors use a ML learning deep neural network to predict variant that were not covered in the screen. However, the library generation method is using error prone pcr meaning there could multiple mutations resulting in the same amino acid change. The models performance was determined by looking at withheld data however error prone pcr could result in multiple nonsynomymous mutations of the same amino acid. For testing were mutations truly withheld or was there overlap? Because several mutations are being represented by different codon combinations. Was the withheld data for the machine learning withholding specific substitutions?

    In addition, higher order protein interactions are complicated and idiosyncratic. I am surprised how well the neural net performs on higher order substitutions.

    P4L4: Authors find mutations at the dimer/tetramer interfaces but don't mention whether polymerization is required. is dimerization required for dna binding? Tetramerization?

    P4L8: Substitutions near the dimer interface both impact g0 and ec50, which authors say is consistent with a change in the allosteric constant. Can authors explain their thinking more in the paper to make it easier to follow? Are the any mutations in this area that only impact g0 or ec50 alone? Why may these specific residues modify dimerization?

    P4L8: The authors discuss the allosteric constant extensively within the paper but do not explain it. It would be helpful to have an explanation of this to improve readability. This explanation should include the statistical mechanical basis of it and some speculation about the ways it manifests biophysically.

    P4L1-16: Authors see mutations in the dimerization region that impact either G0 and Gsaturated in combination with Ec50 but not g0 and gsaturated together. Maybe we do not fully understand the hill equation but why are there no mutations that impact both g0 and gsaturated seen in support fig 13c? Why would mutations in the same region potentially impacting dimerization impact either g0 or gsaturated? What might be the mechanism behind divergent responses?

    P4L29: for interpretability it would be good to explain what log-additive effect means in the context of allostery.

    P4L34-P5L19: This section is wonderful. Really cool results and interesting structural overlap!

    P5L34 Helix 9 of the protein is mentioned but it's functional relevance is not. This is common throughout the paper - it would be useful for there to be an overview somewhere to help the reader contextualize the results with known structural role of these elements.

    P5L39: The authors identified a triple mutant with the band-stop phenotype then made all combination of the triple mutant. Of particular interest is R195H/G265D which is nearly the same as the triple mutant. It would be nice if the positions of each of these mutations and have some discussion to begin to rationalize this phenotype, even if to point out how far apart they are and that there is no easy structural rationale!

    P6L9: There should be more discussion of the significance of this work directly compared to what is known. For instance negative cooperativity is mentioned as an explanation for bi-phasic dose response but this idea is not explained. Why would the relevant free energy changes be more entropic? Another example is the reverse-TetR phenotype observed by Hillen et al.

    P6L28: The authors mention that phenotypes exist with genotypes that are discoverable with genotype-phenotype landscapes. This study due to the constraints of error prone pcr were somewhat limited. How big is the phenotypic landscape? Is it worth doing a more systematic study? What is the optimal experimental design: Single mutations, doubles, random - where is there the most information. How far can you drift before your machine learning model breaks down? How robust would it be to indels?

    Figures:

    Sup figs 3-7: The comparison of library-based results and single mutants is a great example of how to validate genotype-phenotype experiments!

    Supp fig 5.: Missing figure number.

    Supp fig7: G0 appears to have very poor fit between library vs single mutant version. Why might this be? R^2 would likely be better to report here as opposed to RMSE as RMSE is sensitize to the magnitude of the data such that you cannot directly compare RMSE of say 'n' to G0.

    Sup fig13c: it is somewhat surprising that mutations only appear to effect g0 and not gsaturated. This implies that basal and saturated activity are not coupled. Is this expected? Why or why not?

    Significance

    Allostery is hard to comprehend because it involves many interacting residues propagating information across a protein. The Monod-Wyman-Changeux (MWC) and Koshland, Nemethy, and Filmer (KNF) models have been a long standing framework to explain much of allostery, however recent formulations have focused on the role of the conformational ensemble and a grounding in statistical mechanics. This manuscript focuses on the functional impact of mutations and therefore contribution of the amino acids to regulation. The authors unbiased approach of combining a dose-response curve and mutational library generation let them fit every mutant to a hill equation. This approach let the authors identify the allosteric phenotype of all measured mutations! The authors found inverted phenotypes which happen in homologs of this protein but most interesting is the strange and idiosyncratic 'Band-stop' phenotype. The band-stop phenotype is bi-phasic that will hopefully be followed up with further studies to explain the mechanism. This manuscript is a fascinating exploration of the adaptability of allosteric landscapes with just a handful of mutations.

    Genotype-phenotype experiments allow sampling immense mutational space to study complex phenotypes such as allostery. However, a challenge with these experiments is that allostery and other complicated phenomena come from immense fitness landscapes altering different parameters of the hill equation. The authors approach of using a simple error prone pcr library combined with many ligand concentrations allowed them to sample a very large space somewhat sparsely. However, they were able to predict this data by training and using a neural net model. I think this is a clever way to fill in the gaps that are inherent to somewhat sparse sampling from error prone pcr. The experimental design of the dose response is especially elegant and a great model for how to do these experiments.

    With some small improvements for readability, this manuscript will surely find broad interest to the genotype-phenotype, protein science, allostery, structural biology, and biophysics fields.

    We were prompted to do this by Review Commons and are posting our submitted review here:

    Willow Coyote-Maestas has relevant expertise in high throughput screening, protein engineering, genotype-phenotype experiments, protein allostery, dating mining, and machine learning.

    James Fraser has expertise in structural biology, genotype-phenotype experiments, protein allostery, protein dynamics, protein evolution, etc.

    Referees cross-commenting

    Seems like our biggest issues are: better uncertainty estimates of the parameters and more biophysical/mechanistic explanation/speculation. The uncertainty estimates might be tricky with the deep learning approach. The more biophysical speculation will require some re-writing around an ensemble rather than a static structure perspective.