Distant sequence regions of JBP1 contribute to J-DNA binding

This article has been Reviewed by the following groups

Read the full article

Listed in

Log in to save this article

Abstract

Base-J (β-D-glucopyranosyloxymethyluracil) is a modified DNA nucleotide that replaces 1% of thymine in kinetoplastid flagellates. The biosynthesis and maintenance of base-J depends on the base-J-binding protein 1 (JBP1) that has a thymidine hydroxylase domain and a J-DNA-binding domain (JDBD). How the thymidine hydroxylase domain synergizes with the JDBD to hydroxylate thymine in specific genomic sites, maintaining base-J during semi-conservative DNA replication, remains unclear. Here, we present a crystal structure of the JDBD including a previously disordered DNA-contacting loop and use it as starting point for molecular dynamics simulations and computational docking studies to propose recognition models for JDBD binding to J-DNA. These models guided mutagenesis experiments, providing additional data for docking, which reveals a binding mode for JDBD onto J-DNA. This model, together with the crystallographic structure of the TET2 JBP1-homologue in complex with DNA and the AlphaFold model of full-length JBP1, allowed us to hypothesize that the flexible JBP1 N-terminus contributes to DNA-binding, which we confirmed experimentally. Α high-resolution JBP1:J-DNA complex, which must involve conformational changes, would however need to be determined experimentally to further understand this unique underlying molecular mechanism that ensures replication of epigenetic information.

Article activity feed

  1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

    Learn more at Review Commons


    Reply to the reviewers

    Manuscript number: RC-2023-01846

    Corresponding author(s): Anastassis Perrakis

    1. General Statements

    We thank the reviewers for the feedback, highlighting the synergy between computational modeling approaches and the experimental techniques we used to study the interaction between JBP1 and J-DNA. We would like to re-iterate that this approach has led to new findings regarding JBP1 and J-DNA interactions, namely:

    • We identified and validated an additional interface in DNA binding domain of JBP1 (JDBD), that contributes to J-DNA binding.
    • Through analysis of the AlphaFold model of JBP1, we propose how the Thymidine Hydroxylase domain (THD) of JBP1 binds J-DNA, and how the JDBD and THD domains are connected flexibly but explicitly to each other.
    • The AlphaFold model allowed to hypothesize that the N-terminus of JBP1 is contributing to J-DNA binding, which we confirmed experimentally. These findings collectively suggest a mechanistic and structural basis on the synergy between the JDBD, the THD and the N-terminus of JBP1, providing a possible explanation to the previously observed conformational changes of JBP1 upon J-DNA binding. Our findings on the conservation of the N-terminus region and the new interface of JDBD, could be offering an explanation on the differences on how essential base-J is for different Trypanosomidaespecies. They also offer a first glimpse of how these domains synergize to provide new insights in the semi-conservative replication mechanisms of the base-J epigenetic marker in kinetoplastids.

    2. Point-by-point description of the revisions

    Reviewer #1____: ____Evidence, reproducibility and clarity

    Summary:

    The paper presents a combined experimental (X-ray, SAXS, mutational analysis) and computational (MD, docking, AlphaFold) work that elucidates the mechanism of JDBD:JDNA complex formation.

    Major comment:

    • How did the authors decide the timescale of the production run? Wouldn't the loop motion (which can be necessary for this study) occur on a timescale of 300+ ns?

    While our simulations showed overall stability of the simulated protein (as reflected by the RMSD time series), the RMSF provided clear indications for differences in flexibility in the loops and termini of JBP1. We believe that performing an MD simulation of 100 ns in duplo samples the flexibility and behavior of the JBP1 DNA binding domain (JDBD) sufficiently for obtaining templates for docking studies, which was the purpose of running the simulations. We emphasized this in the manuscript (page 10) by adding: “and obtain additional templates for further docking studies”.

    Minor comments:

    • Did I understand correctly that the LINCS algorithm constrained only hydrogen-involving bonds? It is not mentioned explicitly. Or am I missing something?

    LINCS was indeed used to only constrain hydrogen-involving bonds. We made this more explicit in the MD protocol described in the method section of the manuscript: *“The LINCS algorithm (40) was used to constrain *hydrogen-involving bond lengths to their zero-energy value”.

    • The authors should increase the resolution of figures S1 and S3. They look a bit blurry.

    We apologize for that; we tried to improve that by adjusting the figure sizes, but we are constrained by the output from e.g., the Bitclust software. We sincerely hope that the current resolution does not present an obstacle to the reader.

    Significance:

    The J DNA base is critical for transcription termination at the ends of the polycistronic gene clusters in Leishmania and related species. Hence, understanding the formation mechanism of the JDBD:J-DNA complex can provide an opportunity to develop novel chemotherapeutic treatments against these diseases. This work provides the first crystal structure of JDBD with the disordered loop and suggests that R448 and N455, as well as the N-terminus, are involved in the J-DNA binding process. The article is well-written and can interest readers from biological and biochemical societies. However, my field of expertise is computational chemistry and biochemistry; therefore, I cannot adequately evaluate the accuracy of the experimental techniques used in this work.

    We thank the reviewer, but would like to emphasize that our work goes well beyond the new interface of the JDBD, offering significant new insights on the synergy between the THD, the JDBD and the newly identified N-terminus binding to J-DNA, as also outlined in the general summary above.

    __Reviewer #2____:____ Evidence, reproducibility and clarity __

    The manuscript by de Vries et al. reported the crystal structure of the J-DNA binding domain of JBP1. Although, the structure was already solved, the new structure allows to observe a loop that was disordered in the previous structure. This structure was next used to propose models of DNA complex analyzed by MD. The proposed model was then validated by mutagenesis.

    Overall, the findings are interesting and the technical quality of the work is high.

    We believe, as we outlined in the summary, that our work goes well beyond showing the new structure and validate the new interface by mutagenesis. In our view, the major findings of the paper have to do with the AlphaFold modeling analysis and validation, and the finding that the N-terminus of JBP1 is involved in DNA binding, something that is not only new, but has also been totally unexpected.

    Comments:

    -For clarity, a figure showing the domain organization of JBP1 could help the reader in the introduction part.

    The domain architecture of JBP1 was added to Figure 1 as panel B.

    -In addition to Figure 2 showing the newly observed loop in 2Fo-Fc map, an omit map should be included in Supp data.

    A figure of the omit map was added to the supplemental information as Supplemental Figure S1.

    -Figure S6 legend should be more precise about the type of HDX MS analysis.

    As the HDX-MS data and methods were described in detail in previous work (Heidebrecht *et al. *2011), we left the details out in the current manuscript. The reference to this paper was added to Figure S6 for clarity.

    -The authors performed MD simulations. But what about DNA curvature upon complex formation?

    For the JDBD MD simulations, we did not add at all the (J-)DNA and the current simulations provide no information about its curvature. As mentioned in the discussion, we do expect conformational changes when the J-DNA:JBP1 complex forms, and this likely includes DNA curvature as well as conformational changes between the protein domains. We felt that the current data would not allow to extract new insights from such complicated simulations.

    -p.12 Some mutants were characterized, notably their melting temperatures. One mutant shows decreased stability while R448A shows increased stability. What is the structural explanation?

    Indeed, the E437A mutant (that showed lower expression compared to the other mutants) showed decreased thermal stability. The R448A shows an increase in stability (~3o C), and so does the H440 mutant (~2o C). While there is no specific structural explanation for these observations, in general mutation to alanine reduces the entropy-loss upon protein folding. The reason we comment about the stability is to point out that the dramatically decreased binding of the N455A and R448A mutants is not due to a decrease of the protein stability. This is now clarified in the manuscript: “the other mutants are as stable or slightly more stable compared to the wild-type, suggesting that the DNA-binding analysis is not affected significantly by altered protein stability.”

    -Figure 6E: The chi2 value for the comparison of the experimental curve for THD domain and calculated curve is very high indicating a poor fit. What is the explanation?

    The χ2 value indeed reflects differences between the JBP1 THD selected from the AlphaFold model and the structure used in the SAXS experiment. The experimental model is the so-called ΔDBD, which is the full length JBP1, where the JDBD is missing (it has been “spliced out”). Hence, the connecting loops and the N-terminus are present in this ΔDBD structure, whereas in the THD of the AlphaFold model, these parts of the structure were left out. In other words, while the full-length computational model refers to the exact same purified protein, the computationally truncated model and the purified protein for the experiments, have actual differences. Thus, the shape and fit of the experimental curve to the calculated curve can be considered pretty good. This is now clarified in the manuscript by adding: “The χ2 value of the fit is slightly elevated due to presence of the connecting loops between the THD and the JDBD and the N-terminus in the protein used for measuring the SAXS curve, which were removed from the computational model.”.

    Significance:

    The novelty of the manuscript mainly relies on the description of the crystal structure of JDBD protein without DNA and proposed models of DNA complex within full length protein, models that were validated by mutations or truncations. The current manuscript well suited for a specialized journal.

    These findings are indeed novel, especially the discovery and validation of the new interface of JDBD to J-DNA. We want to iterate that we are most excited by the totally unexpected and mechanistically important discovery of the role of the N-terminus of JBP1, that brings together legacy data and raises interest for additional structural studies.

    __Reviewer #3____: ____Evidence, reproducibility and clarity __

    Base J, also known as β-D-glucopyranosyloxymethyluracil, is a modified form of thymidine that has been identified in kinetoplastids and related organisms. It is worth noting that the distribution of Base J in the genome may vary depending on the organism and its life stage and influences its genome dynamics. The synthesis of this hypermodified nucleotide occurs in two steps, which involve the participation of two distinct thymidine hydroxylases, namely J-binding protein 1 and 2 (JBP1 and JBP2), along with a β-glucosyl transferase. In this study, the authors have presented a crystal structure of JBP1 J-DNA binding domain (J-DBD), which includes a previously reported disordered loop that might be involved in JBP1:J-DNA contact. Using this disordered structure as a starting point, the authors conducted Molecular Dynamics simulations and computational docking studies to propose models for the recognition of J-DNA by JBP1 J-DBD. The results from these studies were validated by punctual mutagenesis experiments, which provided additional data for docking and revealed a binding pattern for JBP1 J-DBD on J-DNA. By combining the crystallographic structure of the TET2 JBP1-homologue in complex with DNA and the AlphaFold model of full-length JBP1, the authors have hypothesized that the flexible N-terminus of JBP1 contributes to DNA-binding, which they have confirmed experimentally. However, according to the authors, to gain a deeper understanding of the unique molecular mechanism that underlies the replication of epigenetic information, an experimental determination of a high-resolution JBP1:J-DNA complex involving conformational changes would be necessary. Nevertheless, the present proposed objectives were fully contemplated by the authors.

    Major comments:

    In my opinion, the present article effectively achieved all the described objectives using appropriate and reproducible methodology, including protein expression and crystallization analysis, Molecular Dynamics analysis using GROMACS-2020.2 software, docking analysis, punctual mutations analyses, and modeling of JBP1:J-DNA complex using the AlphaFold tool. The authors presented the results in a logical and organized manner, making it easy for readers to extract the most important points. However, I believe that the section titled "Results and Discussion" contains more "results" than "discussion". While I understand that the literature on JBPs and base J is still in its early stages, other species of kinetoplastids have JBP1, in which mutations were not lethal as in L. tarentolae (e.g. T. brucei). Therefore, providing information about the structure of JBP1 and how the present results relate to what is known about JBP1 in other species in terms of structure and J-DNA interactions would significantly enrich the discussion of the findings and reinforce their significance and impact. Thus, the authors should have been clearer about the impact of their findings. When discussing the results, the authors should have answered questions such as how the identification of the new residues involved in JBP1 J-DNA binding impacts the current model of JBP1:J-DNA interactions, how this improved model contributes to the understanding of base J synthesis, and if the new model can be extrapolated to other species of kinetoplastids, according to the conservation of JBP1 among them.

    Although the article is more focused on protein research rather than parasite general molecular biology and medical studies, the findings may have implications for the development of new treatments for leishmaniases. Therefore, the authors should have discussed the potential of their new improved model as a target for lacking treatments of leishmaniases or at least brought up the point at conclusion section.

    We thank the reviewer for pointing out that the comparison between kinetoplastid species could be described more explicit to highlight the impact of the presented results with respect to the variety in JBP1 sequence, and possibly contribute to understanding the role of base-J in the differences in lethality and transcription regulation within these species. We now elaborate on our results in multiple places in the manuscript:

    • In the introduction we introduce the differences in lethality and transcription regulation within Leishmania and Trypanosoma (see also minor comment 2 below).
    • An alignment of full-length JBP1 sequences of different Trypanosomatidae species was added as Supplemental Figure S10 to support the discussion below.
    • The section describing the docking model of JDBD:J-DNA has been ammended with a description regarding the conservation of the residues involved in the binding interface: “The residues described in the JDBD:J-DNA interface are in general highly conserved (Supplemental Figure S12). Asp525 is fully conserved in Leishmania, Trypanosoma, Leptomonas and Bodo saltans species, so are Lys522, Arg532 and ArgR448. Asn455, which we identify in this study, is not conserved in Leptomonas, and Arg517 is not conserved in Trypanosoma also.
    • We renamed the final section to “Conclusions and Outlook” and added some discussion focusing on the conservation of the residues in the N-terminus and in the JDBD between different Trypanosomatidae Specifically, we now discuss how these could contribute in understanding the differences in lethality and transcription termination between Leishmania and *Trypanosoma *in the absence of base-J.

    Minor Comments:

    Please, re-check the affirmation "99% of base-J is found in telomeric repeats, mainly in GGGTTA repeats wherein the second thymine is modified to base-J (2-4)" in the Introduction. I believe that the distribution of base-J varies among different species of trypanosomatids and, therefore, cannot be generalized. Moreover, among different life stages in some organisms such as T. brucei and Leishmania major, differences on base-J distribution are found. The 99% of telomeric base-J mentioned would be a feature of Leishmania genus. Please, re-check the references 3 and 4.

    Indeed, the referee is right to mention that the 99% is a Leishmania-specific observation. We have modified the introduction accordingly. *“Base-J is found mainly in telomeric repeats and other **repetitive sequences. In **Leishmania **99% *of base-J is found in telomers, mainly in GGGTTA repeats, wherein the second thymine is modified to base-J (2–4).”

    Please, enrich the introduction topic with information about the model species, such as importance as pathological agent, its genomic organisation (core, subtelomeres, telomeres, what is present in subtelomeres, including base j) and polycistronic transcription and base J relevance on this aspect. That way, the reader will have a broad and more complete overview of the relevance of the present study.

    We have enriched the introduction with a paragraph (“Leishmania species are uni-cellular […] essentiality of base-J for the life circle of these parasites.”) outlining the issues raised by the referee. As suggested by the referee, this makes it easier to both understand the relevance of the present study and to enrich the discussion about our findings discussed earlier in this letter.

    Please, inform the expression vector for Leishmania tarentolae JBP1 used to express the mentioned protein on BL21(DE3)T1R.

    The expression vector for JDBD JBP1 used for the crystallization was mentioned in Heidebrecht et al. 2011, which we refer to in the text. For clarity we added the vector to the first sentence in the protein expression and purification section in the material and methods: “using the pETNKI-his-3C-JBP1-JDBD plasmid”.

    Please, supply the picture of the gel containing the extracted protein.

    The gel of the JDBD mutants and the JBP1 N-terminus truncations was added to the supplemental information as Figure S10.

    Significance:

    Overall, this study provides important insights into the JBP1 and DNA interactions, which were lacking in the literature. The use of techniques such as protein expression and crystallization analysis, molecular dynamics, and docking analysis is in line with the research objectives. However, the lack of some information about methodology needs to be addressed (minor comments 3 and 4). Personally, methodology such as molecular dynamics and docking analysis is not easy to critique but the results are clear and understandable.

    Although the authors should have been clearer about the impact of their findings, as addressed on my major comments, I believe that protein focused molecular parasitologists would benefit from the finds and methodology presented on this manuscript, since the article is more focused on protein research rather than parasite general molecular biology and medical studies, as mentioned on my major comments.

    In summary, this study provides new insights into JBP1 and DNA interactions and uses appropriate and reproducible techniques to achieve its objectives. However, the authors should provide more clarity on the impact of their findings and discuss the potential of their new improved model.

    My area of expertise: Cell and molecular biology; stem cells and factor controlling their fate; DNA, RNA, and molecular biology related techniques; Trypanosomatids telomere and telomerase

    We would like to thank the referee for his positive and constructive outlook, which we believe resulted in changes that put the impact of our findings in clearer perspective.

  2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

    Learn more at Review Commons


    Referee #3

    Evidence, reproducibility and clarity

    Base J, also known as β-D-glucopyranosyloxymethyluracil, is a modified form of thymidine that has been identified in kinetoplastids and related organisms. It is worth noting that the distribution of Base J in the genome may vary depending on the organism and its life stage and influences its genome dynamics. The synthesis of this hypermodified nucleotide occurs in two steps, which involve the participation of two distinct thymidine hydroxylases, namely J-binding protein 1 and 2 (JBP1 and JBP2), along with a β-glucosyl transferase. In this study, the authors have presented a crystal structure of JBP1 J-DNA binding domain (J-DBD), which includes a previously reported disordered loop that might be involved in JBP1:J-DNA contact. Using this disordered structure as a starting point, the authors conducted Molecular Dynamics simulations and computational docking studies to propose models for the recognition of J-DNA by JBP1 J-DBD. The results from these studies were validated by punctual mutagenesis experiments, which provided additional data for docking and revealed a binding pattern for JBP1 J-DBD on J-DNA. By combining the crystallographic structure of the TET2 JBP1-homologue in complex with DNA and the AlphaFold model of full-length JBP1, the authors have hypothesized that the flexible N-terminus of JBP1 contributes to DNA-binding, which they have confirmed experimentally. However, according to the authors, to gain a deeper understanding of the unique molecular mechanism that underlies the replication of epigenetic information, an experimental determination of a high-resolution JBP1:J-DNA complex involving conformational changes would be necessary. Nevertheless, the present proposed objectives were fully contemplated by the authors.

    Major comments:

    In my opinion, the present article effectively achieved all the described objectives using appropriate and reproducible methodology, including protein expression and crystallization analysis, Molecular Dynamics analysis using GROMACS-2020.2 software, docking analysis, punctual mutations analyses, and modeling of JBP1:J-DNA complex using the AlphaFold tool. The authors presented the results in a logical and organized manner, making it easy for readers to extract the most important points. However, I believe that the section titled "Results and Discussion" contains more "results" than "discussion". While I understand that the literature on JBPs and base J is still in its early stages, other species of kinetoplastids have JBP1, in which mutations were not lethal as in L. tarentolae (e.g. T. brucei). Therefore, providing information about the structure of JBP1 and how the present results relate to what is known about JBP1 in other species in terms of structure and J-DNA interactions would significantly enrich the discussion of the findings and reinforce their significance and impact. Thus, the authors should have been clearer about the impact of their findings. When discussing the results, the authors should have answered questions such as how the identification of the new residues involved in JBP1 J-DNA binding impacts the current model of JBP1:J-DNA interactions, how this improved model contributes to the understanding of base J synthesis, and if the new model can be extrapolated to other species of kinetoplastids, according to the conservation of JBP1 among them. Although the article is more focused on protein research rather than parasite general molecular biology and medical studies, the findings may have implications for the development of new treatments for leishmaniases. Therefore, the authors should have discussed the potential of their new improved model as a target for lacking treatments of leishmaniases or at least brought up the point at conclusion section.

    Minor Comments:

    I have some minor comments regarding the text:

    1. Please, re-check the affirmation "99% of base-J is found in telomeric repeats, mainly in GGGTTA repeats wherein the second thymine is modified to base-J (2-4)" in the Introduction. I believe that the distribution of base-J varies among different species of trypanosomatids and, therefore, cannot be generalized. Moreover, among different life stages in some organisms such as T. brucei and Leishmania major, differences on base-J distribution are found. The 99% of telomeric base-J mentioned would be a feature of Leishmania genus. Please, re-check the references 3 and 4.
    2. Please, enrich the introduction topic with information about the model species, such as importance as pathological agent, its genomic organisation (core, subtelomeres, telomeres, what is present in subtelomeres, including base j) and polycistronic transcription and base J relevance on this aspect. That way, the reader will have a broad and more complete overview of the relevance of the present study.
    3. Please, inform the expression vector for Leishmania tarentolae JBP1 used to express the mentioned protein on BL21(DE3)T1R.
    4. Please, supply the picture of the gel containing the extracted protein.

    Significance

    Overall, this study provides important insights into the JBP1 and DNA interactions, which were lacking in the literature. The use of techniques such as protein expression and crystallization analysis, molecular dynamics, and docking analysis is in line with the research objectives. However, the lack of some information about methodology needs to be addressed (minor comments 3 and 4). Personally, methodology such as molecular dynamics and docking analysis is not easy to critique but the results are clear and understandable.

    Although the authors should have been clearer about the impact of their findings, as addressed on my major comments, I believe that protein focused molecular parasitologists would benefit from the finds and methodology presented on this manuscript, since the article is more focused on protein research rather than parasite general molecular biology and medical studies, as mentioned on my major comments.

    In summary, this study provides new insights into JBP1 and DNA interactions and uses appropriate and reproducible techniques to achieve its objectives. However, the authors should provide more clarity on the impact of their findings and discuss the potential of their new improved model.

    My area of expertise: Cell and molecular biology; stem cells and factor controlling their fate; DNA, RNA, and molecular biology related techniques; Trypanosomatids telomere and telomerase

  3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

    Learn more at Review Commons


    Referee #2

    Evidence, reproducibility and clarity

    The manuscript by de Vries et al. reported the crystal structure of the J-DNA binding domain of JBP1. Although, the structure was already solved, the new structure allows to observe a loop that was disordered in the previous structure. This structure was next used to propose models of DNA complex analyzed by MD. The proposed model was then validated by mutagenesis.

    Overall, the findings are interesting and the technical quality of the work is high.

    Comments:

    • For clarity, a figure showing the domain organization of JBP1 could help the reader in the introduction part.
    • In addition to Figure 2 showing the newly observed loop in 2Fo-Fc map, an omit map should be included in Supp data.
    • Figure S6 legend should be more precise about the type of HDX MS analysis.
    • The authors performed MD simulations. But what about DNA curvature upon complex formation?
    • p.12 Some mutants were characterized, notably their melting temperatures. One mutant shows decreased stability while R448A shows increased stability. What is the structural explanation?
    • Figure 6E: The chi2 value for the comparison of the experimental curve for THD domain and calculated curve is very high indicating a poor fit. What is the explanation?

    Significance

    The novelty of the manuscript mainly relies on the description of the crystal structure of JDBD protein without DNA and proposed models of DNA complex within full length protein, models that were validated by mutations or truncations. The current manuscript well suited for a specialized journal.

  4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

    Learn more at Review Commons


    Referee #1

    Evidence, reproducibility and clarity

    Summary:

    The paper presents a combined experimental (X-ray, SAXS, mutational analysis) and computational (MD, docking, AlphaFold) work that elucidates the mechanism of JDBD:JDNA complex formation.

    Major comment:

    • How did the authors decide the timescale of the production run? Wouldn't the loop motion (which can be necessary for this study) occur on a timescale of 300+ ns?

    Minor comments:

    • Did I understand correctly that the LINCS algorithm constrained only hydrogen-involving bonds? It is not mentioned explicitly. Or am I missing something?
    • The authors should increase the resolution of figures S1 and S3. They look a bit blurry.

    Significance

    The J DNA base is critical for transcription termination at the ends of the polycistronic gene clusters in Leishmania and related species. Hence, understanding the formation mechanism of the JDBD:J-DNA complex can provide an opportunity to develop novel chemotherapeutic treatments against these diseases. This work provides the first crystal structure of JDBD with the disordered loop and suggests that R448 and N455, as well as the N-terminus, are involved in the J-DNA binding process. The article is well-written and can interest readers from biological and biochemical societies.

    However, my field of expertise is computational chemistry and biochemistry; therefore, I cannot adequately evaluate the accuracy of the experimental techniques used in this work.