A molecular grammar for environmental sensitivity in intrinsically disordered protein regions

This article has been Reviewed by the following groups

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Log in to save this article

Abstract

Cells must continuously sense and respond to changes in their physicochemical environment, yet the molecular mechanisms underlying this process remain incompletely understood. Intrinsically disordered protein regions (IDRs) constitute a substantial fraction of the eukaryotic proteome and participate in nearly every major cellular process. IDRs exist as dynamic conformational ensembles whose properties are strongly influenced by both amino acid sequence and the physicochemical environment. Although this intrinsic sensitivity has suggested that IDRs may contribute to environmental sensing of the cellular environment, the sequence-encoded principles that enable such sensing remain poorly understood. Here we identify a molecular grammar that captures key sequence features underlying variation in environmental ensemble sensitivity across diverse IDRs. Using 188 IDRs derived from organisms across all kingdoms of life, we systematically quantify changes in ensemble properties in response to hyperosmotic stress in living cells. We show that ensemble sensitivity is significantly associated with specific sequence features and predicted ensemble dimensions, with homogeneous patterning of oppositely charged residues emerging as a strong predictor of high sensitivity in charged IDRs. This sequence organization is necessary for high sensitivity in naturally-occurring IDRs and sufficient to confer high responsiveness in de novo designed low-complexity sequences. We further demonstrate that the presence of additional structured domains within IDR-containing proteins can substantially modulate ensemble sensitivity relative to isolated IDRs, revealing an additional layer of modulation at the multi-domain protein level. Applying the molecular grammar to the with-no-lysine (WNK) kinase crowding-sensing system, we find that regions within the disordered C-terminal domain – the domain necessary for crowding-induced phase separation and cell volume recovery – are predicted as hypersensitive, a property that is conserved across distant WNK homologues despite sequence divergence, and that is shared by heterologous IDRs capable of functionally replacing it. Together, our results establish environmental ensemble sensitivity as a sequence-encoded, context-dependent, and quantitatively tunable property of IDRs, provide a framework for understanding how physicochemical changes can be transduced in cells without dedicated receptor architectures as exemplified by the WNK crowding-sensing system, and offer design principles for engineering synthetic protein-based environmental sensors.

Article activity feed

  1. Whether distinct environmental perturbations engage overlapping or orthogonal sequence grammars within the disordered proteome remains an open question that requires systematic investigation.

    The collection of yeast strains you've built is a great resource for exploring how more environmental factors influence FRET behavior. It will be interesting to compare future datasets to what you've found here and start to characterize IDRs as sensitive to specific stresses vs. generally sensitive to environmental perturbations.

  2. From this library, 188 IDRs were robustly expressed in living yeast cells

    Were there any interesting shared features of the IDRs that didn't express well, or was it seemingly random?